1
|
Ghildiyal K, Nayak SS, Rajawat D, Sharma A, Chhotaray S, Bhushan B, Dutt T, Panigrahi M. Genomic insights into the conservation of wild and domestic animal diversity: A review. Gene 2023; 886:147719. [PMID: 37597708 DOI: 10.1016/j.gene.2023.147719] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 06/20/2023] [Accepted: 08/16/2023] [Indexed: 08/21/2023]
Abstract
Due to environmental change and anthropogenic activities, global biodiversity has suffered an unprecedented loss, and the world is now heading toward the sixth mass extinction event. This urges the need to step up our efforts to promote the sustainable use of animal genetic resources and plan effective strategies for their conservation. Although habitat preservation and restoration are the primary means of conserving biodiversity, genomic technologies offer a variety of novel tools for identifying biodiversity hotspots and thus, support conservation efforts. Conservation genomics is a broad area of science that encompasses the application of genomic data from thousands or tens of thousands of genome-wide markers to address important conservation biology concerns. Genomic approaches have revolutionized the way we understand and manage animal populations, providing tools to identify and preserve unique genetic variants and alleles responsible for adaptive genetic variation, reducing the deleterious consequences of inbreeding, and increasing the adaptive potential of threatened species. The advancement of genomic technologies, particularly comparative genomic approaches, and the increased accessibility of genomic resources in the form of genome-enabled taxa for non-model organisms, provides a distinct advantage in defining conservation units over traditional genetics approaches. The objective of this review is to provide an exhaustive overview of the concept of conservation genomics, discuss the rationale behind the transition from conservation genetics to genomic approaches, and emphasize the potential applications of genomic techniques for conservation purposes. We also highlight interesting case studies in both livestock and wildlife species where genomic techniques have been used to accomplish conservation goals. Finally, we address some challenges and future perspectives in this field.
Collapse
Affiliation(s)
- Kanika Ghildiyal
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Sonali Sonejita Nayak
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Divya Rajawat
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Anurodh Sharma
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Supriya Chhotaray
- Animal Genetics and Breeding Division, ICAR-National Dairy Research Institute, Karnal, Haryana, India
| | - Bharat Bhushan
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Triveni Dutt
- Livestock Production and Management Section, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India
| | - Manjit Panigrahi
- Division of Animal Genetics, Indian Veterinary Research Institute, Izatnagar, Bareilly 243122, UP, India.
| |
Collapse
|
2
|
Bettridge JM, Snow LC, Tang Y, Petrovska L, Lawes J, Smith RP. Using SNP addresses for Salmonella Typhimurium DT104 in routine veterinary outbreak detection. Epidemiol Infect 2023; 151:e187. [PMID: 37876041 PMCID: PMC10644063 DOI: 10.1017/s0950268823001723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/27/2023] [Revised: 09/12/2023] [Accepted: 09/17/2023] [Indexed: 10/26/2023] Open
Abstract
SNP addresses are a pathogen typing method based on whole-genome sequences (WGSs), assigning groups at seven different levels of genetic similarity. Public health surveillance uses it for several gastro-intestinal infections; this work trialled its use in veterinary surveillance for salmonella outbreak detection. Comparisons were made between temporal and spatio-temporal cluster detection models that either defined cases by their SNP address or by phage type, using historical data sets. Clusters of SNP incidents were effectively detected by both methods, but spatio-temporal models consistently detected these clusters earlier than the corresponding temporal models. Unlike phage type, SNP addresses appeared spatially and temporally limited, which facilitated the differentiation of novel, stable, or expanding clusters in spatio-temporal models. Furthermore, these models flagged spatio-temporal clusters containing only two to three cases at first detection, compared with a median of seven cases in phage-type models. The large number of SNP addresses will require automated methods to implement these detection models routinely. Further work is required to explore how temporal changes and different host species may impact the sensitivity and specificity of cluster detection. In conclusion, given validation with more sequencing data, SNP addresses are likely to be a valuable addition to early warning systems in veterinary surveillance.
Collapse
Affiliation(s)
- J. M. Bettridge
- Department of Epidemiological Sciences, Animal and Plant Health Agency, Weybridge, UK
- Natural Resources Institute, University of Greenwich, Chatham, UK
| | - L. C. Snow
- Department of Epidemiological Sciences, Animal and Plant Health Agency, Weybridge, UK
| | - Y. Tang
- Department of Bacteriology, Animal and Plant Health Agency, Weybridge, UK
| | - L. Petrovska
- Department of Bacteriology, Animal and Plant Health Agency, Weybridge, UK
| | - J. Lawes
- Department of Epidemiological Sciences, Animal and Plant Health Agency, Weybridge, UK
| | - R. P. Smith
- Department of Epidemiological Sciences, Animal and Plant Health Agency, Weybridge, UK
| |
Collapse
|
3
|
Beichman AC, Robinson J, Lin M, Moreno-Estrada A, Nigenda-Morales S, Harris K. Evolution of the Mutation Spectrum Across a Mammalian Phylogeny. Mol Biol Evol 2023; 40:msad213. [PMID: 37770035 PMCID: PMC10566577 DOI: 10.1093/molbev/msad213] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2023] [Revised: 08/21/2023] [Accepted: 09/19/2023] [Indexed: 10/03/2023] Open
Abstract
Although evolutionary biologists have long theorized that variation in DNA repair efficacy might explain some of the diversity of lifespan and cancer incidence across species, we have little data on the variability of normal germline mutagenesis outside of humans. Here, we shed light on the spectrum and etiology of mutagenesis across mammals by quantifying mutational sequence context biases using polymorphism data from thirteen species of mice, apes, bears, wolves, and cetaceans. After normalizing the mutation spectrum for reference genome accessibility and k-mer content, we use the Mantel test to deduce that mutation spectrum divergence is highly correlated with genetic divergence between species, whereas life history traits like reproductive age are weaker predictors of mutation spectrum divergence. Potential bioinformatic confounders are only weakly related to a small set of mutation spectrum features. We find that clock-like mutational signatures previously inferred from human cancers cannot explain the phylogenetic signal exhibited by the mammalian mutation spectrum, despite the ability of these signatures to fit each species' 3-mer spectrum with high cosine similarity. In contrast, parental aging signatures inferred from human de novo mutation data appear to explain much of the 1-mer spectrum's phylogenetic signal in combination with a novel mutational signature. We posit that future models purporting to explain the etiology of mammalian mutagenesis need to capture the fact that more closely related species have more similar mutation spectra; a model that fits each marginal spectrum with high cosine similarity is not guaranteed to capture this hierarchy of mutation spectrum variation among species.
Collapse
Affiliation(s)
- Annabel C Beichman
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | - Jacqueline Robinson
- Institute for Human Genetics, University of California, San Francisco, CA, USA
| | - Meixi Lin
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA, USA
| | - Andrés Moreno-Estrada
- National Laboratory of Genomics for Biodiversity, Advanced Genomics Unit (UGA-LANGEBIO), CINVESTAV, Irapuato, Mexico
| | - Sergio Nigenda-Morales
- Department of Biological Sciences, California State University, San Marcos, San Marcos, CA, USA
| | - Kelley Harris
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Herbold Computational Biology Program, Fred Hutchinson Cancer Center, Seattle, WA, USA
| |
Collapse
|
4
|
Beichman AC, Robinson J, Lin M, Moreno-Estrada A, Nigenda-Morales S, Harris K. "Evolution of the mutation spectrum across a mammalian phylogeny". BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.05.31.543114. [PMID: 37398383 PMCID: PMC10312511 DOI: 10.1101/2023.05.31.543114] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2023]
Abstract
Little is known about how the spectrum and etiology of germline mutagenesis might vary among mammalian species. To shed light on this mystery, we quantify variation in mutational sequence context biases using polymorphism data from thirteen species of mice, apes, bears, wolves, and cetaceans. After normalizing the mutation spectrum for reference genome accessibility and k -mer content, we use the Mantel test to deduce that mutation spectrum divergence is highly correlated with genetic divergence between species, whereas life history traits like reproductive age are weaker predictors of mutation spectrum divergence. Potential bioinformatic confounders are only weakly related to a small set of mutation spectrum features. We find that clocklike mutational signatures previously inferred from human cancers cannot explain the phylogenetic signal exhibited by the mammalian mutation spectrum, despite the ability of these clocklike signatures to fit each species' 3-mer spectrum with high cosine similarity. In contrast, parental aging signatures inferred from human de novo mutation data appear to explain much of the mutation spectrum's phylogenetic signal when fit to non-context-dependent mutation spectrum data in combination with a novel mutational signature. We posit that future models purporting to explain the etiology of mammalian mutagenesis need to capture the fact that more closely related species have more similar mutation spectra; a model that fits each marginal spectrum with high cosine similarity is not guaranteed to capture this hierarchy of mutation spectrum variation among species.
Collapse
Affiliation(s)
| | - Jacqueline Robinson
- Institute for Human Genetics, University of California, San Francisco, San Francisco, CA
| | - Meixi Lin
- Department of Plant Biology, Carnegie Institution for Science, Stanford, CA
| | - Andrés Moreno-Estrada
- National Laboratory of Genomics for Biodiversity, Advanced Genomics Unit (UGA-LANGEBIO), CINVESTAV, Irapuato, Mexico
| | - Sergio Nigenda-Morales
- Department of Biological Sciences, California State University, San Marcos, San Marcos CA
| | - Kelley Harris
- Department of Genome Sciences, University of Washington, Seattle WA
| |
Collapse
|
5
|
Raheem DC, Gower DJ, Breugelmans K, Ranawana KB, Backeljau T. The systematics and evolution of the Sri Lankan rainforest land snail Corilla: New insights from RADseq-based phylogenetics. Mol Phylogenet Evol 2023; 182:107731. [PMID: 36781030 DOI: 10.1016/j.ympev.2023.107731] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2022] [Revised: 01/20/2023] [Accepted: 02/09/2023] [Indexed: 02/13/2023]
Abstract
The stylommatophoran land-snail genus Corilla is endemic to Sri Lanka and India's Western Ghats. On the basis of habitat distribution and shell morphology, the 10 extant Sri Lankan species fall into two distinct groups, lowland and montane. Here, we use phylogenetic analyses of restriction-site-associated DNA sequencing (RADseq) data and ancestral-state reconstructions of habitat association and shell morphology to clarify the systematics and evolution of Sri Lankan Corilla. Our dataset consists of 9 species of Corilla. Phylogenetic analyses were based on 88 assemblies (9,604-4,132,850 bp) generated by the RADseq assembler ipyrad, using four parameter combinations and different levels of missing data. Trees were inferred using a maximum likelihood (ML) approach. Ancestral states were reconstructed using maximum parsimony (MP) and ML approaches, with 1 binary state character analysed for habitat association (lowland vs montane) and 6 binary state characters analysed for shell morphology (shape, colour, lip width, length of upper palatal folds, orientation of upper palatal folds and collabral sculpture). Over a wide range of missing data (40-87 % missing individuals per locus) and assembly sizes (62,279-4,132,850 bp), nearly all trees conformed to one of two topologies (A and B), most relationships were strongly supported and total branch support approached the maximal value. Apart from the position of Corilla odontophora 'south', topologies A and B showed similar, well-resolved relationships at and above the species level. Our study agrees with the shell-based taxonomy of C. adamsi, C. beddomeae, C. carabinata, C. colletti and C. humberti (all maximally supported as monophyletic species). It shows that C. erronea and C. fryae constitute a single relatively widespread species (for which the valid name is C. erronea) and that the names C. gudei and C. odontophora each apply to at least two distinct, yet conchologically-cryptic species. The MP and ML ancestral-state reconstructions yielded broadly similar results and provide firm evidence that diversification in Sri Lankan Corilla has involved evolutionary convergence in the shell morphology of lowland lineages, with a pale shell and wide lip having evolved on at least two separate occasions (in C. carabinata and C. colletti) from montane ancestors having a dark, narrow-lipped shell.
Collapse
Affiliation(s)
- Dinarzarde C Raheem
- Department of Biological Sciences, Faculty of Applied Sciences, Rajarata University of Sri Lanka, Mihintale 50300, Sri Lanka; Department of Life Sciences, Natural History Museum, London SW7 5BD, UK.
| | - David J Gower
- Department of Life Sciences, Natural History Museum, London SW7 5BD, UK
| | - Karin Breugelmans
- Royal Belgian Institute of Natural Sciences, Vautierstraat 29, B-1000 Brussels, Belgium
| | - Kithsiri B Ranawana
- Department of Zoology, Faculty of Science, University of Peradeniya, Peradeniya, Sri Lanka
| | - Thierry Backeljau
- Royal Belgian Institute of Natural Sciences, Vautierstraat 29, B-1000 Brussels, Belgium; Evolutionary Ecology Group, University of Antwerp, Universiteitsplein 1, B-2610 Antwerp, Belgium
| |
Collapse
|
6
|
Oomen RA, Hutchings JA. Genomic reaction norms inform predictions of plastic and adaptive responses to climate change. J Anim Ecol 2022; 91:1073-1087. [PMID: 35445402 PMCID: PMC9325537 DOI: 10.1111/1365-2656.13707] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Accepted: 04/05/2022] [Indexed: 12/11/2022]
Abstract
Genomic reaction norms represent the range of gene expression phenotypes (usually mRNA transcript levels) expressed by a genotype along an environmental gradient. Reaction norms derived from common‐garden experiments are powerful approaches for disentangling plastic and adaptive responses to environmental change in natural populations. By treating gene expression as a phenotype in itself, genomic reaction norms represent invaluable tools for exploring causal mechanisms underlying organismal responses to climate change across multiple levels of biodiversity. Our goal is to provide the context, framework and motivation for applying genomic reaction norms to study the responses of natural populations to climate change. Here, we describe the utility of integrating genomics with common‐garden‐gradient experiments under a reaction norm analytical framework to answer fundamental questions about phenotypic plasticity, local adaptation, their interaction (i.e. genetic variation in plasticity) and future adaptive potential. An experimental and analytical framework for constructing and analysing genomic reaction norms is presented within the context of polygenic climate change responses of structured populations with gene flow. Intended for a broad eco‐evo readership, we first briefly review adaptation with gene flow and the importance of understanding the genomic basis and spatial scale of adaptation for conservation and management of structured populations under anthropogenic change. Then, within a high‐dimensional reaction norm framework, we illustrate how to distinguish plastic, differentially expressed (difference in reaction norm intercepts) and differentially plastic (difference in reaction norm slopes) genes, highlighting the areas of opportunity for applying these concepts. We conclude by discussing how genomic reaction norms can be incorporated into a holistic framework to understand the eco‐evolutionary dynamics of climate change responses from molecules to ecosystems. We aim to inspire researchers to integrate gene expression measurements into common‐garden experimental designs to investigate the genomics of climate change responses as sequencing costs become increasingly accessible.
Collapse
Affiliation(s)
- Rebekah A Oomen
- Centre for Ecological and Evolutionary Synthesis (CEES), Department of Biosciences, University of Oslo, Oslo, Norway.,Centre for Coastal Research (CCR), University of Agder, Kristiansand, Norway
| | - Jeffrey A Hutchings
- Centre for Coastal Research (CCR), University of Agder, Kristiansand, Norway.,Department of Biology, Dalhousie University, Halifax, Nova Scotia, Canada.,Institute of Marine Research, Flødevigen Marine Research Station, His, Norway
| |
Collapse
|
7
|
Chaguza C, Ebruke C, Senghore M, Lo SW, Tientcheu PE, Gladstone RA, Tonkin-Hill G, Cornick JE, Yang M, Worwui A, McGee L, Breiman RF, Klugman KP, Kadioglu A, Everett DB, Mackenzie G, Croucher NJ, Roca A, Kwambana-Adams BA, Antonio M, Bentley SD. Comparative Genomics of Disease and Carriage Serotype 1 Pneumococci. Genome Biol Evol 2022; 14:evac052. [PMID: 35439297 PMCID: PMC9048925 DOI: 10.1093/gbe/evac052] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/12/2022] [Indexed: 11/14/2022] Open
Abstract
The isolation of Streptococcus pneumoniae serotypes in systemic tissues of patients with invasive disease versus the nasopharynx of healthy individuals with asymptomatic carriage varies widely. Some serotypes are hyper-invasive, particularly serotype 1, but the underlying genetics remain poorly understood due to the rarity of carriage isolates, reducing the power of comparison with invasive isolates. Here, we use a well-controlled genome-wide association study to search for genetic variation associated with invasiveness of serotype 1 pneumococci from a serotype 1 endemic setting in Africa. We found no consensus evidence that certain genomic variation is overrepresented among isolates from patients with invasive disease than asymptomatic carriage. Overall, the genomic variation explained negligible phenotypic variability, suggesting a minimal effect on the disease status. Furthermore, changes in lineage distribution were seen with lineages replacing each other over time, highlighting the importance of continued pathogen surveillance. Our findings suggest that the hyper-invasiveness is an intrinsic property of the serotype 1 strains, not specific for a "disease-associated" subpopulation disproportionately harboring unique genomic variation.
Collapse
Affiliation(s)
- Chrispin Chaguza
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Darwin College, University of Cambridge, Silver Street, Cambridge, UK
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Chinelo Ebruke
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
| | - Madikay Senghore
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- Department of Epidemiology, Center for Communicable Disease Dynamics, Harvard T.H. Chan School of Public Health, Boston, MA, USA
| | - Stephanie W. Lo
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Peggy-Estelle Tientcheu
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
| | - Rebecca A. Gladstone
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
- Department of Biostatistics, University of Oslo, Oslo, Norway
| | - Gerry Tonkin-Hill
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| | - Jennifer E. Cornick
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
- Malawi-Liverpool-Wellcome Trust Clinical Research Programme, Blantyre, Malawi
| | - Marie Yang
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Archibald Worwui
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
| | - Lesley McGee
- Respiratory Diseases Branch, Centers for Disease Control and Prevention, Atlanta, GA, USA
| | - Robert F. Breiman
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Keith P. Klugman
- Hubert Department of Global Health, Rollins School of Public Health, Emory University, Atlanta, GA, USA
| | - Aras Kadioglu
- Department of Clinical Infection, Microbiology and Immunology, Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Dean B. Everett
- College of Medicine and Health Sciences, Khalifa University, Abu Dhabi, UAE
| | - Grant Mackenzie
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- Murdoch Children’s Research Institute, Parkville, Melbourne, VIC, Australia
- London School of Hygiene & Tropical Medicine, London, UK
| | - Nicholas J. Croucher
- MRC Centre for Global Infectious Disease Analysis, Department of Infectious Disease Epidemiology, School of Public Health, Imperial College London, London, UK
| | - Anna Roca
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- London School of Hygiene & Tropical Medicine, London, UK
| | - Brenda A. Kwambana-Adams
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- NIHR Global Health Research Unit on Mucosal Pathogens, Division of Infection and Immunity, University College London, London, UK
| | - Martin Antonio
- Medical Research Council (MRC) Unit The Gambia at the London School of Hygiene and Tropical Medicine, Fajara, The Gambia
- London School of Hygiene & Tropical Medicine, London, UK
- Warwick Medical School, University of Warwick, Coventry, UK
| | - Stephen D. Bentley
- Parasites and Microbes Programme, Wellcome Sanger Institute, Wellcome Genome Campus, Cambridge, UK
| |
Collapse
|
8
|
Rayne A, Blair S, Dale M, Flack B, Hollows J, Moraga R, Parata RN, Rupene M, Tamati‐Elliffe P, Wehi PM, Wylie MJ, Steeves TE. Weaving place‐based knowledge for culturally significant species in the age of genomics: Looking to the past to navigate the future. Evol Appl 2022; 15:751-772. [PMID: 35603033 PMCID: PMC9108313 DOI: 10.1111/eva.13367] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2021] [Revised: 02/16/2022] [Accepted: 02/24/2022] [Indexed: 11/28/2022] Open
Abstract
Relationships with place provide critical context for characterizing biocultural diversity. Yet, genetic and genomic studies are rarely informed by Indigenous or local knowledge, processes, and practices, including the movement of culturally significant species. Here, we show how place‐based knowledge can better reveal the biocultural complexities of genetic or genomic data derived from culturally significant species. As a case study, we focus on culturally significant southern freshwater kōura (crayfish) in Aotearoa me Te Waipounamu (New Zealand, herein Aotearoa NZ). Our results, based on genotyping‐by‐sequencing markers, reveal strong population genetic structure along with signatures of population admixture in 19 genetically depauperate populations across the east coast of Te Waipounamu. Environment association and differentiation analyses for local adaptation also indicate a role for hydroclimatic variables—including temperature, precipitation, and water flow regimes—in shaping local adaptation in kōura. Through trusted partnerships between community and researchers, weaving genomic markers with place‐based knowledge has both provided invaluable context for the interpretation of data and created opportunities to reconnect people and place. We envisage such trusted partnerships guiding future genomic research for culturally significant species in Aotearoa NZ and beyond.
Collapse
Affiliation(s)
- Aisling Rayne
- University of Canterbury School of Biological Sciences Christchurch New Zealand
| | | | - Matthew Dale
- Waterscape Connections Ltd Dunedin New Zealand
- Te Rūnanga o Ngāi Tahu Dunedin New Zealand
| | - Brendan Flack
- Kāti Huirapa Rūnaka ki Puketeraki Karitane New Zealand
| | | | - Roger Moraga
- Tea Break Bioinformatics Ltd Palmerston North New Zealand
| | | | - Makarini Rupene
- University of Canterbury Ngāi Tahu Research Centre Christchurch New Zealand
- Environment Canterbury Christchurch New Zealand
| | | | - Priscilla M Wehi
- University of Otago, Centre for Sustainability Dunedin New Zealand
| | - Matthew J Wylie
- Kāti Huirapa Rūnaka ki Puketeraki Karitane New Zealand
- The New Zealand Institute for Plant and Food Research Limited Nelson New Zealand
| | - Tammy E Steeves
- University of Canterbury School of Biological Sciences Christchurch New Zealand
| |
Collapse
|
9
|
Ba R, Geffard E, Douillard V, Simon F, Mesnard L, Vince N, Gourraud PA, Limou S. Surfing the Big Data Wave: Omics Data Challenges in Transplantation. Transplantation 2022; 106:e114-e125. [PMID: 34889882 DOI: 10.1097/tp.0000000000003992] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
Abstract
In both research and care, patients, caregivers, and researchers are facing a leap forward in the quantity of data that are available for analysis and interpretation, marking the daunting "big data era." In the biomedical field, this quantitative shift refers mostly to the -omics that permit measuring and analyzing biological features of the same type as a whole. Omics studies have greatly impacted transplantation research and highlighted their potential to better understand transplant outcomes. Some studies have emphasized the contribution of omics in developing personalized therapies to avoid graft loss. However, integrating omics data remains challenging in terms of analytical processes. These data come from multiple sources. Consequently, they may contain biases and systematic errors that can be mistaken for relevant biological information. Normalization methods and batch effects have been developed to tackle issues related to data quality and homogeneity. In addition, imputation methods handle data missingness. Importantly, the transplantation field represents a unique analytical context as the biological statistical unit is the donor-recipient pair, which brings additional complexity to the omics analyses. Strategies such as combined risk scores between 2 genomes taking into account genetic ancestry are emerging to better understand graft mechanisms and refine biological interpretations. The future omics will be based on integrative biology, considering the analysis of the system as a whole and no longer the study of a single characteristic. In this review, we summarize omics studies advances in transplantation and address the most challenging analytical issues regarding these approaches.
Collapse
Affiliation(s)
- Rokhaya Ba
- Université de Nantes, Centre Hospitalier Universitaire Nantes, Institute of Health and Medical Research, Centre de Recherche en Transplantation et Immunologie, UMR 1064, Institut de Transplantation Urologie-Néphrologie, Nantes, France
- Département Informatique et Mathématiques, Ecole Centrale de Nantes, Nantes, France
| | - Estelle Geffard
- Université de Nantes, Centre Hospitalier Universitaire Nantes, Institute of Health and Medical Research, Centre de Recherche en Transplantation et Immunologie, UMR 1064, Institut de Transplantation Urologie-Néphrologie, Nantes, France
| | - Venceslas Douillard
- Université de Nantes, Centre Hospitalier Universitaire Nantes, Institute of Health and Medical Research, Centre de Recherche en Transplantation et Immunologie, UMR 1064, Institut de Transplantation Urologie-Néphrologie, Nantes, France
| | - Françoise Simon
- Université de Nantes, Centre Hospitalier Universitaire Nantes, Institute of Health and Medical Research, Centre de Recherche en Transplantation et Immunologie, UMR 1064, Institut de Transplantation Urologie-Néphrologie, Nantes, France
- Mount Sinai School of Medicine, New York, NY
| | - Laurent Mesnard
- Urgences Néphrologiques et Transplantation Rénale, Hôpital Tenon, Assistance Publique-Hôpitaux de Paris, Paris, France
- Sorbonne Université, Paris, France
| | - Nicolas Vince
- Université de Nantes, Centre Hospitalier Universitaire Nantes, Institute of Health and Medical Research, Centre de Recherche en Transplantation et Immunologie, UMR 1064, Institut de Transplantation Urologie-Néphrologie, Nantes, France
| | - Pierre-Antoine Gourraud
- Université de Nantes, Centre Hospitalier Universitaire Nantes, Institute of Health and Medical Research, Centre de Recherche en Transplantation et Immunologie, UMR 1064, Institut de Transplantation Urologie-Néphrologie, Nantes, France
| | - Sophie Limou
- Université de Nantes, Centre Hospitalier Universitaire Nantes, Institute of Health and Medical Research, Centre de Recherche en Transplantation et Immunologie, UMR 1064, Institut de Transplantation Urologie-Néphrologie, Nantes, France
- Département Informatique et Mathématiques, Ecole Centrale de Nantes, Nantes, France
| |
Collapse
|
10
|
Leigh DM, Peranić K, Prospero S, Cornejo C, Ćurković-Perica M, Kupper Q, Nuskern L, Rigling D, Ježić M. Long-read sequencing reveals the evolutionary drivers of intra-host diversity across natural RNA mycovirus infections. Virus Evol 2021; 7:veab101. [PMID: 35299787 PMCID: PMC8923234 DOI: 10.1093/ve/veab101] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/10/2021] [Revised: 11/23/2021] [Accepted: 12/01/2021] [Indexed: 01/05/2023] Open
Abstract
Intra-host dynamics are a core component of virus evolution but most intra-host data come from a narrow range of hosts or experimental infections. Gaining broader information on the intra-host diversity and dynamics of naturally occurring virus infections is essential to our understanding of evolution across the virosphere. Here we used PacBio long-read HiFi sequencing to characterize the intra-host populations of natural infections of the RNA mycovirus Cryphonectria hypovirus 1 (CHV1). CHV1 is a biocontrol agent for the chestnut blight fungus (Cryphonectria parasitica), which co-invaded Europe alongside the fungus. We characterized the mutational and haplotypic intra-host virus diversity of thirty-eight natural CHV1 infections spread across four locations in Croatia and Switzerland. Intra-host CHV1 diversity values were shaped by purifying selection and accumulation of mutations over time as well as epistatic interactions within the host genome at defense loci. Geographical landscape features impacted CHV1 inter-host relationships through restricting dispersal and causing founder effects. Interestingly, a small number of intra-host viral haplotypes showed high sequence similarity across large geographical distances unlikely to be linked by dispersal.
Collapse
Affiliation(s)
- Deborah M Leigh
- Phytopathology, Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Birmensdorf CH-8903, Switzerland
| | - Karla Peranić
- Faculty of Science, University of Zagreb, Zagreb, Grad Zagreb 10000, Croatia
| | - Simone Prospero
- Phytopathology, Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Birmensdorf CH-8903, Switzerland
| | - Carolina Cornejo
- Phytopathology, Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Birmensdorf CH-8903, Switzerland
| | | | | | - Lucija Nuskern
- Faculty of Science, University of Zagreb, Zagreb, Grad Zagreb 10000, Croatia
| | - Daniel Rigling
- Phytopathology, Swiss Federal Institute for Forest, Snow and Landscape Research WSL, Birmensdorf CH-8903, Switzerland
| | - Marin Ježić
- Faculty of Science, University of Zagreb, Zagreb, Grad Zagreb 10000, Croatia
| |
Collapse
|
11
|
Lou RN, Therkildsen NO. Batch effects in population genomic studies with low-coverage whole genome sequencing data: Causes, detection and mitigation. Mol Ecol Resour 2021; 22:1678-1692. [PMID: 34825778 DOI: 10.1111/1755-0998.13559] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2021] [Revised: 11/05/2021] [Accepted: 11/11/2021] [Indexed: 01/04/2023]
Abstract
Over the past few decades, there has been an explosion in the amount of publicly available sequencing data. This opens new opportunities for combining data sets to achieve unprecedented sample sizes, spatial coverage or temporal replication in population genomic studies. However, a common concern is that nonbiological differences between data sets may generate patterns of variation in the data that can confound real biological patterns, a problem known as batch effects. In this paper, we compare two batches of low-coverage whole genome sequencing (lcWGS) data generated from the same populations of Atlantic cod (Gadus morhua). First, we show that with a "batch-effect-naive" bioinformatic pipeline, batch effects systematically biased our genetic diversity estimates, population structure inference and selection scans. We then demonstrate that these batch effects resulted from multiple technical differences between our data sets, including the sequencing chemistry (four-channel vs. two-channel), sequencing run, read type (single-end vs. paired-end), read length (125 vs. 150 bp), DNA degradation level (degraded vs. well preserved) and sequencing depth (0.8× vs. 0.3× on average). Lastly, we illustrate that a set of simple bioinformatic strategies (such as different read trimming and single nucleotide polymorphism filtering) can be used to detect batch effects in our data and substantially mitigate their impact. We conclude that combining data sets remains a powerful approach as long as batch effects are explicitly accounted for. We focus on lcWGS data in this paper, which may be particularly vulnerable to certain causes of batch effects, but many of our conclusions also apply to other sequencing strategies.
Collapse
|
12
|
Abstract
The rapidly emerging field of macrogenetics focuses on analysing publicly accessible genetic datasets from thousands of species to explore large-scale patterns and predictors of intraspecific genetic variation. Facilitated by advances in evolutionary biology, technology, data infrastructure, statistics and open science, macrogenetics addresses core evolutionary hypotheses (such as disentangling environmental and life-history effects on genetic variation) with a global focus. Yet, there are important, often overlooked, limitations to this approach and best practices need to be considered and adopted if macrogenetics is to continue its exciting trajectory and reach its full potential in fields such as biodiversity monitoring and conservation. Here, we review the history of this rapidly growing field, highlight knowledge gaps and future directions, and provide guidelines for further research.
Collapse
|
13
|
Smith BT, Mauck WM, Benz BW, Andersen MJ. Uneven Missing Data Skew Phylogenomic Relationships within the Lories and Lorikeets. Genome Biol Evol 2021; 12:1131-1147. [PMID: 32470111 PMCID: PMC7486955 DOI: 10.1093/gbe/evaa113] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 05/26/2020] [Indexed: 01/21/2023] Open
Abstract
The resolution of the Tree of Life has accelerated with advances in DNA sequencing technology. To achieve dense taxon sampling, it is often necessary to obtain DNA from historical museum specimens to supplement modern genetic samples. However, DNA from historical material is generally degraded, which presents various challenges. In this study, we evaluated how the coverage at variant sites and missing data among historical and modern samples impacts phylogenomic inference. We explored these patterns in the brush-tongued parrots (lories and lorikeets) of Australasia by sampling ultraconserved elements in 105 taxa. Trees estimated with low coverage characters had several clades where relationships appeared to be influenced by whether the sample came from historical or modern specimens, which were not observed when more stringent filtering was applied. To assess if the topologies were affected by missing data, we performed an outlier analysis of sites and loci, and a data reduction approach where we excluded sites based on data completeness. Depending on the outlier test, 0.15% of total sites or 38% of loci were driving the topological differences among trees, and at these sites, historical samples had 10.9× more missing data than modern ones. In contrast, 70% data completeness was necessary to avoid spurious relationships. Predictive modeling found that outlier analysis scores were correlated with parsimony informative sites in the clades whose topologies changed the most by filtering. After accounting for biased loci and understanding the stability of relationships, we inferred a more robust phylogenetic hypothesis for lories and lorikeets.
Collapse
Affiliation(s)
- Brian Tilston Smith
- Department of Ornithology, American Museum of Natural History, New York, New York
| | - William M Mauck
- Department of Ornithology, American Museum of Natural History, New York, New York.,New York Genome Center, New York, New York
| | - Brett W Benz
- Museum of Zoology and Department of Ecology and Evolutionary Biology, University of Michigan
| | - Michael J Andersen
- Department of Biology and Museum of Southwestern Biology, University of New Mexico
| |
Collapse
|
14
|
Kessler C, Brambilla A, Waldvogel D, Camenisch G, Biebach I, Leigh DM, Grossen C, Croll D. A robust sequencing assay of a thousand amplicons for the high-throughput population monitoring of Alpine ibex immunogenetics. Mol Ecol Resour 2021; 22:66-85. [PMID: 34152681 PMCID: PMC9292246 DOI: 10.1111/1755-0998.13452] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2020] [Revised: 06/10/2021] [Accepted: 06/14/2021] [Indexed: 11/27/2022]
Abstract
Polymorphism for immune functions can explain significant variation in health and reproductive success within species. Drastic loss in genetic diversity at such loci constitutes an extinction risk and should be monitored in species of conservation concern. However, effective implementations of genome‐wide immune polymorphism sets into high‐throughput genotyping assays are scarce. Here, we report the design and validation of a microfluidics‐based amplicon sequencing assay to comprehensively capture genetic variation in Alpine ibex (Capra ibex). This species represents one of the most successful large mammal restorations recovering from a severely depressed census size and a massive loss in diversity at the major histocompatibility complex (MHC). We analysed 65 whole‐genome sequencing sets of the Alpine ibex and related species to select the most representative markers and to prevent primer binding failures. In total, we designed ~1,000 amplicons densely covering the MHC, further immunity‐related genes as well as randomly selected genome‐wide markers for the assessment of neutral population structure. Our analysis of 158 individuals shows that the genome‐wide markers perform equally well at resolving population structure as RAD‐sequencing or low‐coverage genome sequencing data sets. Immunity‐related loci show unexpectedly high degrees of genetic differentiation within the species. Such information can now be used to define highly targeted individual translocations. Our design strategy can be realistically implemented into genetic surveys of a large range of species. In conclusion, leveraging whole‐genome sequencing data sets to design targeted amplicon assays allows the simultaneous monitoring of multiple genetic risk factors and can be translated into species conservation recommendations.
Collapse
Affiliation(s)
- Camille Kessler
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| | - Alice Brambilla
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Zürich, Switzerland.,Alpine Wildlife Research Center, Gran Paradiso National Park, Italy
| | - Dominique Waldvogel
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Zürich, Switzerland
| | - Glauco Camenisch
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Zürich, Switzerland
| | - Iris Biebach
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Zürich, Switzerland
| | - Deborah M Leigh
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Zürich, Switzerland.,WSL Swiss Federal Research Institute, Birmensdorf, Switzerland
| | - Christine Grossen
- Department of Evolutionary Biology and Environmental Studies, University of Zürich, Zürich, Switzerland
| | - Daniel Croll
- Laboratory of Evolutionary Genetics, Institute of Biology, University of Neuchâtel, Neuchâtel, Switzerland
| |
Collapse
|
15
|
Leigh DM, Lischer HEL, Guillaume F, Grossen C, Günther T. Disentangling adaptation from drift in bottlenecked and reintroduced populations of Alpine ibex. Mol Ecol Resour 2021; 21:2350-2363. [PMID: 34097819 PMCID: PMC8518545 DOI: 10.1111/1755-0998.13442] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 05/27/2021] [Accepted: 06/01/2021] [Indexed: 01/25/2023]
Abstract
Identifying local adaptation in bottlenecked species is essential for conservation management. Selection detection methods have an important role in species management plans, assessments of adaptive capacity, and looking for responses to climate change. Yet, the allele frequency changes exploited in selection detection methods are similar to those caused by the strong neutral genetic drift expected during a bottleneck. Consequently, it is often unclear what accuracy selection detection methods have across bottlenecked populations. In this study, simulations were used to explore if signals of selection could be confidently distinguished from genetic drift across 23 bottlenecked and reintroduced populations of Alpine ibex (Capra ibex). The meticulously recorded demographic history of the Alpine ibex was used to generate comprehensive simulated SNP data. The simulated SNPs were then used to benchmark the confidence we could place in outliers identified in empirical Alpine ibex RADseq derived SNP data. Within the simulated data set, the false positive rates were high for all selection detection methods (FST outlier scans and Genetic‐Environment Association analyses) but fell substantially when two or more methods were combined. True positive rates were consistently low and became negligible with increased stringency. Despite finding many outlier loci in the empirical Alpine ibex SNPs, none could be distinguished from genetic drift‐driven false positives. Unfortunately, the low true positive rate also prevents the exclusion of recent local adaptation within the Alpine ibex. The baselines and stringent approach outlined here should be applied to other bottlenecked species to ensure the risk of false positive, or negative, signals of selection are accounted for in conservation management plans.
Collapse
Affiliation(s)
- Deborah M Leigh
- WSL Swiss Federal Research Institute, Birmensdorf, Switzerland
| | - Heidi E L Lischer
- Interfaculty Bioinformatics Unit, University of Bern, Bern, Switzerland.,Swiss Institute of Bioinformatics, Lausanne, Switzerland
| | - Frédéric Guillaume
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | - Christine Grossen
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Zurich, Switzerland
| | - Torsten Günther
- Human Evolution, Department of Organismal Biology, Evolutionary Biology Centre, Uppsala University, Uppsala, Sweden
| |
Collapse
|
16
|
De-Kayne R, Frei D, Greenway R, Mendes SL, Retel C, Feulner PGD. Sequencing platform shifts provide opportunities but pose challenges for combining genomic data sets. Mol Ecol Resour 2020; 21:653-660. [PMID: 33314612 DOI: 10.1111/1755-0998.13309] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 11/23/2020] [Accepted: 12/07/2020] [Indexed: 12/31/2022]
Abstract
Technological advances in DNA sequencing over the last decade now permit the production and curation of large genomic data sets in an increasing number of nonmodel species. Additionally, these new data provide the opportunity for combining data sets, resulting in larger studies with a broader taxonomic range. Whilst the development of new sequencing platforms has been beneficial, resulting in a higher throughput of data at a lower per-base cost, shifts in sequencing technology can also pose challenges for those wishing to combine new sequencing data with data sequenced on older platforms. Here, we outline the types of studies where the use of curated data might be beneficial, and highlight potential biases that might be introduced by combining data from different sequencing platforms. As an example of the challenges associated with combining data across sequencing platforms, we focus on the impact of the shift in Illumina's base calling technology from a four-channel system to a two-channel system. We caution that when data are combined from these two systems, erroneous guanine base calls that result from the two-channel chemistry can make their way through a bioinformatic pipeline, eventually leading to inaccurate and potentially misleading conclusions. We also suggest solutions for dealing with such potential artefacts, which make samples sequenced on different sequencing platforms appear more differentiated from one another than they really are. Finally, we stress the importance of archiving tissue samples and the associated sequences for the continued reproducibility and reusability of sequencing data in the face of ever-changing sequencing platform technology.
Collapse
Affiliation(s)
- Rishi De-Kayne
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland.,Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | - David Frei
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland.,Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| | - Ryan Greenway
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland
| | - Sofia L Mendes
- Centre for Ecology, Evolution and Environmental Changes, Faculdade de Ciências, Universidade de Lisboa, Lisbon, Portugal
| | - Cas Retel
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland
| | - Philine G D Feulner
- Department of Fish Ecology and Evolution, Centre of Ecology, Evolution and Biogeochemistry, EAWAG Swiss Federal Institute of Aquatic Science and Technology, Kastanienbaum, Switzerland.,Division of Aquatic Ecology and Evolution, Institute of Ecology and Evolution, University of Bern, Bern, Switzerland
| |
Collapse
|
17
|
Reid BN, Moran RL, Kopack CJ, Fitzpatrick SW. Rapture-ready darters: Choice of reference genome and genotyping method (whole-genome or sequence capture) influence population genomic inference in Etheostoma. Mol Ecol Resour 2020; 21:404-420. [PMID: 33058399 DOI: 10.1111/1755-0998.13275] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2020] [Revised: 09/17/2020] [Accepted: 10/01/2020] [Indexed: 11/26/2022]
Abstract
Researchers studying nonmodel organisms have an increasing number of methods available for generating genomic data. However, the applicability of different methods across species, as well as the effect of reference genome choice on population genomic inference, remain difficult to predict in many cases. We evaluated the impact of data type (whole-genome vs. reduced representation) and reference genome choice on data quality and on population genomic and phylogenomic inference across several species of darters (subfamily Etheostomatinae), a highly diverse radiation of freshwater fish. We generated a high-quality reference genome and developed a hybrid RADseq/sequence capture (Rapture) protocol for the Arkansas darter (Etheostoma cragini). Rapture data from 1,900 individuals spanning four darter species showed recovery of most loci across darter species at high depth and consistent estimates of heterozygosity regardless of reference genome choice. Loci with baits spanning both sides of the restriction enzyme cut site performed especially well across species. For low-coverage whole-genome data, choice of reference genome affected read depth and inferred heterozygosity. For similar amounts of sequence data, Rapture performed better at identifying fine-scale genetic structure compared to whole-genome sequencing. Rapture loci also recovered an accurate phylogeny for the study species and demonstrated high phylogenetic informativeness across the evolutionary history of the genus Etheostoma. Low cost and high cross-species effectiveness regardless of reference genome suggest that Rapture and similar sequence capture methods may be worthwhile choices for studies of diverse species radiations.
Collapse
Affiliation(s)
- Brendan N Reid
- Kellogg Biological Station, Michigan State University, Hickory Corners, MI, USA
| | - Rachel L Moran
- Department of Evolution, Ecology, and Behavior, University of Minnesota, Saint Paul, MN, USA
| | | | - Sarah W Fitzpatrick
- Kellogg Biological Station, Michigan State University, Hickory Corners, MI, USA.,Department of Integrative Biology, Michigan State University, East Lansing, MI, USA
| |
Collapse
|
18
|
Wright AD, Garrison NL, Williams AS, Johnson PD, Whelan NV. Range reduction of Oblong Rocksnail, Leptoxis compacta, shapes riverscape genetic patterns. PeerJ 2020; 8:e9789. [PMID: 32953269 PMCID: PMC7473045 DOI: 10.7717/peerj.9789] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/13/2020] [Accepted: 07/31/2020] [Indexed: 12/21/2022] Open
Abstract
Many freshwater gastropod species face extinction, including 79% of species in the family Pleuroceridae. The Oblong Rocksnail, Leptoxis compacta, is a narrow range endemic pleurocerid from the Cahaba River basin in central Alabama that has seen rapid range contraction in the last 100 years. Such a decline is expected to negatively affect genetic diversity in the species. However, precise patterns of genetic variation and gene flow across the restricted range of L. compacta are unknown. This lack of information limits our understanding of human impacts on the Cahaba River system and Pleuroceridae. Here, we show that L. compacta has likely seen a species-wide decline in genetic diversity, but remaining populations have relatively high genetic diversity. We also report a contemporary range extension compared to the last published survey. Our findings indicate that historical range contraction has resulted in the absence of common genetic patterns seen in many riverine taxa like isolation by distance as the small distribution of L. compacta allows for relatively unrestricted gene flow across its remaining range despite limited dispersal abilities. Two collection sites had higher genetic diversity than others, and broodstock sites for future captive propagation and reintroduction efforts should utilize sites identified here as having the highest genetic diversity. Broadly, our results support the hypothesis that range contraction will result in the reduction of species-wide genetic diversity, and common riverscape genetic patterns cannot be assumed to be present in species facing extinction risk.
Collapse
Affiliation(s)
| | - Nicole L Garrison
- School of Fisheries, Aquaculture, and Aquatic Sciences, College of Agriculture, Auburn University, Auburn, AL, United States of America.,Department of Natural Resources and Mathematics, West Liberty University, West Liberty, WV, United States of America
| | - Ashantye' S Williams
- Southeast Conservation Genetics Lab, Warm Springs Fish Technology Center, United States Fish and Wildlife Service, Auburn, AL, United States of America
| | - Paul D Johnson
- Alabama Aquatic Biodiversity Center, Alabama Department of Conservation and Natural Resources, Marion, AL, United States of America
| | - Nathan V Whelan
- School of Fisheries, Aquaculture, and Aquatic Sciences, College of Agriculture, Auburn University, Auburn, AL, United States of America.,Southeast Conservation Genetics Lab, Warm Springs Fish Technology Center, United States Fish and Wildlife Service, Auburn, AL, United States of America
| |
Collapse
|
19
|
Wilder AP, Palumbi SR, Conover DO, Therkildsen NO. Footprints of local adaptation span hundreds of linked genes in the Atlantic silverside genome. Evol Lett 2020; 4:430-443. [PMID: 33014419 PMCID: PMC7523562 DOI: 10.1002/evl3.189] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2019] [Revised: 05/09/2020] [Accepted: 07/13/2020] [Indexed: 11/09/2022] Open
Abstract
The study of local adaptation in the presence of ongoing gene flow is the study of natural selection in action, revealing the functional genetic diversity most relevant to contemporary pressures. In addition to individual genes, genome‐wide architecture can itself evolve to enable adaptation. Distributed across a steep thermal gradient along the east coast of North America, Atlantic silversides (Menidia menidia) exhibit an extraordinary degree of local adaptation in a suite of traits, and the capacity for rapid adaptation from standing genetic variation, but we know little about the patterns of genomic variation across the species range that enable this remarkable adaptability. Here, we use low‐coverage, whole‐transcriptome sequencing of Atlantic silversides sampled along an environmental cline to show marked signatures of divergent selection across a gradient of neutral differentiation. Atlantic silversides sampled across 1371 km of the southern section of its distribution have very low genome‐wide differentiation (median FST = 0.006 across 1.9 million variants), consistent with historical connectivity and observations of recent migrants. Yet almost 14,000 single nucleotide polymorphisms (SNPs) are nearly fixed (FST > 0.95) for alternate alleles. Highly differentiated SNPs cluster into four tight linkage disequilibrium (LD) blocks that span hundreds of genes and several megabases. Variants in these LD blocks are disproportionately nonsynonymous and concentrated in genes enriched for multiple functions related to known adaptations in silversides, including variation in lipid storage, metabolic rate, and spawning behavior. Elevated levels of absolute divergence and demographic modeling suggest selection maintaining divergence across these blocks under gene flow. These findings represent an extreme case of heterogeneity in levels of differentiation across the genome, and highlight how gene flow shapes genomic architecture in continuous populations. Locally adapted alleles may be common features of populations distributed along environmental gradients, and will likely be key to conserving variation to enable future responses to environmental change.
Collapse
Affiliation(s)
- Aryn P Wilder
- Department of Natural Resources Cornell University Ithaca New York 14853.,Current address: San Diego Zoo Institute for Conservation Research Escondido California 92027
| | - Stephen R Palumbi
- Department of Biology, Hopkins Marine Station Stanford University Pacific Grove California 93950
| | - David O Conover
- Department of Biology University of Oregon Eugene Oregon 97403
| | | |
Collapse
|
20
|
Günther T, Nettelblad C. The presence and impact of reference bias on population genomic studies of prehistoric human populations. PLoS Genet 2019; 15:e1008302. [PMID: 31348818 PMCID: PMC6685638 DOI: 10.1371/journal.pgen.1008302] [Citation(s) in RCA: 103] [Impact Index Per Article: 20.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2019] [Revised: 08/07/2019] [Accepted: 07/10/2019] [Indexed: 11/18/2022] Open
Abstract
Haploid high quality reference genomes are an important resource in genomic research projects. A consequence is that DNA fragments carrying the reference allele will be more likely to map successfully, or receive higher quality scores. This reference bias can have effects on downstream population genomic analysis when heterozygous sites are falsely considered homozygous for the reference allele. In palaeogenomic studies of human populations, mapping against the human reference genome is used to identify endogenous human sequences. Ancient DNA studies usually operate with low sequencing coverages and fragmentation of DNA molecules causes a large proportion of the sequenced fragments to be shorter than 50 bp-reducing the amount of accepted mismatches, and increasing the probability of multiple matching sites in the genome. These ancient DNA specific properties are potentially exacerbating the impact of reference bias on downstream analyses, especially since most studies of ancient human populations use pseudo-haploid data, i.e. they randomly sample only one sequencing read per site. We show that reference bias is pervasive in published ancient DNA sequence data of prehistoric humans with some differences between individual genomic regions. We illustrate that the strength of reference bias is negatively correlated with fragment length. Most genomic regions we investigated show little to no mapping bias but even a small proportion of sites with bias can impact analyses of those particular loci or slightly skew genome-wide estimates. Therefore, reference bias has the potential to cause minor but significant differences in the results of downstream analyses such as population allele sharing, heterozygosity estimates and estimates of archaic ancestry. These spurious results highlight how important it is to be aware of these technical artifacts and that we need strategies to mitigate the effect. Therefore, we suggest some post-mapping filtering strategies to resolve reference bias which help to reduce its impact substantially.
Collapse
Affiliation(s)
- Torsten Günther
- Human Evolution, Department of Organismal Biology, Uppsala University, Uppsala, Sweden
| | - Carl Nettelblad
- Division of Scientific Computing, Department of Information Technology, Science for Life Laboratory, Uppsala University, Uppsala, Sweden
| |
Collapse
|
21
|
Reference Genomes from Distantly Related Species Can Be Used for Discovery of Single Nucleotide Polymorphisms to Inform Conservation Management. Genes (Basel) 2018; 10:genes10010009. [PMID: 30583569 PMCID: PMC6356778 DOI: 10.3390/genes10010009] [Citation(s) in RCA: 36] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/08/2018] [Revised: 12/16/2018] [Accepted: 12/19/2018] [Indexed: 11/21/2022] Open
Abstract
Threatened species recovery programmes benefit from incorporating genomic data into conservation management strategies to enhance species recovery. However, a lack of readily available genomic resources, including conspecific reference genomes, often limits the inclusion of genomic data. Here, we investigate the utility of closely related high-quality reference genomes for single nucleotide polymorphism (SNP) discovery using the critically endangered kakī/black stilt (Himantopus novaezelandiae) and four Charadriiform reference genomes as proof of concept. We compare diversity estimates (i.e., nucleotide diversity, individual heterozygosity, and relatedness) based on kakī SNPs discovered from genotyping-by-sequencing and whole genome resequencing reads mapped to conordinal (killdeer, Charadrius vociferus), confamilial (pied avocet, Recurvirostra avosetta), congeneric (pied stilt, Himantopus himantopus) and conspecific reference genomes. Results indicate that diversity estimates calculated from SNPs discovered using closely related reference genomes correlate significantly with estimates calculated from SNPs discovered using a conspecific genome. Congeneric and confamilial references provide higher correlations and more similar measures of nucleotide diversity, individual heterozygosity, and relatedness. While conspecific genomes may be necessary to address other questions in conservation, SNP discovery using high-quality reference genomes of closely related species is a cost-effective approach for estimating diversity measures in threatened species.
Collapse
|
22
|
Santure AW, Garant D. Wild GWAS-association mapping in natural populations. Mol Ecol Resour 2018; 18:729-738. [PMID: 29782705 DOI: 10.1111/1755-0998.12901] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2018] [Revised: 05/15/2018] [Accepted: 05/16/2018] [Indexed: 12/27/2022]
Abstract
The increasing affordability of sequencing and genotyping technologies has transformed the field of molecular ecology in recent decades. By correlating marker variants with trait variation using association analysis, large-scale genotyping and phenotyping of individuals from wild populations has enabled the identification of genomic regions that contribute to phenotypic differences among individuals. Such "gene mapping" studies are enabling us to better predict evolutionary potential and the ability of populations to adapt to challenges, such as changing environment. These studies are also allowing us to gain insight into the evolutionary processes maintaining variation in natural populations, to better understand genotype-by-environment and epistatic interactions and to track the dynamics of allele frequency change at loci contributing to traits under selection. Gene mapping in the wild using genomewide association scans (GWAS) do, however, come with a number of methodological challenges, not least the population structure in space and time inherent to natural populations. We here provide an overview of these challenges, summarize the exciting methodological advances and applications of association mapping in natural populations reported in this special issue and provide some guidelines for future "wild GWAS" research.
Collapse
Affiliation(s)
- Anna W Santure
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Dany Garant
- Département de Biologie, Faculté des Sciences, Université de Sherbrooke, Sherbrooke, Québec, Canada
| |
Collapse
|