1
|
Mirchandani CD, Shultz AJ, Thomas GWC, Smith SJ, Baylis M, Arnold B, Corbett-Detig R, Enbody E, Sackton TB. A Fast, Reproducible, High-throughput Variant Calling Workflow for Population Genomics. Mol Biol Evol 2024; 41:msad270. [PMID: 38069903 PMCID: PMC10764099 DOI: 10.1093/molbev/msad270] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Revised: 10/27/2023] [Accepted: 11/22/2023] [Indexed: 01/05/2024] Open
Abstract
The increasing availability of genomic resequencing data sets and high-quality reference genomes across the tree of life present exciting opportunities for comparative population genomic studies. However, substantial challenges prevent the simple reuse of data across different studies and species, arising from variability in variant calling pipelines, data quality, and the need for computationally intensive reanalysis. Here, we present snpArcher, a flexible and highly efficient workflow designed for the analysis of genomic resequencing data in nonmodel organisms. snpArcher provides a standardized variant calling pipeline and includes modules for variant quality control, data visualization, variant filtering, and other downstream analyses. Implemented in Snakemake, snpArcher is user-friendly, reproducible, and designed to be compatible with high-performance computing clusters and cloud environments. To demonstrate the flexibility of this pipeline, we applied snpArcher to 26 public resequencing data sets from nonmammalian vertebrates. These variant data sets are hosted publicly to enable future comparative population genomic analyses. With its extensibility and the availability of public data sets, snpArcher will contribute to a broader understanding of genetic variation across species by facilitating the rapid use and reuse of large genomic data sets.
Collapse
Affiliation(s)
- Cade D Mirchandani
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Allison J Shultz
- Ornithology Department, Natural History Museum of Los Angeles County, Los Angeles, CA 90007, USA
| | | | - Sara J Smith
- Informatics Group, Harvard University, Cambridge, MA, USA
- Biology, Mount Royal University, Calgary, AB T3E 6K6, Canada
| | - Mara Baylis
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Brian Arnold
- Department of Ecology and Evolutionary Biology, Princeton University, Princeton, NJ, USA
- Center for Statistics and Machine Learning, Princeton University, Princeton, NJ, USA
| | - Russ Corbett-Detig
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Erik Enbody
- Department of Biomolecular Engineering, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | | |
Collapse
|
2
|
Wilder AP, Supple MA, Subramanian A, Mudide A, Swofford R, Serres-Armero A, Steiner C, Koepfli KP, Genereux DP, Karlsson EK, Lindblad-Toh K, Marques-Bonet T, Munoz Fuentes V, Foley K, Meyer WK, Ryder OA, Shapiro B, Andrews G, Armstrong JC, Bianchi M, Birren BW, Bredemeyer KR, Breit AM, Christmas MJ, Clawson H, Damas J, Di Palma F, Diekhans M, Dong MX, Eizirik E, Fan K, Fanter C, Foley NM, Forsberg-Nilsson K, Garcia CJ, Gatesy J, Gazal S, Genereux DP, Goodman L, Grimshaw J, Halsey MK, Harris AJ, Hickey G, Hiller M, Hindle AG, Hubley RM, Hughes GM, Johnson J, Juan D, Kaplow IM, Karlsson EK, Keough KC, Kirilenko B, Koepfli KP, Korstian JM, Kowalczyk A, Kozyrev SV, Lawler AJ, Lawless C, Lehmann T, Levesque DL, Lewin HA, Li X, Lind A, Lindblad-Toh K, Mackay-Smith A, Marinescu VD, Marques-Bonet T, Mason VC, Meadows JRS, Meyer WK, Moore JE, Moreira LR, Moreno-Santillan DD, Morrill KM, Muntané G, Murphy WJ, Navarro A, Nweeia M, Ortmann S, Osmanski A, Paten B, Paulat NS, Pfenning AR, Phan BN, Pollard KS, Pratt HE, Ray DA, Reilly SK, Rosen JR, Ruf I, Ryan L, Ryder OA, Sabeti PC, Schäffer DE, Serres A, Shapiro B, Smit AFA, Springer M, Srinivasan C, Steiner C, Storer JM, Sullivan KAM, Sullivan PF, Sundström E, Supple MA, Swofford R, Talbot JE, Teeling E, Turner-Maier J, Valenzuela A, Wagner F, Wallerman O, Wang C, Wang J, Weng Z, Wilder AP, Wirthlin ME, Xue JR, Zhang X. The contribution of historical processes to contemporary extinction risk in placental mammals. Science 2023; 380:eabn5856. [PMID: 37104572 PMCID: PMC10184782 DOI: 10.1126/science.abn5856] [Citation(s) in RCA: 8] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
Species persistence can be influenced by the amount, type, and distribution of diversity across the genome, suggesting a potential relationship between historical demography and resilience. In this study, we surveyed genetic variation across single genomes of 240 mammals that compose the Zoonomia alignment to evaluate how historical effective population size (Ne) affects heterozygosity and deleterious genetic load and how these factors may contribute to extinction risk. We find that species with smaller historical Ne carry a proportionally larger burden of deleterious alleles owing to long-term accumulation and fixation of genetic load and have a higher risk of extinction. This suggests that historical demography can inform contemporary resilience. Models that included genomic data were predictive of species' conservation status, suggesting that, in the absence of adequate census or ecological data, genomic information may provide an initial risk assessment.
Collapse
Affiliation(s)
- Aryn P Wilder
- Conservation Genetics, San Diego Zoo Wildlife Alliance, Escondido, CA 92027, USA
| | - Megan A Supple
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA 95064, USA
- Howard Hughes Medical Institute, University of California, Santa Cruz, CA 95064, USA
| | | | | | - Ross Swofford
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
| | - Aitor Serres-Armero
- Institute of Evolutionary Biology, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona 08003, Spain
| | - Cynthia Steiner
- Conservation Genetics, San Diego Zoo Wildlife Alliance, Escondido, CA 92027, USA
| | - Klaus-Peter Koepfli
- Smithsonian-Mason School of Conservation, George Mason University, Front Royal, VA 22630, USA
- Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC 30008, USA
- Computer Technologies Laboratory, ITMO University, St. Petersburg 197101, Russia
| | | | - Elinor K Karlsson
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
- Program in Bioinformatics and Integrative Biology, University of Massachusetts Medical School, Worcester, MA 01605, USA
| | - Kerstin Lindblad-Toh
- Broad Institute of MIT and Harvard, Cambridge, MA 02139, USA
- Science for Life Laboratory, Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala 751 32, Sweden
| | - Tomas Marques-Bonet
- Institute of Evolutionary Biology, Department of Experimental and Health Sciences, Universitat Pompeu Fabra, Barcelona 08003, Spain
- Catalan Institution of Research and Advanced Studies, Barcelona 08010, Spain
- Centre for Genomic Regulation, Barcelona Institute of Science and Technology, Barcelona 08028, Spain
- Institut Català de Paleontologia Miquel Crusafont, Universitat Autònoma de Barcelona, Barcelona 08193, Spain
| | - Violeta Munoz Fuentes
- European Molecular Biology Laboratory-European Bioinformatics Institute, Wellcome Genome Campus, Hinxton CB10 1SD, UK
| | - Kathleen Foley
- College of Law, University of Iowa, Iowa City, IA 52242, USA
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015, USA
| | - Wynn K Meyer
- Department of Biological Sciences, Lehigh University, Bethlehem, PA 18015, USA
| | - Oliver A Ryder
- Conservation Genetics, San Diego Zoo Wildlife Alliance, Escondido, CA 92027, USA
- Department of Evolution, Behavior and Ecology, Division of Biology, University of California, San Diego, La Jolla, CA 92039, USA
| | - Beth Shapiro
- Department of Ecology and Evolutionary Biology, University of California, Santa Cruz, CA 95064, USA
- Howard Hughes Medical Institute, University of California, Santa Cruz, CA 95064, USA
| | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
3
|
Huang K, Ostevik KL, Elphinstone C, Todesco M, Bercovich N, Owens GL, Rieseberg LH. Mutation load in sunflower inversions is negatively correlated with inversion heterozygosity. Mol Biol Evol 2022; 39:6583099. [PMID: 35535689 PMCID: PMC9127631 DOI: 10.1093/molbev/msac101] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/12/2022] Open
Abstract
Recombination is critical both for accelerating adaptation and purging deleterious mutations. Chromosomal inversions can act as recombination modifiers that suppress local recombination in heterozygotes and thus, under some conditions, are predicted to accumulate such mutations. In this study, we investigated patterns of recombination, transposable element abundance and coding sequence evolution across the genomes of 1,445 individuals from three sunflower species, as well as within nine inversions segregating within species. We also analyzed the effects of inversion genotypes on 87 phenotypic traits to test for overdominance. We found significant negative correlations of long terminal repeat retrotransposon abundance and deleterious mutations with recombination rates across the genome in all three species. However, we failed to detect an increase in these features in the inversions, except for a modest increase in the proportion of stop codon mutations in several very large or rare inversions. Consistent with this finding, there was little evidence of overdominance of inversions in phenotypes that may relate to fitness. On the other hand, significantly greater load was observed for inversions in populations polymorphic for a given inversion compared to populations monomorphic for one of the arrangements, suggesting that the local state of inversion polymorphism affects deleterious load. These seemingly contradictory results can be explained by the low frequency of inversion heterozygotes in wild sunflower populations, apparently due to divergent selection and associated geographic structure. Inversions contributing to local adaptation represent ideal recombination modifiers, acting to facilitate adaptive divergence with gene flow, while largely escaping the accumulation of deleterious mutations.
Collapse
Affiliation(s)
- Kaichi Huang
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - Kate L Ostevik
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada.,Department of Evolution, Ecology, and Organismal Biology, University of California, Riverside, CA, USA
| | - Cassandra Elphinstone
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - Marco Todesco
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - Natalia Bercovich
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| | - Gregory L Owens
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada.,Department of Biology, University of Victoria, Victoria, BC, Canada
| | - Loren H Rieseberg
- Department of Botany and Biodiversity Research Centre, University of British Columbia, Vancouver, BC, Canada
| |
Collapse
|
4
|
Kakioka R, Mori S, Kokita T, Hosoki TK, Nagano AJ, Ishikawa A, Kume M, Toyoda A, Kitano J. Multiple waves of freshwater colonization of the three-spined stickleback in the Japanese Archipelago. BMC Evol Biol 2020; 20:143. [PMID: 33143638 PMCID: PMC7641863 DOI: 10.1186/s12862-020-01713-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2020] [Accepted: 10/27/2020] [Indexed: 12/03/2022] Open
Abstract
Background The three-spined stickleback (Gasterosteus aculeatus) is a remarkable system to study the genetic mechanisms underlying parallel evolution during the transition from marine to freshwater habitats. Although the majority of previous studies on the parallel evolution of sticklebacks have mainly focused on postglacial freshwater populations in the Pacific Northwest of North America and northern Europe, we recently use Japanese stickleback populations for investigating shared and unique features of adaptation and speciation between geographically distant populations. However, we currently lack a comprehensive phylogeny of the Japanese three-spined sticklebacks, despite the fact that a good phylogeny is essential for any evolutionary and ecological studies. Here, we conducted a phylogenomic analysis of the three-spined stickleback in the Japanese Archipelago. Results We found that freshwater colonization occurred in multiple waves, each of which may reflect different interglacial isolations. Some of the oldest freshwater populations from the central regions of the mainland of Japan (hariyo populations) were estimated to colonize freshwater approximately 170,000 years ago. The next wave of colonization likely occurred approximately 100,000 years ago. The inferred origins of several human-introduced populations showed that introduction occurred mainly from nearby habitats. We also found a new habitat of the three-spined stickleback sympatric with the Japan Sea stickleback (Gasterosteus nipponicus). Conclusions These Japanese stickleback systems differ from those in the Pacific Northwest of North America and northern Europe in terms of divergence time and history. Stickleback populations in the Japanese Archipelago offer valuable opportunities to study diverse evolutionary processes in historical and contemporary timescales.
Collapse
Affiliation(s)
- Ryo Kakioka
- Ecological Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan.,Tropical Biosphere Research Center, University of the Ryukyus, Nishihara, Nakagami-gun, Okinawa, 903-0213, Japan
| | - Seiichi Mori
- Biological Laboratory, Gifu Kyoritsu University, Ogaki, Gifu, 503-8550, Japan
| | - Tomoyuki Kokita
- Department of Marine Bioscience, Fukui Prefectural University, Obama, Fukui, 917-0003, Japan
| | - Takuya K Hosoki
- Ecological Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan.,Department of Genetics, The Graduate University for Advanced Studies, Mishima, Shizuoka, 411-8540, Japan
| | - Atsushi J Nagano
- Faculty of Agriculture, Ryukoku University, Otsu, Shiga, 520-2194, Japan
| | - Asano Ishikawa
- Ecological Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan.,Department of Genetics, The Graduate University for Advanced Studies, Mishima, Shizuoka, 411-8540, Japan
| | - Manabu Kume
- Field Science Education and Research Center, Kyoto University, Kyoto, 606-8502, Japan
| | - Atsushi Toyoda
- Comparative Genomics Laboratory, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan
| | - Jun Kitano
- Ecological Genetics Laboratory, National Institute of Genetics, Mishima, Shizuoka, 411-8540, Japan. .,Department of Genetics, The Graduate University for Advanced Studies, Mishima, Shizuoka, 411-8540, Japan.
| |
Collapse
|