1
|
Subedi S, Sumida TS, Park YP. A scalable approach to topic modelling in single-cell data by approximate pseudobulk projection. Life Sci Alliance 2024; 7:e202402713. [PMID: 39107066 PMCID: PMC11303850 DOI: 10.26508/lsa.202402713] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2024] [Revised: 07/29/2024] [Accepted: 07/30/2024] [Indexed: 08/09/2024] Open
Abstract
Probabilistic topic modelling has become essential in many types of single-cell data analysis. Based on probabilistic topic assignments in each cell, we identify the latent representation of cellular states. A dictionary matrix, consisting of topic-specific gene frequency vectors, provides interpretable bases to be compared with known cell type-specific marker genes and other pathway annotations. However, fitting a topic model on a large number of cells would require heavy computational resources-specialized computing units, computing time and memory. Here, we present a scalable approximation method customized for single-cell RNA-seq data analysis, termed ASAP, short for Annotating a Single-cell data matrix by Approximate Pseudobulk estimation. Our approach is more accurate than existing methods but requires orders of magnitude less computing time, leaving much lower memory consumption. We also show that our approach is widely applicable for atlas-scale data analysis; our method seamlessly integrates single-cell and bulk data in joint analysis, not requiring additional preprocessing or feature selection steps.
Collapse
Affiliation(s)
- Sishir Subedi
- https://ror.org/03rmrcq20Bioinformatics Graduate Program, University of British Columbia, Vancouver, Canada
- BC Cancer Research, Vancouver, Canada
| | - Tomokazu S Sumida
- Neurology, Program for Neuroinflammation, Yale School of Medicine, New Haven, CT, USA
| | - Yongjin P Park
- BC Cancer Research, Vancouver, Canada
- Department of Pathology and Laboratory Medicine, University of British Columbia, Vancouver, Canada
- Department of Statistics, University of British Columbia, Vancouver, Canada
| |
Collapse
|
2
|
Radu A, Dudgeon C, Clegg SM, Foster Y, Levengood AL, Sendell-Price AT, Townsend KA, Potvin DA. Genetic patterns reveal geographic drivers of divergence in silvereyes (Zosterops lateralis). Sci Rep 2024; 14:20426. [PMID: 39227633 PMCID: PMC11372117 DOI: 10.1038/s41598-024-71364-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 08/26/2024] [Indexed: 09/05/2024] Open
Abstract
Identifying mechanisms that drive population divergence under varying geographic and ecological scenarios can inform our understanding of evolution and speciation. In particular, analysis of genetic data from island populations with known colonisation timelines allows us to identify potential source populations of diverging island subspecies and current relationships among populations. Silvereyes (Zosterops lateralis) are a small passerine that have served as a valuable study system to investigate evolutionary patterns on both large and small geographic scales. We examined genetic relatedness and diversity of two silvereye subspecies, the mainland Z. l. cornwalli and island Z. l. chlorocephalus, and used 18 077 single nucleotide polymorphisms (SNPs), to compare locations across southeast Queensland, Australia. Although silvereyes are prolific island colonisers our findings revealed population divergence over relatively small spatial scales was strongly influenced by geographic isolation mediated by water barriers. Strong genetic connectivity was displayed between mainland sites, but minimal inter-island connectivity was shown despite comparable sampling distances. Genetic diversity analysis showed little difference in heterozygosity between island and mainland populations, but lower inbreeding scores among the island populations. Our study confirmed the range of the Z. l. chlorocephalus subspecies throughout the southern Great Barrier Reef. Our results show that water barriers and not geographic distance per se are important in driving incipient divergence in island populations. This helps to explain the relatively high number of phenotypically differentiated, but often geographically proximate, island silvereye subspecies compared to a lower number of phenotypically less well-defined Australian continental subspecies.
Collapse
Affiliation(s)
- Annika Radu
- School of Science, Technology and Engineering, University of the Sunshine Coast, Petrie, QLD, Australia.
| | - Christine Dudgeon
- School of Science, Technology and Engineering, University of the Sunshine Coast, Petrie, QLD, Australia
| | - Sonya M Clegg
- Department of Biology, Edward Grey Institute of Field Ornithology, University of Oxford, Oxford, UK
- Centre for Planetary Health and Food Security, Griffith University, Brisbane, Australia
| | - Yasmin Foster
- Department of Zoology, University of Otago, Dunedin, New Zealand
| | - Alexis L Levengood
- School of Science, Technology and Engineering, University of the Sunshine Coast, Petrie, QLD, Australia
| | - Ashley T Sendell-Price
- Department of Biology, Edward Grey Institute of Field Ornithology, University of Oxford, Oxford, UK
- Bioinformatics Research Technology Platform, University of Warwick, Coventry, UK
| | - Kathy A Townsend
- School of Science, Technology and Engineering, University of the Sunshine Coast, Petrie, QLD, Australia
| | - Dominique A Potvin
- School of Science, Technology and Engineering, University of the Sunshine Coast, Petrie, QLD, Australia
| |
Collapse
|
3
|
Wray A, Petrou E, Nichols KM, Pacunski R, LeClair L, Andrews KS, Kardos M, Hauser L. Contrasting effect of hybridization on genetic differentiation in three rockfish species with similar life history. Evol Appl 2024; 17:e13749. [PMID: 39035131 PMCID: PMC11259572 DOI: 10.1111/eva.13749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Revised: 06/21/2024] [Accepted: 06/25/2024] [Indexed: 07/23/2024] Open
Abstract
Hybridization can provide evolutionary benefits (e.g., population resilience to climate change) through the introduction of adaptive alleles and increase of genetic diversity. Nevertheless, management strategies may be designed based only on the parental species within a hybrid zone, without considering the hybrids. This can lead to ineffective spatial management of species, which can directly harm population diversity and negatively impact food webs. Three species of rockfish (Brown Rockfish (Sebastes caurinus), Copper Rockfish (S. auriculatus), and Quillback Rockfish (S. maliger)) are known to hybridize within Puget Sound, Washington, but genetic data from these species are used to infer population structure in the entire genus, including in species that do not hybridize. The goal of this project was to estimate the hybridization rates within the region and determine the effect of hybridization on geographic patterns of genetic structure. We sequenced 290 Brown, Copper, and Quillback rockfish using restriction-site associated DNA sequencing (RADseq) from four regions within and outside Puget Sound, Washington. We show that (i) hybridization within Puget Sound was asymmetrical, not recent, widespread among individuals, and relatively low level within the genome, (ii) hybridization affected population structure in Copper and Brown rockfish, but not in Quillback Rockfish and (iii) after taking hybridization into account we found limited directional dispersal in Brown and Copper rockfish, and evidence for two isolated populations in Quillback Rockfish. Our results suggest that rockfish population structure is species-specific, dependent on the extent of hybridization, and cannot be inferred from one species to another despite similar life history.
Collapse
Affiliation(s)
- Anita Wray
- School of Aquatic and Fishery SciencesUniversity of WashingtonSeattleWashingtonUSA
| | - Eleni Petrou
- School of Aquatic and Fishery SciencesUniversity of WashingtonSeattleWashingtonUSA
- Present address:
United States Geological Survey, Alaska Science CenterAnchorageAlaskaUSA
| | - Krista M. Nichols
- Conservation Biology Division, Northwest Fisheries Science Center, National Marine Fisheries Service, NOAASeattleWashingtonUSA
| | - Robert Pacunski
- Washington Department of Fish and WildlifeOlympiaWashingtonUSA
| | - Larry LeClair
- Washington Department of Fish and WildlifeOlympiaWashingtonUSA
| | - Kelly S. Andrews
- Conservation Biology Division, Northwest Fisheries Science Center, National Marine Fisheries Service, NOAASeattleWashingtonUSA
| | - Marty Kardos
- Conservation Biology Division, Northwest Fisheries Science Center, National Marine Fisheries Service, NOAASeattleWashingtonUSA
| | - Lorenz Hauser
- School of Aquatic and Fishery SciencesUniversity of WashingtonSeattleWashingtonUSA
- Zoology DepartmentNelson Mandela UniversityGqeberhaSouth Africa
| |
Collapse
|
4
|
Caré O, Chano V, Erley M, Rogge M, Gailing O. Circadian rhythm and redox homeostasis candidate genes showed association with shallow elevation in Norway spruce. PLANT BIOLOGY (STUTTGART, GERMANY) 2024; 26:508-520. [PMID: 38568928 DOI: 10.1111/plb.13642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/22/2023] [Accepted: 02/21/2024] [Indexed: 04/05/2024]
Abstract
The analysis of genetic variation underlying local adaptation in natural populations, together with the response to different external stimuli, is currently a hot topic in forest sciences, with the aim of identifying genetic markers controlling key phenotypic traits of interest for their inclusion in restoration and breeding programs. In Europe, one of the main tree species is Norway spruce (Picea abies (L.) H.Karst.). Using the MassARRAY® platform, 568 trees from North Rhine-Westphalia (Germany) were genotyped with 94 single nucleotide polymorphisms (SNPs) related to circadian and growth rhythms, and to stress response. The association analysis of the selected markers with health status and elevation was performed using three different methods, and those identified by at least two of these were considered as high confidence associated SNPs. While just five markers showed a weak association with health condition, 32 SNPs were correlated with elevation, six of which were considered as high confidence associated SNPs, as indicated by at least two different association methods. Among these genes, thioredoxin and pseudo response regulator 1 (PRR1) are involved in redox homeostasis and ROS detoxification, APETALA2-like 3 (AP2L3), a transcription factor, is involved in seasonal apical growth, and a RPS2-like is a disease resistance gene. The function of some of these genes in controlling light-dependent reactions and metabolic processes suggests signatures of adaptation to local photoperiod and the synchronization of the circadian rhythm. This work provides new insights into the genetic basis of local adaptation over a shallow elevation gradient in Norway spruce.
Collapse
Affiliation(s)
- O Caré
- Department of Forest Genetics and Forest Tree Breeding, University of Göttingen, Göttingen, Germany
- Center for Integrated Breeding Research (CiBreed), University of Göttingen, Göttingen, Germany
| | - V Chano
- Department of Forest Genetics and Forest Tree Breeding, University of Göttingen, Göttingen, Germany
- Center for Integrated Breeding Research (CiBreed), University of Göttingen, Göttingen, Germany
| | - M Erley
- Landesbetrieb Wald und Holz Nordrhein-Westfalen, Arnsberg, Germany
| | - M Rogge
- Landesbetrieb Wald und Holz Nordrhein-Westfalen, Arnsberg, Germany
| | - O Gailing
- Department of Forest Genetics and Forest Tree Breeding, University of Göttingen, Göttingen, Germany
- Center for Integrated Breeding Research (CiBreed), University of Göttingen, Göttingen, Germany
| |
Collapse
|
5
|
Cardim Falcao R, Edwards MR, Hurst M, Fraser E, Otterstatter M. A Review on Microbiological Source Attribution Methods of Human Salmonellosis: From Subtyping to Whole-Genome Sequencing. Foodborne Pathog Dis 2024; 21:137-146. [PMID: 38032610 PMCID: PMC10924193 DOI: 10.1089/fpd.2023.0075] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2023] Open
Abstract
Salmonella is one of the main causes of human foodborne illness. It is endemic worldwide, with different animals and animal-based food products as reservoirs and vehicles of infection. Identifying animal reservoirs and potential transmission pathways of Salmonella is essential for prevention and control. There are many approaches for source attribution, each using different statistical models and data streams. Some aim to identify the animal reservoir, while others aim to determine the point at which exposure occurred. With the advance of whole-genome sequencing (WGS) technologies, new source attribution models will greatly benefit from the discriminating power gained with WGS. This review discusses some key source attribution methods and their mathematical and statistical tools. We also highlight recent studies utilizing WGS for source attribution and discuss open questions and challenges in developing new WGS methods. We aim to provide a better understanding of the current state of these methodologies with application to Salmonella and other foodborne pathogens that are common sources of illness in the poultry and human sectors.
Collapse
Affiliation(s)
- Rebeca Cardim Falcao
- British Columbia Centre for Disease Control, Vancouver, Canada
- School of Population and Public Health, The University of British Columbia, Vancouver, Canada
| | - Megan R Edwards
- British Columbia Centre for Disease Control, Vancouver, Canada
- School of Population and Public Health, The University of British Columbia, Vancouver, Canada
| | - Matt Hurst
- Public Health Agency of Canada, Guelph, Canada
| | - Erin Fraser
- British Columbia Centre for Disease Control, Vancouver, Canada
- School of Population and Public Health, The University of British Columbia, Vancouver, Canada
| | - Michael Otterstatter
- British Columbia Centre for Disease Control, Vancouver, Canada
- School of Population and Public Health, The University of British Columbia, Vancouver, Canada
| |
Collapse
|
6
|
Song B, Fu J, Guo K, Qian J, He T, Yang L, Cheng J, Fu J. A Tibetan group from Ngawa Tibetan and Qiang Autonomous Prefecture, southwest China, is rich in genetic polymorphisms at 36 autosomal STR loci and shares a complex genetic structure with other Chinese populations. Heliyon 2023; 9:e23005. [PMID: 38125507 PMCID: PMC10731225 DOI: 10.1016/j.heliyon.2023.e23005] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2023] [Revised: 11/14/2023] [Accepted: 11/23/2023] [Indexed: 12/23/2023] Open
Abstract
The Tibetan people are ancient and populous, constituting the seventh-largest of the fifty-five ethnic minority groups in China. The Ngawa Tibetan and Qiang Autonomous Prefecture (NTQAP), situated on the border of northwest and southwest China, has its distinct group relationships. Short tandem repeat (STR) is extremely polymorphic and extensively used in the application of forensic medicine and population genetics. However, it is not clear the genetic information including linkage disequilibrium (LD) by 36 autosomal STR (A-STR) markers in the Tibetan group from NTQAP. The Tibetan population from NTQAP of southwest China was examined for 36 A-STR loci in the research. Every marker across the 36 A-STR loci was consistent with Hardy-Weinberg equilibrium (HWE). The results of the calculation revealed that the total discrimination power (TDP) is 1-2.2552 × 10-42 and the cumulative probability of exclusion (CPE) is 1-1.3031 × 10-16. Subsequently, a total of 345 alleles with allelic frequencies ranging from 0.00382 to 0.55343 were identified, and the allelic numbers varied from 5 in both the TH01 and TPOX markers to 28 in the SE33 locus. The Ngawa Tibetan population, along with other Chinese populations, exhibited influences from historical factors and regional distribution, as indicated by the results of population genetics analysis. We thus first explored the genetic characteristics and correlated forensic parameters of the 36 A-STR markers in NTQAP to fill the gap in the Tibetan population. It was discovered that these 36 autosomal STR markers supplemented forensic STR databases and offered extremely valuable polymorphisms for Chinese forensic applications, such as parentage testing and personal identification. Moreover, the study would contribute additional information regarding the substructure and diversity in the Chinese population.
Collapse
Affiliation(s)
- Binghui Song
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Laboratory of Precision Medicine and DNA Forensic Medicine, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Jiewen Fu
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Laboratory of Precision Medicine and DNA Forensic Medicine, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Laboratory of Forensic DNA, the Judicial Authentication Center, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Kan Guo
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Jie Qian
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Ting He
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Lisha Yang
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Department of Obstetrics and Center for Prenatal Diagnosis, the Affiliated Hospital of Southwest Medical University, Luzhou, China
| | - Jingliang Cheng
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Laboratory of Precision Medicine and DNA Forensic Medicine, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Laboratory of Forensic DNA, the Judicial Authentication Center, Southwest Medical University, Luzhou, 646000, Sichuan, China
| | - Junjiang Fu
- Key Laboratory of Epigenetics and Oncology, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Laboratory of Precision Medicine and DNA Forensic Medicine, the Research Center for Preclinical Medicine, Southwest Medical University, Luzhou, 646000, Sichuan, China
- Laboratory of Forensic DNA, the Judicial Authentication Center, Southwest Medical University, Luzhou, 646000, Sichuan, China
| |
Collapse
|
7
|
Hatami Maleki H, Mohammadi R, Firouzkuhi F, Darvishzadeh R, Zeinalzadeh-Tabrizi H. Molecular evidence depicts genetic divergence among Agropyron elongatum and A. cristatum accessions from gene pool of Iran. PLoS One 2023; 18:e0294694. [PMID: 38033138 PMCID: PMC10688953 DOI: 10.1371/journal.pone.0294694] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2023] [Accepted: 11/06/2023] [Indexed: 12/02/2023] Open
Abstract
The genus Agropyron has an important role in soil protection and forage production in rangelands. The investigation utilized 37 ISSR primers, resulting in the detection of 956 loci within the A. elongatum genome and 705 loci within the A. cristatum genome. The findings revealed a high level of polymorphism, with 97% of loci in A. elongatum and 84% of loci in A. cristatum exhibiting variability. Notably, the primer (AC)8GCT emerged as a promising candidate for evaluating genetic diversity due to its ability to amplify numerous loci in both species. Using both the UPGMA algorithm and Bayesian analysis, the examined Agropyron accessions were categorized into two subgroups based on their respective species. The Q values associated with these subgroups suggested that certain accessions, namely "G16," "G19," "G20," "G21," "G22," "G23," "G24," and "G25," displayed potential admixture genomes. An analysis of molecular variance (AMOVA) underscored the significance of within-species variability, which accounted for 69% of the overall diversity, compared to between-species variability at 31%. Various genetic diversity parameters, including Na, Ne, I, He, and the number of private loci, were found to be higher in A. elongatum when compared to A. cristatum. Furthermore, Jaccard similarity coefficients ranged from 0.33 to 0.66 in A. cristatum and from 0.25 to 0.7 in A. elongatum, indicating the extent of genetic relatedness among these species. Intriguingly, the study identified two and three heterotic groups in A. cristatum and A. elongatum, respectively, which could be harnessed in the development of synthetic varieties to exploit heterosis. The results also indicated that a small proportion of ISSR loci pairs (5.2% in A. elongatum and 0.5% in A. cristatum) exhibited significant levels of linkage disequilibrium (LD) (P≤0.05), suggesting the potential utility of LD-based association mapping in Agropyron species. In conclusion, this research sheds light on the genetic diversity of Agropyron species and provides valuable insights into their potential applications in soil protection and forage production, as well as the prospects for enhancing genetic variability and heterosis in these species.
Collapse
Affiliation(s)
- Hamid Hatami Maleki
- Department of Plant Production and Genetics, Faculty of Agriculture, University of Maragheh, Maragheh, Iran
| | - Reza Mohammadi
- Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran
| | - Fatemeh Firouzkuhi
- Department of Plant Production and Genetics, Faculty of Agriculture, University of Maragheh, Maragheh, Iran
| | - Reza Darvishzadeh
- Department of Plant Production and Genetics, Faculty of Agriculture, Urmia University, Urmia, Iran
| | - Hossein Zeinalzadeh-Tabrizi
- Department of Horticulture and Agronomy, Faculty of Agriculture, Kyrgyz-Turkish Manas University, Bishkek, Kyrgyzstan
| |
Collapse
|
8
|
Wen Y, Liu J, Su Y, Chen X, Hou Y, Liao L, Wang Z. Forensic biogeographical ancestry inference: recent insights and current trends. Genes Genomics 2023; 45:1229-1238. [PMID: 37081293 DOI: 10.1007/s13258-023-01387-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 04/01/2023] [Indexed: 04/22/2023]
Abstract
BACKGROUND As a powerful complement to the paradigmatic DNA profiling strategy, biogeographical ancestry inference (BGAI) plays a significant part in human forensic investigation especially when a database hit or eyewitness testimony are not available. It indicates one's biogeographical profile based on known population-specific genetic variations, and thus is crucial for guiding authority investigations to find unknown individuals. Forensic biogeographical ancestry testing exploits much of the recent advances in the understanding of human genomic variation and improving of molecular biology. OBJECTIVE In this review, recent development of prospective ancestry informative markers (AIMs) and the statistical approaches of inferring biogeographic ancestry from AIMs are elucidated and discussed. METHODS We highlight the research progress of three potential AIMs (i.e., single nucleotide polymorphisms, microhaplotypes, and Y or mtDNA uniparental markers) and discuss the prospects and challenges of two methods that are commonly used in BGAI. CONCLUSION While BGAI for forensic purposes has been thriving in recent years, important challenges, such as ethics and responsibilities, data completeness, and ununified standards for evaluation, remain for the use of biogeographical ancestry information in human forensic investigations. To address these issues and fully realize the value of BGAI in forensic investigation, efforts should be made not only by labs/institutions around the world independently, but also by inter-lab/institution collaborations.
Collapse
Affiliation(s)
- Yufeng Wen
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
- School of Life Sciences, Jilin University, Changchun, 130012, China
| | - Jing Liu
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yonglin Su
- Department of Rehabilitation Medicine, West China Hospital Sichuan University, Chengdu, 610041, China
| | - Xiacan Chen
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yiping Hou
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Linchuan Liao
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| | - Zheng Wang
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China.
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| |
Collapse
|
9
|
Wootton LM, Forest F, Verboom GA. Consilience Across Multiple, Independent Genomic Data Sets Reveals Species in a Complex with Limited Phenotypic Variation. Syst Biol 2023; 72:753-766. [PMID: 37098166 DOI: 10.1093/sysbio/syad024] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2022] [Revised: 04/18/2023] [Accepted: 04/20/2023] [Indexed: 04/27/2023] Open
Abstract
Species delimitation in the genomic era has focused predominantly on the application of multiple analytical methodologies to a single massive parallel sequencing (MPS) data set, rather than leveraging the unique but complementary insights provided by different classes of MPS data. In this study, we demonstrate how the use of two independent MPS data sets, a sequence capture data set and a single-nucleotide polymorphism (SNP) data set generated via genotyping-by-sequencing, enables the resolution of species in three complexes belonging to the grass genus Ehrharta, whose strong population structure and subtle morphological variation limit the effectiveness of traditional species delimitation approaches. Sequence capture data are used to construct a comprehensive phylogenetic tree of Ehrharta and to resolve population relationships within the focal clades, while SNP data are used to detect patterns of gene pool sharing across populations, using a novel approach that visualizes multiple values of K. Given that the two genomic data sets are independent, the strong congruence in the clusters they resolve provides powerful ratification of species boundaries in all three complexes studied. Our approach is also able to resolve a number of single-population species and a probable hybrid species, both of which would be difficult to detect and characterize using a single MPS data set. Overall, the data reveal the existence of 11 and five species in the E. setacea and E. rehmannii complexes, with the E. ramosa complex requiring further sampling before species limits are finalized. Despite phenotypic differentiation being generally subtle, true crypsis is limited to just a few species pairs and triplets. We conclude that, in the absence of strong morphological differentiation, the use of multiple, independent genomic data sets is necessary in order to provide the cross-data set corroboration that is foundational to an integrative taxonomic approach. [Species delimitation; genotyping-by-sequencing; population structure; integrative taxonomy; cryptic species; Ehrharta (Poaceae).].
Collapse
Affiliation(s)
- Lara M Wootton
- Department of Biological Sciences and Bolus Herbarium, University of Cape Town, 7701, Rondebosch, South Africa
- Laboratoire d'Ecologie Alpine, Université Grenoble Alpes, FR-38000 Grenoble, France
| | - Félix Forest
- Royal Botanic Gardens, Kew, Richmond, Surrey TW9 3AE, UK
| | - G Anthony Verboom
- Department of Biological Sciences and Bolus Herbarium, University of Cape Town, 7701, Rondebosch, South Africa
| |
Collapse
|
10
|
Choi I, Han E, Wojciechowski MF, Heo T, Park J, Yang J, Gantsetseg A, Cheon K, Tamaki I, Lee J. The genetic structure and demographic history of Zabelia tyaihyonii, endemic to Korean limestone karst forests, based on genome-wide SNP markers. Ecol Evol 2023; 13:e10252. [PMID: 37404698 PMCID: PMC10316687 DOI: 10.1002/ece3.10252] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Revised: 06/13/2023] [Accepted: 06/20/2023] [Indexed: 07/06/2023] Open
Abstract
Similar to the global phenomenon, many plant species endemic to Korean limestone karst forests are at risk of extinction due to human intervention. Zabelia tyaihyonii is a familiar shrub, called "Hardy abelia" and "Fragrant abelia" growing in the karst forests of Korea, where it is one of the most threatened species. We investigated the genetic structure and demographic history of Z. tyaihyonii, which allow us to develop appropriate conservation and management strategies. The genetic structure was evaluated using a total of 187 samples from 14 populations, covering the entire distribution of Z. tyaihyonii in South Korea. We utilized 254 and 1753 SNP loci obtained via MIG-seq (Multiplexed ISSR Genotyping by sequencing) for structure and demographic analyses, respectively. The population demographic modeling was performed with site frequency spectrum. To gain further historical insights, we also employed ENM (Ecological Niche Modeling). We found two distinct clusters (CLI and CLII) of ancient origin (ca. 490 ka). Despite CLII experiencing a more severe bottleneck, both clusters showed similar levels of genetic diversity, indicating mutual historical gene flow. Their historical distribution range seems to have changed very little. We proposed a historical distribution scenario for Z. tyaihyonii, taking into account its intrinsic factors, and emphasized a more complex response to Quaternary climate change beyond simple allopatric speciation models. These findings provide valuable insights for conservation and management strategies for Z. tyaihyonii.
Collapse
Affiliation(s)
- In‐Su Choi
- Department of Biological Sciences and BiotechnologyHannam UniversityDaejeonKorea
| | - Eun‐Kyeong Han
- Department of Biology EducationChonnam National UniversityGwangjuKorea
| | | | - Tae‐Im Heo
- Division of Baekdudaegan Biodiversity ConservationBaekdudaegan National ArboretumBonghwaKorea
| | - Jong‐Soo Park
- Division of BotanyHonam National Institute of Biological ResourcesMokpoKorea
| | - Jong‐Cheol Yang
- Division of Baekdudaegan Biodiversity ConservationBaekdudaegan National ArboretumBonghwaKorea
| | | | | | | | - Jung‐Hyun Lee
- Department of Biology EducationChonnam National UniversityGwangjuKorea
- School of Life SciencesArizona State UniversityTempeArizonaUSA
| |
Collapse
|
11
|
Rodrigues LF, de Sousa AA, Mendes Júnior WP, Cardoso e Silva AC, do Nascimento MHS, Barros MC, Sampaio I, Fraga EDC. Genetic Differentiation of Aedes aegypti (Diptera: Culicidae) in Areas with High Rates of Infestation in Mid-North Region of Brazil. INSECTS 2023; 14:530. [PMID: 37367345 PMCID: PMC10299528 DOI: 10.3390/insects14060530] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/04/2023] [Revised: 05/30/2023] [Accepted: 06/01/2023] [Indexed: 06/28/2023]
Abstract
Aedes aegypti is the principal vector of the arboviruses-yellow fever, dengue virus, chikungunya, and zika virus. Given the epidemiological importance of this mosquito, its capacity to adapt to different habitats, and its resistance to many types of control measures, systematic research into the genetic variability of the populations of this mosquito is one of the most important steps toward a better understanding of its population structure and vector competence. In this context, the present study verified the presence of distinct genetic lineages of Ae. aegypti in areas with high infestation rates, based on the analysis of microsatellite markers. The samples were collected in nine municipalities with high building infestation rates in the Mid-North region of Brazil. Six microsatellite loci were genotyped in the 138 samples, producing a total of 32 alleles, varying from one to nine alleles per locus in each of the different populations. The AMOVA revealed greater within-population genetic differentiation with high fixation rates. The general analysis of population structure, based on a Bayesian approach, revealed K = 2, with two Ae. aegypti lineages that were highly differentiated genetically. These data on the connectivity of the populations and the genetic isolation of the lineages provide important insights for the development of innovative strategies for the control of the populations of this important disease vector.
Collapse
Affiliation(s)
- Luzianny Farias Rodrigues
- Graduate Program in Biodiversity, Environment and Health, Laboratory of Genetics and Molecular Biology, Universidade Estadual do Maranhão—UEMA, Caxias 65604-380, MA, Brazil; (L.F.R.)
| | - Andrelina Alves de Sousa
- Graduate Program in Genetics and Molecular Biology, Laboratory of Genetics and Molecular Biology, Universidade Federal do Pará—UFPA, Belém 66075-110, PA, Brazil; (A.A.d.S.)
| | - Walter Pinheiro Mendes Júnior
- Graduate Program in Biodiversity, Environment and Health, Laboratory of Genetics and Molecular Biology, Universidade Estadual do Maranhão—UEMA, Caxias 65604-380, MA, Brazil; (L.F.R.)
| | - Amanda Caroline Cardoso e Silva
- Graduate Program in Biodiversity, Environment and Health, Laboratory of Genetics and Molecular Biology, Universidade Estadual do Maranhão—UEMA, Caxias 65604-380, MA, Brazil; (L.F.R.)
| | | | - Maria Claudene Barros
- Graduate Program in Biodiversity, Environment and Health, Laboratory of Genetics and Molecular Biology, Universidade Estadual do Maranhão—UEMA, Caxias 65604-380, MA, Brazil; (L.F.R.)
| | - Iracilda Sampaio
- Graduate Program in Genetics and Molecular Biology, Laboratory of Genetics and Molecular Biology, Universidade Federal do Pará—UFPA, Belém 66075-110, PA, Brazil; (A.A.d.S.)
| | - Elmary da Costa Fraga
- Graduate Program in Biodiversity, Environment and Health, Laboratory of Genetics and Molecular Biology, Universidade Estadual do Maranhão—UEMA, Caxias 65604-380, MA, Brazil; (L.F.R.)
| |
Collapse
|
12
|
Farhat P, Mandáková T, Divíšek J, Kudoh H, German DA, Lysak MA. The evolution of the hypotetraploid Catolobus pendulus genome - the poorly known sister species of Capsella. FRONTIERS IN PLANT SCIENCE 2023; 14:1165140. [PMID: 37223809 PMCID: PMC10200890 DOI: 10.3389/fpls.2023.1165140] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 02/13/2023] [Accepted: 04/04/2023] [Indexed: 05/25/2023]
Abstract
The establishment of Arabidopsis as the most important plant model has also brought other crucifer species into the spotlight of comparative research. While the genus Capsella has become a prominent crucifer model system, its closest relative has been overlooked. The unispecific genus Catolobus is native to temperate Eurasian woodlands, from eastern Europe to the Russian Far East. Here, we analyzed chromosome number, genome structure, intraspecific genetic variation, and habitat suitability of Catolobus pendulus throughout its range. Unexpectedly, all analyzed populations were hypotetraploid (2n = 30, ~330 Mb). Comparative cytogenomic analysis revealed that the Catolobus genome arose by a whole-genome duplication in a diploid genome resembling Ancestral Crucifer Karyotype (ACK, n = 8). In contrast to the much younger Capsella allotetraploid genomes, the presumably autotetraploid Catolobus genome (2n = 32) arose early after the Catolobus/Capsella divergence. Since its origin, the tetraploid Catolobus genome has undergone chromosomal rediploidization, including a reduction in chromosome number from 2n = 32 to 2n = 30. Diploidization occurred through end-to-end chromosome fusion and other chromosomal rearrangements affecting a total of six of 16 ancestral chromosomes. The hypotetraploid Catolobus cytotype expanded toward its present range, accompanied by some longitudinal genetic differentiation. The sister relationship between Catolobus and Capsella allows comparative studies of tetraploid genomes of contrasting ages and different degrees of genome diploidization.
Collapse
Affiliation(s)
- Perla Farhat
- Central European Institute of Technology (CEITEC), Masaryk University, Brno, Czechia
| | - Terezie Mandáková
- Central European Institute of Technology (CEITEC), Masaryk University, Brno, Czechia
- Department of Experimental Biology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Jan Divíšek
- Department of Botany and Zoology, Faculty of Science, Masaryk University, Brno, Czechia
| | - Hiroshi Kudoh
- Center for Ecological Research, Kyoto University, Otsu, Japan
| | - Dmitry A. German
- South-Siberian Botanical Garden, Altai State University, Barnaul, Russia
| | - Martin A. Lysak
- Central European Institute of Technology (CEITEC), Masaryk University, Brno, Czechia
- National Centre for Biomolecular Research (NCBR), Faculty of Science, Masaryk University, Brno, Czechia
| |
Collapse
|
13
|
McCoy KD, Toty C, Dupraz M, Tornos J, Gamble A, Garnier R, Descamps S, Boulinier T. Climate change in the Arctic: Testing the poleward expansion of ticks and tick-borne diseases. GLOBAL CHANGE BIOLOGY 2023; 29:1729-1740. [PMID: 36700347 DOI: 10.1111/gcb.16617] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/25/2022] [Accepted: 11/29/2022] [Indexed: 05/28/2023]
Abstract
Climate change is most strongly felt in the polar regions of the world, with significant impacts on the species that live there. The arrival of parasites and pathogens from more temperate areas may become a significant problem for these populations, but current observations of parasite presence often lack a historical reference of prior absence. Observations in the high Arctic of the seabird tick Ixodes uriae suggested that this species expanded poleward in the last two decades in relation to climate change. As this tick can have a direct impact on the breeding success of its seabird hosts and vectors several pathogens, including Lyme disease spirochaetes, understanding its invasion dynamics is essential for predicting its impact on polar seabird populations. Here, we use population genetic data and host serology to test the hypothesis that I. uriae recently expanded into Svalbard. Both black-legged kittiwakes (Rissa tridactyla) and thick-billed murres (Uria lomvia) were sampled for ticks and blood in Kongsfjorden, Spitsbergen. Ticks were genotyped using microsatellite markers and population genetic analyses were performed using data from 14 reference populations from across the tick's northern distribution. In contrast to predictions, the Spitsbergen population showed high genetic diversity and significant differentiation from reference populations, suggesting long-term isolation. Host serology also demonstrated a high exposure rate to Lyme disease spirochaetes (Bbsl). Targeted PCR and sequencing confirmed the presence of Borrelia garinii in a Spitsbergen tick, demonstrating the presence of Lyme disease bacteria in the high Arctic for the first time. Taken together, results contradict the notion that I. uriae has recently expanded into the high Arctic. Rather, this tick has likely been present for some time, maintaining relatively high population sizes and an endemic transmission cycle of Bbsl. Close future observations of population infestation/infection rates will now be necessary to relate epidemiological changes to ongoing climate modifications.
Collapse
Affiliation(s)
- Karen D McCoy
- MIVEGEC, Centre IRD, University of Montpellier CNRS IRD, Montpellier, France
| | - Céline Toty
- MIVEGEC, Centre IRD, University of Montpellier CNRS IRD, Montpellier, France
| | - Marlène Dupraz
- MIVEGEC, Centre IRD, University of Montpellier CNRS IRD, Montpellier, France
| | - Jérémy Tornos
- MIVEGEC, Centre IRD, University of Montpellier CNRS IRD, Montpellier, France
- CEFE, UMR 5175, University of Montpellier CNRS, Montpellier, France
| | - Amandine Gamble
- CEFE, UMR 5175, University of Montpellier CNRS, Montpellier, France
| | - Romain Garnier
- CEFE, UMR 5175, University of Montpellier CNRS, Montpellier, France
| | | | | |
Collapse
|
14
|
Delfan S, Bihamta MR, Dadrezaei ST, Abbasi A, Alipoor H. Exploring genomic regions involved in bread wheat resistance to leaf rust at seedling/adult stages by using GWAS analysis. BMC Genomics 2023; 24:83. [PMID: 36810004 PMCID: PMC9945389 DOI: 10.1186/s12864-022-09096-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/06/2022] [Accepted: 12/22/2022] [Indexed: 02/24/2023] Open
Abstract
BACKGROUND Global wheat productivity is seriously challenged by a range of rust pathogens, especially leaf rust derived from Puccinia triticina. Since the most efficient approach to control leaf rust is genetic resistance, many efforts have been made to uncover resistance genes; however, it demands an ongoing exploration for effective resistance sources because of the advent of novel virulent races. Thus, the current study was focused on detecting leaf rust resistance-related genomic loci against the P. triticina prevalent races by GWAS in a set of Iranian cultivars and landraces. RESULTS Evaluation of 320 Iranian bread wheat cultivars and landraces against four prevalent rust pathotypes of P. triticina (LR-99-2, LR-98-12, LR-98-22, and LR-97-12) indicated the diversity in wheat accessions responses to P. triticina. From GWAS results, 80 leaf rust resistance QTLs were located in the surrounding known QTLs/genes on almost chromosomes, except for 1D, 3D, 4D, and 7D. Of these, six MTAs (rs20781/rs20782 associated with resistance to LR-97-12; rs49543/rs52026 for LR-98-22; rs44885/rs44886 for LR-98-22/LR-98-1/LR-99-2) were found on genomic regions where no resistance genes previously reported, suggesting new loci conferring resistance to leaf rust. The GBLUP genomic prediction model appeared better than RR-BLUP and BRR, reflecting that GBLUP is a potent model for genomic selection in wheat accessions. CONCLUSIONS Overall, the newly identified MTAs as well as the highly resistant accessions in the recent work provide an opportunity towards improving leaf rust resistance.
Collapse
Affiliation(s)
- Saba Delfan
- grid.46072.370000 0004 0612 7950Department of Agronomy and Plant Breeding, Faculty of Agricultural Sciences and Engineering, University of Tehran, Karaj, Iran
| | - Mohammad Reza Bihamta
- Department of Agronomy and Plant Breeding, Faculty of Agricultural Sciences and Engineering, University of Tehran, Karaj, Iran.
| | - Seyed Taha Dadrezaei
- grid.473705.20000 0001 0681 7351Department of Cereal Research, Seed and Plant Improvement Institute, Agricultural Research and Education Organization (AREEO), Karaj, Iran
| | - Alireza Abbasi
- grid.46072.370000 0004 0612 7950Department of Agronomy and Plant Breeding, Faculty of Agricultural Sciences and Engineering, University of Tehran, Karaj, Iran
| | - Hadi Alipoor
- grid.412763.50000 0004 0442 8645Department of Plant Production and Genetics, Faculty of Agriculture, Urmia University, Urmia, Iran
| |
Collapse
|
15
|
Suzuki M, Ohno K, Sawayama E, Morinaga SI, Kishida T, Matsumoto T, Kato H. Genomics reveals a genetically isolated population of the Pacific white-sided dolphin (Lagenorhynchus obliquidens) distributed in the Sea of Japan. Mol Ecol 2023; 32:881-891. [PMID: 36440502 DOI: 10.1111/mec.16797] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2022] [Revised: 11/10/2022] [Accepted: 11/24/2022] [Indexed: 11/29/2022]
Abstract
The processes by which animals become genetically isolated in an open environment such as the ocean have not yet been fully elucidated. Morphologically different populations of Pacific white-sided dolphin Lagenorhynchus obliquidens are observed sympatrically in the Sea of Japan. However, genetic studies that exclusively used limited mitochondrial loci or microsatellite DNA have failed to demonstrate the existence of genetically distinct populations. Here, to reveal the population structure, we analysed genome-wide population genetic data using single nucleotide polymorphisms (SNPs) gathered in 2018-2021 from all domestic captive individuals in aquaria, the majority of which originated from the wild, as well as from some stranded individuals, together covering a wide area of coastal water around Japan (n = 123). Multiplexed intersimple sequence repeat genotyping-by-sequencing analysis was performed to obtain the SNP data. Principal coordinate analysis and the clustering method structure indicated that two genetically-distinct populations exist, with little interpopulation gene flow revealed. In addition, the genotypic segregation was reflected in differences in external morphotype. Furthermore, a population demographic analysis based on the whole-genome sequences of an individual from each population indicated that sea-level changes during the Last Glacial Period probably led to allopatric divergence of this species in a limited area of the Sea of Japan, with that group subsequently sharing a distribution area with the other population. These findings yield insights into the formation of genetically isolated sympatric populations in the ocean.
Collapse
Affiliation(s)
- Miwa Suzuki
- College of Bioresource Sciences, Nihon University, Fujisawa, Kanagawa, Japan
| | - Kaho Ohno
- College of Bioresource Sciences, Nihon University, Fujisawa, Kanagawa, Japan
| | - Eitaro Sawayama
- College of Bioresource Sciences, Nihon University, Fujisawa, Kanagawa, Japan
| | - Shin-Ichi Morinaga
- Department of Natural & Environmental Science, Teikyo University of Science, Tokyo, Japan
| | - Takushi Kishida
- Museum of Natural and Environmental History, Shizuoka, Japan
| | | | | |
Collapse
|
16
|
Michiels B, Pardo AM, Ortega Masagué MF, Giovambattista G, Corva PM. Characterization of experimental cattle populations from Argentina with a low-density SNP genotyping panel. Anim Genet 2023; 54:68-72. [PMID: 36464984 DOI: 10.1111/age.13278] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/06/2022] [Revised: 09/28/2022] [Accepted: 11/14/2022] [Indexed: 12/12/2022]
Abstract
The objectives of the present experiment were to evaluate a low-density SNP array designed for the molecular characterisation of gene banks and to assess the genetic diversity and population structure of beef cattle herds from an Argentinean research station. Forty-nine animals from three breeds (Angus, Hereford, and Argentinean Creole) were genotyped using the multi-species IMAGE001 60-K SNP array (10 K for cattle). Genotypes of other 19 cattle populations from Argentina, other American countries, and Europe were included in the study. Of special interest was the characterization of the Argentinean Creole, the only autochthonous cattle breed in the country. Due to the merging of different datasets, approximately 5 K SNPs were effectively used. Genetic differentiation (FST ), principal component analysis, neighbour-joining tree of Reynolds distances and ancestry analysis showed that autochthonous American breeds are clearly differentiated, but all have genetic influences of Iberian cattle. The analysed herds of Argentinean Creole showed no evidence of recent admixture and represent a unique genetic pool within local American breeds. An experimental herd and the local commercial Hereford population have also diverged, probably due to the influence of current selection objectives in the breed. Our results illustrate the utility of using low-cost, low density SNP arrays in the evaluation of animal genetic resources. This type of panels could become a very useful resource in developing countries, where most endangered cattle breeds are located. The results also reinforce the importance of experimental herds as reservoir of genetic diversity, particularly in the case of local breeds under-represented in traditional production systems.
Collapse
Affiliation(s)
- Brian Michiels
- Facultad de Ciencias Agrarias, Universidad Nacional de Mar del Plata, Mar del Plata, Argentina
| | - Alan Maxs Pardo
- Facultad de Ciencias Agrarias, Universidad Nacional de Mar del Plata, Mar del Plata, Argentina.,Instituto Nacional de Tecnología Agropecuaria (INTA) E.E.A., Balcarce, Argentina
| | - Maria F Ortega Masagué
- Instituto de Investigación Animal del Chaco Semiárido, IIACS-CIAP (INTA), Tucumán, Argentina
| | - Guillermo Giovambattista
- Instituto de Genética Veterinaria (IGEVET, Facultad de Ciencias Veterinarias, UNLP-CONICET), La Plata, Argentina
| | - Pablo M Corva
- Facultad de Ciencias Agrarias, Universidad Nacional de Mar del Plata, Mar del Plata, Argentina
| |
Collapse
|
17
|
Dicks KL, Ball AD, Banfield L, Barrios V, Boufaroua M, Chetoui A, Chuven J, Craig M, Faqeer MYA, Garba HHM, Guedara H, Harouna A, Ivy J, Najjar C, Petretto M, Pusey R, Rabeil T, Riordan P, Senn HV, Taghouti E, Wacher T, Woodfine T, Gilbert T. Genetic diversity in global populations of the critically endangered addax ( Addax nasomaculatus) and its implications for conservation. Evol Appl 2023; 16:111-125. [PMID: 36699120 PMCID: PMC9850015 DOI: 10.1111/eva.13515] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 10/10/2022] [Accepted: 11/04/2022] [Indexed: 12/24/2022] Open
Abstract
Threatened species are frequently patchily distributed across small wild populations, ex situ populations managed with varying levels of intensity and reintroduced populations. Best practice advocates for integrated management across in situ and ex situ populations. Wild addax (Addax nasomaculatus) now number fewer than 100 individuals, yet 1000 of addax remain in ex situ populations, which can provide addax for reintroductions, as has been the case in Tunisia since the mid-1980s. However, integrated management requires genetic data to ascertain the relationships between wild and ex situ populations that have incomplete knowledge of founder origins, management histories, and pedigrees. We undertook a global assessment of genetic diversity across wild, ex situ and reintroduced populations in Tunisia to assist conservation planning for this Critically Endangered species. We show that the remnant wild populations retain more mitochondrial haplotypes that are more diverse than the entirety of the ex situ populations across Europe, North America and the United Arab Emirates, and the reintroduced Tunisian population. Additionally, 1704 SNPs revealed that whilst population structure within the ex situ population is minimal, each population carries unique diversity. Finally, we show that careful selection of founders and subsequent genetic management is vital to ensure genetic diversity is provided to, and minimize drift and inbreeding within reintroductions. Our results highlight a vital need to conserve the last remaining wild addax population, and we provide a genetic foundation for determining integrated conservation strategies to prevent extinction and optimize future reintroductions.
Collapse
Affiliation(s)
- Kara L. Dicks
- RZSS WildGenes, Royal Zoological Society of ScotlandEdinburghUK
| | - Alex D. Ball
- RZSS WildGenes, Royal Zoological Society of ScotlandEdinburghUK
| | - Lisa Banfield
- Life Sciences DepartmentAl Ain ZooAl AinUnited Arab Emirates
| | | | | | | | - Justin Chuven
- Terrestrial & Marine Biodiversity Management Sector, Environment Agency – Abu DhabiAbu DhabiUnited Arab Emirates
| | - Mark Craig
- Life Sciences DepartmentAl Ain ZooAl AinUnited Arab Emirates
| | | | | | | | - Abdoulaye Harouna
- SaharaConservationSaint Maur des FossésFrance
- Noé au NigerRéserve Naturelle Nationale de Termit et Tin‐ToummaNiger
| | - Jamie Ivy
- San Diego Zoo Wildlife AllianceSan DiegoCaliforniaUSA
| | - Chawki Najjar
- Conservation Biology, Marwell WildlifeWinchesterUK
- Association Tunisienne de la Vie SauvageTunisTunisia
| | | | - Ricardo Pusey
- Terrestrial & Marine Biodiversity Management Sector, Environment Agency – Abu DhabiAbu DhabiUnited Arab Emirates
| | | | - Philip Riordan
- Conservation Biology, Marwell WildlifeWinchesterUK
- School of Biological Sciences, Faculty of Environmental and Life SciencesUniversity of SouthamptonSouthamptonUK
| | - Helen V. Senn
- RZSS WildGenes, Royal Zoological Society of ScotlandEdinburghUK
| | | | - Tim Wacher
- Conservation & Policy, Zoological Society of LondonLondonUK
| | - Tim Woodfine
- Conservation Biology, Marwell WildlifeWinchesterUK
- School of Biological Sciences, Faculty of Environmental and Life SciencesUniversity of SouthamptonSouthamptonUK
| | - Tania Gilbert
- Conservation Biology, Marwell WildlifeWinchesterUK
- School of Biological Sciences, Faculty of Environmental and Life SciencesUniversity of SouthamptonSouthamptonUK
| |
Collapse
|
18
|
Ilves A, Kaljund K, Sild E, Münzbergová Z. High genetic variation of Trifolium alpestre at the northern margin: but for how long? CONSERV GENET 2022. [DOI: 10.1007/s10592-022-01490-2] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
|
19
|
Fahey PS, Udovicic F, Cantrill DJ, Bayly MJ. A box on the river: The phylogenetics and phylogeography of Eucalyptus baueriana (Eucalyptus sect. Adnataria ser. Heterophloiae). PLoS One 2022; 17:e0276117. [PMID: 36395183 PMCID: PMC9671351 DOI: 10.1371/journal.pone.0276117] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2021] [Accepted: 09/29/2022] [Indexed: 11/19/2022] Open
Abstract
We present a phylogeographic study of the tree species Eucalyptus baueriana Schauer, which occurs in disjunct areas on the near coastal plains and ranges of the south-east Australian mainland. DArTseq data are used to build a phylogeny including E. baueriana and closely related taxa to test its monophyly, test the genetic distinctness of the three subspecies of E. baueriana, and investigate relationships between its disjunct populations. Additionally, we use population structure analysis to investigate the genetic distinctness of populations, and MaxEnt to investigate the environmental factors potentially influencing the species' distribution. We show E. baueriana is monophyletic and most closely related to three other Blue Box eucalypt species: E. conica H.Deane & Maiden, E. dalveenica T.L.Collins, R.L.Andrew & J.J.Bruhl and E. magnificata L.A.S.Johnson & K.D.Hill, with some evidence for genetic introgression between these taxa. Within E. baueriana, the deepest genetic breaks do not correspond with the subspecies classification as the two geographically restricted subspecies, together with samples of the more widespread E. baueriana subsp. baueriana from west of the Gippsland lowlands, form a south-western clade with that is sister to other populations of subsp. baueriana. The oldest genetic break in the species occurs in far eastern Gippsland (Victoria), corresponding to one of the shortest geographic disjunctions in the species' distribution. Genetic breaks in other species have been observed in this region which is broadly referred to as the southern transition zone. Both total annual rainfall and the seasonality of this rainfall are hypothesised to affect the species' distribution; gaps in its distribution are in areas of higher rainfall that support closed forest and in regions with more winter dominated rainfall.
Collapse
Affiliation(s)
- Patrick S. Fahey
- School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
| | - Frank Udovicic
- Royal Botanic Gardens Victoria, South Yarra, Victoria, Australia
| | | | - Michael J. Bayly
- School of BioSciences, The University of Melbourne, Parkville, Victoria, Australia
| |
Collapse
|
20
|
Chaturvedi T, Gupta AK, Shanker K, Dubey BK, Tiwari G. Maximizing genetic gain through unlocking genetic variation in different ecotypes of kalmegh ( Andrographis paniculata (Burm. f.) Nee). FRONTIERS IN PLANT SCIENCE 2022; 13:1042222. [PMID: 36420038 PMCID: PMC9677111 DOI: 10.3389/fpls.2022.1042222] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 10/05/2022] [Indexed: 06/16/2023]
Abstract
Andrographis paniculata, commonly known as kalmegh is among the most popular medicinal herbs in Southeast Asia. It is widely cultivated for medicinal purposes. The bioactive molecule, Andrographolide accumulated in herb leaves has immense therapeutic and economic potential. However, comprehensive information regarding genetic diversity is very limited in this species. The present study assessed genetic diversity between and within the six populations (ecotypes) of twenty-four kalmegh accessions using multiple datasets (agro-morphological traits, phytochemical traits, and genic markers). This is the established report where EST-SSR (Expressed sequence tags-Simple Sequence Repeat) markers have been used to unlock genetic variation in kalmegh. Here, we identified and developed ninety-one metabolic pathway-specific EST-SSR markers. Finally, 32 random EST-SSR primer pairs were selected for genetic diversity assessment. Multivariate analysis to unveil the agro-morphological, phytochemical and genotypic variability was helpful in discriminating various germplasms studied in the present study. Among all the morphological discriptors used in present study, days to fifty percent flowering and dry herb yield were found as potential selection index for AP genetic improvement. Hierarchical cluster analysis built with agro-morphological data identified three major groups. However, corresponding analysis with phytochemical and molecular data generated two clear-cut groups among the studied individuals. Moreover, the grouping of individuals into different clusters using multiple datasets was geographically independent, and also showed inconsistency in grouping among agromorphological, phytochemical and molecular dataset based clusters. However, joint analysis using agro-morphological, phytochemical and genotypic information generated two genetic groups, which could be a valuable resource for identifying complementary crossing panels in the kalmegh breeding program. The accessions AP7, AP13, AP5, AP3 belong to cluster I and accessions AP17, AP18 belong to cluster II could be utilized as potential donors for high dry herb yield and andrographolide content, respectively in different selective breeding programs of AP. Thus, our results provided useful information about the overall genetic diversity and variation in economic traits useful for initiating selective breeding programs for contrasting traits of interest and maximizing genetic gain in kalmegh.
Collapse
Affiliation(s)
- Trishna Chaturvedi
- Division of Plant Breeding and Genetic Resource Conservation, Central Institute of Medicinal and Aromatic Plants, Council of Scientific and Industrial Research, Lucknow, Uttar Pradesh, India
| | - Anil Kumar Gupta
- Division of Plant Breeding and Genetic Resource Conservation, Central Institute of Medicinal and Aromatic Plants, Council of Scientific and Industrial Research, Lucknow, Uttar Pradesh, India
| | - Karuna Shanker
- Phytochemistry Division, Central Institute of Medicinal and Aromatic Plants, Council of Scientific and Industrial Research, Lucknow, Uttar Pradesh, India
| | - Basant Kumar Dubey
- Biotechnology Division, Central Institute of Medicinal and Aromatic Plants, Council of Scientific and Industrial Research, Lucknow, Uttar Pradesh, India
| | - Gunjan Tiwari
- Division of Plant Breeding and Genetic Resource Conservation, Central Institute of Medicinal and Aromatic Plants, Council of Scientific and Industrial Research, Lucknow, Uttar Pradesh, India
| |
Collapse
|
21
|
Haq S, Dubey S, Dhingra P, Verma KS, Kumari D, Kothari SL, Kachhwaha S. Exploring the genetic makeup and population structure among Capsicum accessions for crop improvement and breeding curriculum insights. J Genet Eng Biotechnol 2022; 20:116. [PMID: 35932438 PMCID: PMC9357239 DOI: 10.1186/s43141-022-00398-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/20/2022] [Accepted: 07/10/2022] [Indexed: 02/03/2023]
Abstract
Background Capsicum or chilli is an important crop in India which exhibits immense structural and genetic variations reflecting their intra- and inter-specific relationships. The aim of this study was to establish relationships amongst 54 Capsicum accessions through analysis of genetic and population structure using ISSR markers. Results Out of 19, successful DNA amplifications were shown by 7 ISSR primers and a total of 80 bands were identified ranging between 8 and 14 with an average of 11.43 bands/primer. A significant degree of polymorphic information content (PIC), discriminating power (DP), resolving power (RP), effective multiplex ratio (EMR), and marker index (MI) were identified as 0.39, 0.70, 6.40, 5.88, and 2.30, respectively, using ISSR markers in chillies. The cross-transferability ranged from 8.0 to 72.15% with an average of 52.63% among chillies. Amongst genetic information, grand mean values were 0.264, 0.180, 0.376, 0.296, and 0.180, which correspond to Shannon’s information index (I), expected heterozygosity (He), Nei’s gene diversity, total diversity among species (Ht), diversity within species (Hs), respectively. Further, the coefficients of gene differentiation (Gst) and gene flow (Nm) were 0.393 and 0.773, representing higher genetic variation among the population which was confirmed by analysis of molecular variance (AMOVA). Conclusion ISSR markers represented a potent system for the estimation of relationships or variation studies and generated information useful for planning crop management and improvement strategies in chilli breeding.
Collapse
Affiliation(s)
- Shamshadul Haq
- Department of Botany, University of Rajasthan, Jaipur, 302004, India
| | - Shikha Dubey
- Department of Genetics and Plant Breeding, UAS Dharwad, Dharwad, Karnataka, 580005, India
| | - Prerna Dhingra
- Department of Botany, University of Rajasthan, Jaipur, 302004, India
| | - Kumar Sambhav Verma
- Institute of biotechnology, Amity University, Jaipur-Campus, Jaipur, Rajasthan, 302006, India
| | - Deepa Kumari
- Department of Botany, University of Rajasthan, Jaipur, 302004, India
| | - S L Kothari
- Institute of biotechnology, Amity University, Jaipur-Campus, Jaipur, Rajasthan, 302006, India
| | - Sumita Kachhwaha
- Department of Botany, University of Rajasthan, Jaipur, 302004, India.
| |
Collapse
|
22
|
Population Dynamics of Wide Compatibility System and Evaluation of Intersubspecific Hybrids by indica-japonica Hybridization in Rice. PLANTS 2022; 11:plants11151930. [PMID: 35893634 PMCID: PMC9332614 DOI: 10.3390/plants11151930] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/03/2022] [Revised: 05/07/2022] [Accepted: 05/08/2022] [Indexed: 11/17/2022]
Abstract
The exploitation of heterosis through intersubspecific hybridisation between indica and japonica has been a major breeding target in rice, but is marred by the cross incompatibility between the genomes. Wide compatibility (WC) is a triallelic system at the S5 locus on chromosome 6 that ensures the specificity of hybridisation within and between indica and japonica. The S5n allele that favours intercrossing is sparsely distributed in the rice gene pool and therefore warrants identification of diverse WC sources to develop superior intersubspecific hybrids. In this study, we have identified several novel WC sources through the marker-assisted screening of a large set of 950 rice genotypes. Seventeen percent of the genotypes carried S5n, which fell into two subpopulations. The WC genotypes showed wide phenotypic and genotypic variability, including both indica and japonica lines. Based on phenotypic performance, the WC varieties were grouped into three clusters. A subset of 41 WC varieties was used to develop 164 hybrids, of which WC/japonica hybrids showed relative superiority over WC/indica hybrids. The multilocation evaluation of hybrids indicated that hybrids derived from WC varieties, such as IRG137, IRG143, OYR128, and IRGC10658, were higher yielding across all the three different locations. Most of the hybrids showed the stability of performance across locations. The identified diverse set of wide compatible varieties (WCVs) can be used in the development of intersubspecific hybrids and also for parental line development in hybrid rice breeding.
Collapse
|
23
|
Liu K, Qi M, Du FK. Population and Landscape Genetics Provide Insights Into Species Conservation of Two Evergreen Oaks in Qinghai-Tibet Plateau and Adjacent Regions. FRONTIERS IN PLANT SCIENCE 2022; 13:858526. [PMID: 35665182 PMCID: PMC9161217 DOI: 10.3389/fpls.2022.858526] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 01/20/2022] [Accepted: 04/11/2022] [Indexed: 06/15/2023]
Abstract
The combination of population and landscape genetics can facilitate the understanding of conservation strategy under the changing climate. Here, we focused on the two most diverse and ecologically important evergreen oaks: Quercus aquifolioides and Quercus spinosa in Qinghai-Tibetan Plateau (QTP), which is considered as world's biodiversity hotspot. We genotyped 1,657 individuals of 106 populations at 15 nuclear microsatellite loci throughout the species distribution range. Spatial patterns of genetic diversity were identified by mapping the allelic richness (AR) and locally common alleles (LCA) according to the circular neighborhood methodology. Migration routes from QTP were detected by historical gene flow estimation. The response pattern of genetic variation to environmental gradient was assessed by the genotype-environment association (GEA) analysis. The overall genetic structure showed a high level of intra-species genetic divergence of a strong west-east pattern. The West-to-East migration route indicated the complex demographic history of two oak species. We found evidence of isolation by the environment in Q. aqu-East and Q. spi-West lineage but not in Q. aqu-West and Q. spi-East lineage. Furthermore, priority for conservation should be given to populations that retain higher spatial genetic diversity or isolated at the edge of the distribution range. Our findings indicate that knowledge of spatial diversity and migration route can provide valuable information for the conservation of existing populations. This study provides an important guide for species conservation for two oak species by the integration of population and landscape genetic methods.
Collapse
Affiliation(s)
| | | | - Fang K. Du
- School of Ecology and Nature Conservation, Beijing Forestry University, Beijing, China
| |
Collapse
|
24
|
Posterior probabilities of membership of repertoires in acoustic clades. PLoS One 2022; 17:e0267501. [PMID: 35452481 PMCID: PMC9032417 DOI: 10.1371/journal.pone.0267501] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2021] [Accepted: 04/08/2022] [Indexed: 11/19/2022] Open
Abstract
Recordings of calls may be used to assess population structure for acoustic species. This can be particularly effective if there are identity calls, produced nearly exclusively by just one population segment. The identity call method, IDcall, classifies calls into types using contaminated mixture models, and then clusters repertoires of calls into identity clades (potential population segments) using identity calls that are characteristic of the repertoires in each identity clade. We show how to calculate the Bayesian posterior probabilities that each repertoire is a member of each identity clade, and display this information as a stacked bar graph. This methodology (IDcallPP) is introduced using the output of IDcall but could easily be adapted to estimate posterior probabilities of clade membership when acoustic clades are delineated using other methods. This output is similar to that of the STRUCTURE software which uses molecular genetic data to assess population structure and has become a standard in conservation genetics. The technique introduced here should be a valuable asset to those who use acoustic data to address evolution, ecology, or conservation, and creates a methodological and conceptual bridge between geneticists and acousticians who aim to assess population structure.
Collapse
|
25
|
Saidi A, Sarvmeili J, Pouresmael M. Genetic diversity study in lentil (Lens culinaris Medik.) Germplasm: a comparison of CAAT Box Derived Polymorphism (CBDP) and simple sequence repeat (SSR) markers. Biologia (Bratisl) 2022. [DOI: 10.1007/s11756-022-01089-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
26
|
Griesemer J, Barragán CA. Re-situations of scientific knowledge: a case study of a skirmish over clusters vs clines in human population genomics. HISTORY AND PHILOSOPHY OF THE LIFE SCIENCES 2022; 44:16. [PMID: 35445860 PMCID: PMC9023434 DOI: 10.1007/s40656-022-00497-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 03/12/2022] [Indexed: 06/14/2023]
Abstract
We track and analyze the re-situation of scientific knowledge in the field of human population genomics ancestry studies. We understand re-situation as a process of accommodating the direct or indirect transfer of objects of knowledge from one site/situation to (one or many) other sites/situations. Our take on the concept borrows from Mary S. Morgan's work on facts traveling while expanding it to include other objects of knowledge such as models, data, software, findings, and visualizations. We structure a specific case study by tracking the re-situation of these objects between three research projects studying human population diversity reported in three articles in Science, Genome Research and PLoS Genetics between 2002 and 2005. We characterize these three engagements as a unit of analysis, a "skirmish," in order to compare: (a) the divergence of interests in how life-scientists answer similar research questions and (b) to track the challenging transformation of workflows in research laboratories as these scientific objects are re-situated individually or in bundles. Our analysis of the case study shows that an accurate understanding of re-situation requires tracking the whole bundle of objects in a project because they interact in particular key ways. The absence or dismissal of these interactions opens the door to unforeseen trade-offs, misunderstandings and misrepresentations about research design(s) and workflow(s) and what these say about the questions asked and the findings produced.
Collapse
Affiliation(s)
- James Griesemer
- Department of Philosophy, University of California, Davis, One Shields Avenue, Davis, CA 95616 USA
- Department of Science and Technology Studies, University of California, Davis, One Shields Avenue, Davis, CA 95616 USA
| | - Carlos Andrés Barragán
- Department of Philosophy, University of California, Davis, One Shields Avenue, Davis, CA 95616 USA
- Department of Science and Technology Studies, University of California, Davis, One Shields Avenue, Davis, CA 95616 USA
| |
Collapse
|
27
|
Ruffley M, Smith ML, Espíndola A, Turck DF, Mitchell N, Carstens B, Sullivan J, Tank DC. Genomic evidence of an ancient Inland Temperate Rainforest in the Pacific Northwest of North America. Mol Ecol 2022; 31:2985-3001. [PMID: 35322900 PMCID: PMC9322681 DOI: 10.1111/mec.16431] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2021] [Revised: 01/15/2022] [Accepted: 02/21/2022] [Indexed: 12/02/2022]
Abstract
The disjunct temperate rainforests of the Pacific Northwest of North America (PNW) are characterized by late‐successional dominant tree species Thuja plicata (western redcedar) and Tsuga heterophylla (western hemlock). The demographic histories of these species, along with the PNW rainforest ecosystem in its entirety, have been heavily impacted by geological and climatic changes the PNW has experienced over the last 5 million years, including mountain orogeny and repeated Pleistocene glaciations. These environmental events have ultimately shaped the history of these species, with inland populations potentially being extirpated during the Pleistocene glaciations. Here, we collect genomic data for both species across their ranges to test multiple demographic models, each reflecting a different phylogeographical hypothesis on how the ecosystem‐dominating species may have responded to dramatic climatic change. Our results indicate that inland and coastal populations in both species diverged ~2.5 million years ago in the early Pleistocene and experienced decreases in population size during glacial cycles, with subsequent population expansion. Importantly, we found evidence for gene flow between coastal and inland populations during the mid‐Holocene. It is likely that intermittent migration in these species during this time has prevented allopatric speciation via genetic drift alone. In conclusion, our results from combining genomic data and demographic inference procedures establish that populations of the ecosystem dominants Thuja plicata and Tsuga heterophylla persisted in refugia located in both the coastal and inland regions of the PNW throughout the Pleistocene, with populations expanding and contracting in response to glacial cycles with occasional gene flow.
Collapse
Affiliation(s)
- Megan Ruffley
- Department of Biological Sciences, University of Idaho, 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA.,Institute for Bioinformatics and Evolutionary Studies (IBEST), 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA.,Department of Plant Biology, Carnegie Institution for Science, 260 Panama St, Stanford, CA, 94305, USA
| | - Megan L Smith
- Department of Evolution, Ecology, and Organismal Biology & Museum of Biological Diversity, The Ohio State University, 1315 Kinnear Rd, Columbus, OH, 43212, USA.,Department of Biology and Department of Computer Science, Indiana University, Bloomington, IN, 47405, USA
| | - Anahí Espíndola
- Department of Entomology, University of Maryland, 4291 Fieldhouse Dr, College Park, MD, 20742, USA
| | - Daniel F Turck
- Department of Biological Sciences, University of Idaho, 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA.,Stillinger Herbarium, University of Idaho, 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA
| | - Niels Mitchell
- Department of Biological Sciences, University of Idaho, 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA
| | - Bryan Carstens
- Department of Evolution, Ecology, and Organismal Biology & Museum of Biological Diversity, The Ohio State University, 1315 Kinnear Rd, Columbus, OH, 43212, USA
| | - Jack Sullivan
- Department of Biological Sciences, University of Idaho, 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA.,Institute for Bioinformatics and Evolutionary Studies (IBEST), 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA
| | - David C Tank
- Department of Biological Sciences, University of Idaho, 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA.,Institute for Bioinformatics and Evolutionary Studies (IBEST), 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA.,Stillinger Herbarium, University of Idaho, 875 Perimeter Dr. MS 3051, Moscow, ID, 83844-3051, USA.,Department of Botany & Rocky Mountain Herbarium, University of Wyoming, 1000 E. University Ave, Laramie, WY, 82071, USA
| |
Collapse
|
28
|
Strong population genetic structure and cryptic diversity in the Florida bonneted bat (Eumops floridanus). CONSERV GENET 2022. [DOI: 10.1007/s10592-022-01432-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
29
|
Conklin JR, Verkuil YI, Battley PF, Hassell CJ, Ten Horn J, Johnson JA, Tomkovich PS, Baker AJ, Piersma T, Fontaine MC. Global flyway evolution in red knots Calidris canutus and genetic evidence for a Nearctic refugium. Mol Ecol 2022; 31:2124-2139. [PMID: 35106871 PMCID: PMC9545425 DOI: 10.1111/mec.16379] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2021] [Revised: 12/13/2021] [Accepted: 01/14/2022] [Indexed: 11/30/2022]
Abstract
Present‐day ecology and population structure are the legacies of past climate and habitat perturbations, and this is particularly true for species that are widely distributed at high latitudes. The red knot, Calidris canutus, is an arctic‐breeding, long‐distance migratory shorebird with six recognized subspecies defined by differences in morphology, migration behavior, and annual cycle phenology, in a global distribution thought to have arisen just since the last glacial maximum (LGM). We used nextRAD sequencing of 10,881 single‐nucleotide polymorphisms (SNPs) to assess the neutral genetic structure and phylogeographic history of 172 red knots representing all known global breeding populations. Using population genetics approaches, including model‐based scenario‐testing in an approximate Bayesian computation (ABC) framework, we infer that red knots derive from two main lineages that diverged ca. 34,000 years ago, and thus most probably persisted at the LGM in both Palearctic and Nearctic refugia, followed by at least two instances of secondary contact and admixture. Within two Beringian subspecies (C. c. roselaari and rogersi), we detected previously unknown genetic structure among sub‐populations sharing a migratory flyway, reflecting additional complexity in the phylogeographic history of the region. Conversely, we found very weak genetic differentiation between two Nearctic populations (rufa and islandica) with clearly divergent migratory phenotypes and little or no apparent contact throughout the annual cycle. Together, these results suggest that relative gene flow among migratory populations reflects a complex interplay of historical, geographical, and ecological factors.
Collapse
Affiliation(s)
- Jesse R Conklin
- Groningen Institute for Evolutionary Life Sciences (GELIFES), University of Groningen, P.O. Box 11103, 9700 CC, Groningen, The Netherlands
| | - Yvonne I Verkuil
- Groningen Institute for Evolutionary Life Sciences (GELIFES), University of Groningen, P.O. Box 11103, 9700 CC, Groningen, The Netherlands
| | - Phil F Battley
- Wildlife and Ecology Group, School of Agriculture and Environment, Massey University, Palmerston North, 4442, New Zealand
| | - Chris J Hassell
- Global Flyway Network, PO Box 3089, Broome, WA, 6725, Australia
| | - Job Ten Horn
- Department of Coastal Systems, NIOZ Royal Netherlands Institute for Sea Research, P.O. Box 59, 1790 AB, Den Burg, Texel, The Netherlands
| | - James A Johnson
- U.S. Fish & Wildlife Service, Migratory Bird Management, 1011 E. Tudor Road, MS 201, Anchorage, Alaska, 99503, USA
| | - Pavel S Tomkovich
- Zoological Museum, Moscow MV Lomonosov State University, Bolshaya Nikitskaya Str. 6, Moscow, 125009, Russia
| | - Allan J Baker
- Department of Natural History, Royal Ontario Museum, 100 Queens Park, Toronto, ON, M5S 2C6, Canada
| | - Theunis Piersma
- Groningen Institute for Evolutionary Life Sciences (GELIFES), University of Groningen, P.O. Box 11103, 9700 CC, Groningen, The Netherlands.,Department of Coastal Systems, NIOZ Royal Netherlands Institute for Sea Research, P.O. Box 59, 1790 AB, Den Burg, Texel, The Netherlands
| | - Michaël C Fontaine
- Groningen Institute for Evolutionary Life Sciences (GELIFES), University of Groningen, P.O. Box 11103, 9700 CC, Groningen, The Netherlands.,MIVEGEC, University of Montpellier, CNRS, IRD, Montpellier, France.,Montpellier Ecology and Evolution of Diseases Network (MEEDiN), Montpellier, France
| |
Collapse
|
30
|
Tvedebrink T. Review of the Forensic Applicability of Biostatistical Methods for Inferring Ancestry from Autosomal Genetic Markers. Genes (Basel) 2022; 13:genes13010141. [PMID: 35052480 PMCID: PMC8774801 DOI: 10.3390/genes13010141] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2021] [Revised: 01/10/2022] [Accepted: 01/11/2022] [Indexed: 02/01/2023] Open
Abstract
The inference of ancestry has become a part of the services many forensic genetic laboratories provide. Interest in ancestry may be to provide investigative leads or identify the region of origin in cases of unidentified missing persons. There exist many biostatistical methods developed for the study of population structure in the area of population genetics. However, the challenges and questions are slightly different in the context of forensic genetics, where the origin of a specific sample is of interest compared to the understanding of population histories and genealogies. In this paper, the methodologies for modelling population admixture and inferring ancestral populations are reviewed with a focus on their strengths and weaknesses in relation to ancestry inference in the forensic context.
Collapse
Affiliation(s)
- Torben Tvedebrink
- Department of Mathematical Sciences, Aalborg University, DK-9220 Aalborg, Denmark;
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, DK-1165 Copenhagen, Denmark
| |
Collapse
|
31
|
Fitak RR. OptM: estimating the optimal number of migration edges on population trees using Treemix. Biol Methods Protoc 2021; 6:bpab017. [PMID: 34595352 PMCID: PMC8476930 DOI: 10.1093/biomethods/bpab017] [Citation(s) in RCA: 103] [Impact Index Per Article: 34.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Revised: 09/09/2021] [Accepted: 09/13/2021] [Indexed: 11/25/2022] Open
Abstract
The software Treemix has become extensively used to estimate the number of migration events, or edges (m), on population trees from genome-wide allele frequency data. However, the appropriate number of edges to include remains unclear. Here, I show that an optimal value of m can be inferred from the second-order rate of change in likelihood (Δm) across incremental values of m. Repurposed from its original use to estimate the number of population clusters in the software Structure (ΔK), I show using simulated populations that Δm performs equally as well as current recommendations for Treemix. A demonstration of an empirical dataset from domestic dogs indicates that this method may be preferable in large, complex population histories and can prioritize migration events for subsequent investigation. The method has been implemented in a freely available R package called “OptM” and as a web application (https://rfitak.shinyapps.io/OptM/) to interface directly with the output files of Treemix.
Collapse
Affiliation(s)
- Robert R Fitak
- Department of Biology, Genomics and Bioinformatics Cluster, University of Central Florida, Orlando, FL 32816, USA
| |
Collapse
|
32
|
Stankiewicz KH, Vasquez Kuntz KL, Baums IB. The impact of estimator choice: Disagreement in clustering solutions across K estimators for Bayesian analysis of population genetic structure across a wide range of empirical data sets. Mol Ecol Resour 2021; 22:1135-1148. [PMID: 34597471 DOI: 10.1111/1755-0998.13522] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/28/2019] [Revised: 08/21/2021] [Accepted: 09/07/2021] [Indexed: 11/29/2022]
Abstract
The software program STRUCTURE is one of the most cited tools for determining population structure. To infer the optimal number of clusters from STRUCTURE output, the ΔK method is often applied. However, a recent study relying on simulated microsatellite data suggested that this method has a downward bias in its estimation of K and is sensitive to uneven sampling. If this finding holds for empirical data sets, conclusions about the scale of gene flow may have to be revised for a large number of studies. To determine the impact of method choice, we applied recently described estimators of K to re-estimate genetic structure in 41 empirical microsatellite data sets; 15 from a broad range of taxa and 26 from one phylogenetic group, coral. We compared alternative estimates of K (Puechmaille statistics) with traditional (ΔK and posterior probability) estimates and found widespread disagreement of estimators across data sets. Thus, one estimator alone is insufficient for determining the optimal number of clusters; this was regardless of study organism or evenness of sampling scheme. Subsequent analysis of molecular variance (AMOVA) did not necessarily clarify which clustering solution was best. To better infer population structure, we suggest a combination of visual inspection of STRUCTURE plots and calculation of the alternative estimators at various thresholds in addition to ΔK. Disagreement between traditional and recent estimators may have important biological implications, such as previously unrecognized population structure, as was the case for many studies reanalysed here.
Collapse
Affiliation(s)
- Kathryn H Stankiewicz
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, USA
| | - Kate L Vasquez Kuntz
- Department of Biology, Pennsylvania State University, University Park, Pennsylvania, USA
| | | | | |
Collapse
|
33
|
Maas M, Tatem-Dokter R, Rijks JM, Dam-Deisz C, Franssen F, van Bolhuis H, Heddergott M, Schleimer A, Schockert V, Lambinet C, Hubert P, Redelijk T, Janssen R, Cruz APL, Martinez IC, Caron Y, Linden A, Lesenfants C, Paternostre J, van der Giessen J, Frantz AC. Population genetics, invasion pathways and public health risks of the raccoon and its roundworm Baylisascaris procyonis in northwestern Europe. Transbound Emerg Dis 2021; 69:2191-2200. [PMID: 34227236 DOI: 10.1111/tbed.14218] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/18/2021] [Revised: 06/30/2021] [Accepted: 07/01/2021] [Indexed: 12/01/2022]
Abstract
The geographic range of the zoonotic raccoon roundworm (Baylisascaris procyonis) is expanding together with the range of its host, the raccoon (Procyon lotor). This creates a new public health risk in parts of Europe where this parasite was previously absent. In the Netherlands, a raccoon population is becoming established and incidental findings of B. procyonis have been reported. To assess the risk to public health, the prevalence of B. procyonis was determined in the province of Limburg, where currently the largest Dutch raccoon population is present, as well as in the adjoining region of southern Belgium. Furthermore, genetic methods were employed to assess invasion pathways of both the raccoon and B. procyonis to aid in the development of control measures. Macroscopic analysis of intestinal content and testing of faecal samples were performed to detect B. procyonis adults and eggs. The population genetics of both B. procyonis and its raccoon host were analysed using samples from central and northwestern Europe. B. procyonis was found in 14/23 (61%, 95% CI: 41%-78%) raccoons from Limburg, but was not detected in 50 Belgian raccoons. Genetic analyses showed that the majority of the Dutch raccoons and their roundworms were introduced through ex-captive individuals. As long as free-living raccoon populations originate from captivity, population control methods may be pursued. However, natural dispersal from the border regions will complicate prolonged population control. To reduce the public health risk posed by B. procyonis, public education to increase awareness and adapt behaviour towards raccoons is key.
Collapse
Affiliation(s)
- Miriam Maas
- Centre for Infectious Disease Control (CIb), National Institute for Public Health and the Environment (RIVM), Bilthoven, the Netherlands
| | - Rea Tatem-Dokter
- Centre for Infectious Disease Control (CIb), National Institute for Public Health and the Environment (RIVM), Bilthoven, the Netherlands
| | - Jolianne M Rijks
- Dutch Wildlife Health Centre (DWHC), Faculty of Veterinary Medicine, Utrecht University, Utrecht, the Netherlands
| | - Cecile Dam-Deisz
- Centre for Infectious Disease Control (CIb), National Institute for Public Health and the Environment (RIVM), Bilthoven, the Netherlands
| | - Frits Franssen
- Centre for Infectious Disease Control (CIb), National Institute for Public Health and the Environment (RIVM), Bilthoven, the Netherlands
| | | | | | - Anna Schleimer
- Musée National d'Histoire Naturelle, Luxembourg, Luxembourg
| | - Vinciane Schockert
- Unité de Recherches zoogéographiques, Département de Biologie, Ecologie et Evolution, Faculté des Sciences, Université de Liège, Liège, Belgium
| | - Clotilde Lambinet
- Unité de Recherches zoogéographiques, Département de Biologie, Ecologie et Evolution, Faculté des Sciences, Université de Liège, Liège, Belgium
| | - Pauline Hubert
- Centre de Recherche et de Formation en Éco-éthologie, Université de Reims Champagne-Ardenne, Boult-aux-Bois, France
| | | | | | | | - Irène Campos Martinez
- Unité de Recherches zoogéographiques, Département de Biologie, Ecologie et Evolution, Faculté des Sciences, Université de Liège, Liège, Belgium
| | - Yannick Caron
- Parasitologie et Pathologie des Maladies Parasitaires, Département des Maladies infectieuses et parasitaires, Faculté de Médecine Vétérinaire, Université de Liège, Liège, Belgium
| | - Annick Linden
- Service de Santé et Pathologie de la Faune sauvage, Département des Maladies infectieuses et parasitaires, Faculté de Médecine Vétérinaire, Université de Liège, Liège, Belgium
| | - Christophe Lesenfants
- Service de Santé et Pathologie de la Faune sauvage, Département des Maladies infectieuses et parasitaires, Faculté de Médecine Vétérinaire, Université de Liège, Liège, Belgium
| | - Julien Paternostre
- Service de Santé et Pathologie de la Faune sauvage, Département des Maladies infectieuses et parasitaires, Faculté de Médecine Vétérinaire, Université de Liège, Liège, Belgium
| | - Joke van der Giessen
- Centre for Infectious Disease Control (CIb), National Institute for Public Health and the Environment (RIVM), Bilthoven, the Netherlands
| | - Alain C Frantz
- Musée National d'Histoire Naturelle, Luxembourg, Luxembourg
| |
Collapse
|
34
|
Cornman RS, Fike JA, Oyler-McCance SJ, Cryan PM. Historical effective population size of North American hoary bat ( Lasiurus cinereus) and challenges to estimating trends in contemporary effective breeding population size from archived samples. PeerJ 2021; 9:e11285. [PMID: 33976981 PMCID: PMC8061578 DOI: 10.7717/peerj.11285] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2020] [Accepted: 03/25/2021] [Indexed: 11/20/2022] Open
Abstract
Background Hoary bats (Lasiurus cinereus) are among the bat species most commonly killed by wind turbine strikes in the midwestern United States. The impact of this mortality on species census size is not understood, due in part to the difficulty of estimating population size for this highly migratory and elusive species. Genetic effective population size (Ne) could provide an index of changing census population size if other factors affecting Ne are stable. Methods We used the NeEstimator package to derive effective breeding population size (Nb) estimates for two temporally spaced cohorts: 93 hoary bats collected in 2009-2010 and an additional 93 collected in 2017-2018. We sequenced restriction-site associated polymorphisms and generated a de novo genome assembly to guide the removal of sex-linked and multi-copy loci, as well as identify physically linked markers. Results Analysis of the reference genome with psmc suggested at least a doubling of Ne in the last 100,000 years, likely exceeding Ne = 10,000 in the Holocene. Allele and genotype frequency analyses confirmed that the two cohorts were comparable, although some samples had unusually high or low observed heterozygosities. Additionally, the older cohort had lower mean coverage and greater variability in coverage, and batch effects of sampling locality were observed that were consistent with sample degradation. We therefore excluded samples with low coverage or outlier heterozygosity, as well as loci with sequence coverage far from the mode value, from the final data set. Prior to excluding these outliers, contemporary Nb estimates were significantly higher in the more recent cohort, but this finding was driven by high values for the 2018 sample year and low values for all other years. In the reduced data set, Nb did not differ significantly between cohorts. We found base substitutions to be strongly biased toward cytosine to thymine or the complement, and further partitioning loci by substitution type had a strong effect on Nb estimates. Minor allele frequency and base quality bias thresholds also had strong effects on Nb estimates. Instability of Nb with respect to common data filtering parameters and empirically identified factors prevented robust comparison of the two cohorts. Given that confidence intervals frequently included infinity as the stringency of data filtering increased, contemporary trends in Nb of North American hoary bats may not be tractable with the linkage disequilibrium method, at least using the protocol employed here.
Collapse
Affiliation(s)
- Robert S Cornman
- U.S. Geological Survey, Fort Collins Science Center, Fort Collins, CO, United States of America
| | - Jennifer A Fike
- U.S. Geological Survey, Fort Collins Science Center, Fort Collins, CO, United States of America
| | - Sara J Oyler-McCance
- U.S. Geological Survey, Fort Collins Science Center, Fort Collins, CO, United States of America
| | - Paul M Cryan
- U.S. Geological Survey, Fort Collins Science Center, Fort Collins, CO, United States of America
| |
Collapse
|
35
|
Transcriptome sequencing and microsatellite marker discovery in Ailanthus altissima (Mill.) Swingle (Simaroubaceae). Mol Biol Rep 2021; 48:2007-2023. [PMID: 33730287 DOI: 10.1007/s11033-020-05402-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/22/2019] [Accepted: 03/25/2020] [Indexed: 10/21/2022]
Abstract
Ailanthus altissima Swingle, is a tree species native to East Asia and has a great potential in decorative, bioenergy and industrial applications in many countries. To date, despite its commercial importance, the genomic and genetic resources available for this species are still insufficient. In this study, we characterized the transcriptome of A. altissima and developed thirteen EST-SSRs (expressed sequence tag-simple sequence repeats) based on Illumina paired-end RNA sequencing (RNA-seq). Besides, we developed ten polymorphic chloroplast microsatellite (cpSSR) markers using the available chloroplast genome of A. altissima. The transcriptome data produced 87,797 unigenes, of which 64,891 (73.91%) unigenes were successfully annotated in at least one protein database. For cpSSR markers the number of detected alleles (N) per marker varied from three at cpSSR12 to twelve at cpSSR8, the unbiased haploid diversity indices (uh) varied from 0.111 to 0.485, and haploid diversity indices (h) ranged from 0.101 to 0.444 with an average unbiased haploid diversity index (uh) of 0.274. Overall, a total of 65 different cpSSR alleles were identified at the ten loci among 165 individuals of A. altissima. The allele number per locus for EST-SSRs varied from 2.143 to 9.357, and the values of observed and expected heterozygosity ranged from 0.312 to 1.000 and 0.505 to 0.826, respectively. The molecular markers developed in this study will facilitate future genetic diversity, population structure, long distance-gene transfer and pollen-based gene flow analyses of A. altissima populations from its known distribution ranges in China focusing on planted and natural forest stands.
Collapse
|
36
|
Abstract
Population structure is a commonplace feature of genetic variation data, and it has importance in numerous application areas, including evolutionary genetics, conservation genetics, and human genetics. Understanding the structure in a sample is necessary before more sophisticated analyses are undertaken. Here we provide a protocol for running principal component analysis (PCA) and admixture proportion inference-two of the most commonly used approaches in describing population structure. Along with hands-on examples with CEPH-Human Genome Diversity Panel and pragmatic caveats, readers will learn to analyze and visualize population structure on their own data.
Collapse
|
37
|
Carlson J, Harris K. Quantifying and contextualizing the impact of bioRxiv preprints through automated social media audience segmentation. PLoS Biol 2020; 18:e3000860. [PMID: 32960891 PMCID: PMC7508356 DOI: 10.1371/journal.pbio.3000860] [Citation(s) in RCA: 19] [Impact Index Per Article: 4.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Accepted: 08/17/2020] [Indexed: 12/27/2022] Open
Abstract
Engagement with scientific manuscripts is frequently facilitated by Twitter and other social media platforms. As such, the demographics of a paper's social media audience provide a wealth of information about how scholarly research is transmitted, consumed, and interpreted by online communities. By paying attention to public perceptions of their publications, scientists can learn whether their research is stimulating positive scholarly and public thought. They can also become aware of potentially negative patterns of interest from groups that misinterpret their work in harmful ways, either willfully or unintentionally, and devise strategies for altering their messaging to mitigate these impacts. In this study, we collected 331,696 Twitter posts referencing 1,800 highly tweeted bioRxiv preprints and leveraged topic modeling to infer the characteristics of various communities engaging with each preprint on Twitter. We agnostically learned the characteristics of these audience sectors from keywords each user's followers provide in their Twitter biographies. We estimate that 96% of the preprints analyzed are dominated by academic audiences on Twitter, suggesting that social media attention does not always correspond to greater public exposure. We further demonstrate how our audience segmentation method can quantify the level of interest from nonspecialist audience sectors such as mental health advocates, dog lovers, video game developers, vegans, bitcoin investors, conspiracy theorists, journalists, religious groups, and political constituencies. Surprisingly, we also found that 10% of the preprints analyzed have sizable (>5%) audience sectors that are associated with right-wing white nationalist communities. Although none of these preprints appear to intentionally espouse any right-wing extremist messages, cases exist in which extremist appropriation comprises more than 50% of the tweets referencing a given preprint. These results present unique opportunities for improving and contextualizing the public discourse surrounding scientific research.
Collapse
Affiliation(s)
- Jedidiah Carlson
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
| | - Kelley Harris
- Department of Genome Sciences, University of Washington, Seattle, Washington, United States of America
- Computational Biology Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| |
Collapse
|
38
|
Georges-Filteau J, Hamelin RC, Blanchette M. Mycorrhiza: genotype assignment using phylogenetic networks. Bioinformatics 2020; 36:212-220. [PMID: 31197316 DOI: 10.1093/bioinformatics/btz476] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/23/2018] [Revised: 05/03/2019] [Accepted: 06/06/2019] [Indexed: 01/09/2023] Open
Abstract
MOTIVATION The genotype assignment problem consists of predicting, from the genotype of an individual, which of a known set of populations it originated from. The problem arises in a variety of contexts, including wildlife forensics, invasive species detection and biodiversity monitoring. Existing approaches perform well under ideal conditions but are sensitive to a variety of common violations of the assumptions they rely on. RESULTS In this article, we introduce Mycorrhiza, a machine learning approach for the genotype assignment problem. Our algorithm makes use of phylogenetic networks to engineer features that encode the evolutionary relationships among samples. Those features are then used as input to a Random Forests classifier. The classification accuracy was assessed on multiple published empirical SNP, microsatellite or consensus sequence datasets with wide ranges of size, geographical distribution and population structure and on simulated datasets. It compared favorably against widely used assessment tests or mixture analysis methods such as STRUCTURE and Admixture, and against another machine-learning based approach using principal component analysis for dimensionality reduction. Mycorrhiza yields particularly significant gains on datasets with a large average fixation index (FST) or deviation from the Hardy-Weinberg equilibrium. Moreover, the phylogenetic network approach estimates mixture proportions with good accuracy. AVAILABILITY AND IMPLEMENTATION Mycorrhiza is released as an easy to use open-source python package at github.com/jgeofil/mycorrhiza. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
| | - Richard C Hamelin
- Department of Forest and Conservation Sciences, The University of British Columbia, Vancouver, BC, Canada.,Département des sciences du bois et de la forêt, Université Laval, Québec, Canada
| | | |
Collapse
|
39
|
Garcia-Erill G, Albrechtsen A. Evaluation of model fit of inferred admixture proportions. Mol Ecol Resour 2020; 20:936-949. [PMID: 32323416 DOI: 10.1111/1755-0998.13171] [Citation(s) in RCA: 30] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/05/2019] [Revised: 03/11/2020] [Accepted: 04/15/2020] [Indexed: 12/12/2022]
Abstract
Model based methods for genetic clustering of individuals, such as those implemented in structure or ADMIXTURE, allow the user to infer individual ancestries and study population structure. The underlying model makes several assumptions about the demographic history that shaped the analysed genetic data. One assumption is that all individuals are a result of K homogeneous ancestral populations that are all well represented in the data, while another assumption is that no drift happened after the admixture event. The histories of many real world populations do not conform to that model, and in that case taking the inferred admixture proportions at face value might be misleading. We propose a method to evaluate the fit of admixture models based on estimating the correlation of the residual difference between the true genotypes and the genotypes predicted by the model. When the model assumptions are not violated, the residuals from a pair of individuals are not correlated. In the case of a bad fitting admixture model, individuals with similar demographic histories have a positive correlation of their residuals. Using simulated and real data, we show how the method is able to detect a bad fit of inferred admixture proportions due to using an insufficient number of clusters K or to demographic histories that deviate significantly from the admixture model assumptions, such as admixture from ghost populations, drift after admixture events and nondiscrete ancestral populations. We have implemented the method as an open source software that can be applied to both unphased genotypes and low depth sequencing data.
Collapse
Affiliation(s)
- Genís Garcia-Erill
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| | - Anders Albrechtsen
- Section for Computational and RNA Biology, Department of Biology, University of Copenhagen, Copenhagen N, Denmark
| |
Collapse
|
40
|
Cotton JA, Durrant C, Franssen SU, Gelanew T, Hailu A, Mateus D, Sanders MJ, Berriman M, Volf P, Miles MA, Yeo M. Genomic analysis of natural intra-specific hybrids among Ethiopian isolates of Leishmania donovani. PLoS Negl Trop Dis 2020; 14:e0007143. [PMID: 32310945 PMCID: PMC7237039 DOI: 10.1371/journal.pntd.0007143] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2019] [Revised: 05/19/2020] [Accepted: 12/24/2019] [Indexed: 12/30/2022] Open
Abstract
Parasites of the genus Leishmania (Kinetoplastida: Trypanosomatidae) cause widespread and devastating human diseases. Visceral leishmaniasis due to Leishmania donovani is endemic in Ethiopia where it has also been responsible for major epidemics. The presence of hybrid genotypes has been widely reported in surveys of natural populations, genetic variation reported in a number of Leishmania species, and the extant capacity for genetic exchange demonstrated in laboratory experiments. However, patterns of recombination and the evolutionary history of admixture that produced these hybrid populations remain unclear. Here, we use whole-genome sequence data to investigate Ethiopian L. donovani isolates previously characterized as hybrids by microsatellite and multi-locus sequencing. To date there is only one previous study on a natural population of Leishmania hybrids based on whole-genome sequences. We propose that these hybrids originate from recombination between two different lineages of Ethiopian L. donovani occurring in the same region. Patterns of inheritance are more complex than previously reported with multiple, apparently independent, origins from similar parents that include backcrossing with parental types. Analysis indicates that hybrids are representative of at least three different histories. Furthermore, isolates were highly polysomic at the level of chromosomes with differences between parasites recovered from a recrudescent infection from a previously treated individual. The results demonstrate that recombination is a significant feature of natural populations and contributes to the growing body of data that shows how recombination, and gene flow, shape natural populations of Leishmania.
Collapse
Affiliation(s)
| | | | | | - Tesfaye Gelanew
- Faculty of Medicine, Addis Ababa University, Addis Ababa, Ethiopia
| | - Asrat Hailu
- Faculty of Medicine, Addis Ababa University, Addis Ababa, Ethiopia
| | - David Mateus
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | | | | | - Petr Volf
- Department of Parasitology, Faculty of Science, Charles University, Prague, Czech Republic
| | - Michael A. Miles
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| | - Matthew Yeo
- Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London, United Kingdom
| |
Collapse
|
41
|
Shahabzadeh Z, Mohammadi R, Darvishzadeh R, Jaffari M. Genetic structure and diversity analysis of tall fescue populations by EST-SSR and ISSR markers. Mol Biol Rep 2019; 47:655-669. [PMID: 31707600 DOI: 10.1007/s11033-019-05173-z] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2019] [Accepted: 10/30/2019] [Indexed: 12/14/2022]
Abstract
Tall fescue is a perennial cool-season grass with economic importance especially in temperate regions of the northern hemisphere. This study was done to assess the genetic diversity and population structure of 90 tall fescue populations and cultivars using ISSR and EST-SSR markers in order to categorize valuable populations for breeding programs and to construct the core collection of tall fescue collection in Iran. The 10 EST-SSR primer pairs amplified 92 alleles. The allele numbers varied from 4 to 13 alleles per locus with an average of 9.2 alleles, of which 84 (90.6%) were polymorphic with an average of 8.4 polymorphic bands per primer. The 39 ISSR primers totally produced 387 scorable bands, of which 335 (86.6%) were polymorphic with an average of 8.6 polymorphic bands per primer. The amplified markers by ISSR primers varied from 6 to 14 markers per primer with an average of 9.92 markers per primer. The 90 tall fescue populations using both EST-SSR and ISSR data were classified into two clusters by UPGMA method that was coincide with PCA and structure analysis results. The turf-type and forage-type populations were clearly separated. Based on the results, the Iranian populations provide a valuable and novel germplasm to employ in tall fescue varietal improvement programs for both forage and turf-type applications. This progression is an important step to introduce this collection for development of a core collection of tall fescue germplasm in Iran.
Collapse
Affiliation(s)
- Z Shahabzadeh
- Department of Plant Breeding and Biotechnology, Faculty of Agriculture, Urmia University, Urmia, Iran
| | - R Mohammadi
- Branch for Northwest & West Region, Agricultural Biotechnology Research Institute of Iran (ABRII), Agricultural Research, Education and Extension Organization (AREEO), Tabriz, Iran.
| | - R Darvishzadeh
- Department of Plant Breeding and Biotechnology, Faculty of Agriculture, Urmia University, Urmia, Iran
| | - M Jaffari
- Department of Plant Breeding and Biotechnology, Faculty of Agriculture, Urmia University, Urmia, Iran
| |
Collapse
|
42
|
Kijas JW, Gutierrez AP, Houston RD, McWilliam S, Bean TP, Soyano K, Symonds JE, King N, Lind C, Kube P. Assessment of genetic diversity and population structure in cultured Australian Pacific oysters. Anim Genet 2019; 50:686-694. [PMID: 31518019 DOI: 10.1111/age.12845] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/16/2019] [Indexed: 01/14/2023]
Abstract
The recent development of Pacific oyster (Crassostrea gigas) SNP genotyping arrays has allowed detailed characterisation of genetic diversity and population structure within and between oyster populations. It also raises the potential of harnessing genomic selection for genetic improvement in oyster breeding programmes. The aim of this study was to characterise a breeding population of Australian oysters through genotyping and analysis of 18 027 SNPs, followed by comparison with genotypes of oyster sampled from Europe and Asia. This revealed that the Australian populations had similar population diversity (HE ) to oysters from New Zealand, the British Isles, France and Japan. Population divergence was assessed using PCA of genetic distance and revealed that Australian oysters were distinct from all other populations tested. Australian Pacific oysters originate from planned introductions sourced from three Japanese populations. Approximately 95% of these introductions were from geographically, and potentially genetically, distinct populations from the Nagasaki oysters assessed in this study. Finally, in preparation for the application of genomic selection in oyster breeding programmes, the strength of LD was evaluated and subsets of loci were tested for their ability to accurately infer relationships. Weak LD was observed on average; however, SNP subsets were shown to accurately reconstitute a genomic relationship matrix constructed using all loci. This suggests that low-density SNP panels may have utility in the Australian population tested, and the findings represent an important first step towards the design and implementation of genomic approaches for applied breeding in Pacific oysters.
Collapse
Affiliation(s)
- J W Kijas
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, Brisbane, Qld, 4067, Australia
| | - A P Gutierrez
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - R D Houston
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - S McWilliam
- CSIRO Agriculture and Food, Queensland Bioscience Precinct, Brisbane, Qld, 4067, Australia
| | - T P Bean
- The Roslin Institute and Royal (Dick) School of Veterinary Studies, University of Edinburgh, Midlothian, EH25 9RG, UK
| | - K Soyano
- Institute for East China Sea Research, Nagasaki University, Nagasaki, 852-8521, Japan
| | | | - N King
- Cawthron Institute, Nelson, New Zealand
| | - C Lind
- CSIRO Agriculture and Food, Hobart, Tasmania, 7004, Australia
| | - P Kube
- CSIRO Agriculture and Food, Hobart, Tasmania, 7004, Australia
| |
Collapse
|
43
|
Joseph TA, Pe'er I. Inference of Population Structure from Time-Series Genotype Data. Am J Hum Genet 2019; 105:317-333. [PMID: 31256878 PMCID: PMC6698887 DOI: 10.1016/j.ajhg.2019.06.002] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2019] [Accepted: 06/04/2019] [Indexed: 10/26/2022] Open
Abstract
Sequencing ancient DNA can offer direct probing of population history. Yet, such data are commonly analyzed with standard tools that assume DNA samples are all contemporary. We present DyStruct, a model and inference algorithm for inferring shared ancestry from temporally sampled genotype data. DyStruct explicitly incorporates temporal dynamics by modeling individuals as mixtures of unobserved populations whose allele frequencies drift over time. We develop an efficient inference algorithm for our model using stochastic variational inference. On simulated data, we show that DyStruct outperforms the current state of the art when individuals are sampled over time. Using a dataset of 296 modern and 80 ancient samples, we demonstrate DyStruct is able to capture a well-supported admixture event of steppe ancestry into modern Europe. We further apply DyStruct to a genome-wide dataset of 2,067 modern and 262 ancient samples used to study the origin of farming in the Near East. We show that DyStruct provides new insight into population history when compared with alternate approaches, within feasible run time.
Collapse
Affiliation(s)
- Tyler A Joseph
- Department of Computer Science, Columbia University, New York, NY 10027, USA.
| | - Itsik Pe'er
- Department of Computer Science, Columbia University, New York, NY 10027, USA; Department of Systems Biology, Columbia University, New York, NY 10027, USA; Data Science Institute, Columbia University, New York, NY 10027, USA.
| |
Collapse
|
44
|
Barker BS, Cocio JE, Anderson SR, Braasch JE, Cang FA, Gillette HD, Dlugosch KM. Potential limits to the benefits of admixture during biological invasion. Mol Ecol 2018; 28:100-113. [PMID: 30485593 DOI: 10.1111/mec.14958] [Citation(s) in RCA: 15] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2017] [Revised: 11/06/2018] [Accepted: 11/09/2018] [Indexed: 12/18/2022]
Abstract
Species introductions often bring together genetically divergent source populations, resulting in genetic admixture. This geographic reshuffling of diversity has the potential to generate favourable new genetic combinations, facilitating the establishment and invasive spread of introduced populations. Observational support for the superior performance of admixed introductions has been mixed, however, and the broad importance of admixture to invasion questioned. Under most underlying mechanisms, admixture's benefits should be expected to increase with greater divergence among and lower genetic diversity within source populations, though these effects have not been quantified in invaders. We experimentally crossed source populations differing in divergence in the invasive plant Centaurea solstitialis. Crosses resulted in many positive (heterotic) interactions, but fitness benefits declined and were ultimately negative at high source divergence, with patterns suggesting cytonuclear epistasis. We explored the literature to assess whether such negative epistatic interactions might be impeding admixture at high source population divergence. Admixed introductions reported for plants came from sources with a wide range of genetic variation, but were disproportionately absent where there was high genetic divergence among native populations. We conclude that while admixture is common in species introductions and often happens under conditions expected to be beneficial to invaders, these conditions may be constrained by predictable negative genetic interactions, potentially explaining conflicting evidence for admixture's benefits to invasion.
Collapse
Affiliation(s)
- Brittany S Barker
- University of Arizona, Tucson, Arizona.,United States Geological Survey, Boise, Idaho
| | | | | | | | | | - Heather D Gillette
- University of Arizona, Tucson, Arizona.,Northern Arizona University, Flagstaff, Arizona
| | | |
Collapse
|
45
|
Feurtey A, Stukenbrock EH. Interspecific Gene Exchange as a Driver of Adaptive Evolution in Fungi. Annu Rev Microbiol 2018; 72:377-398. [DOI: 10.1146/annurev-micro-090817-062753] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Throughout evolutionary history in the kingdom Fungi, taxa have exchanged genetic information among species, as revealed in particular by analyses of genome sequences. In fungi, hybridization can occur by sexual mating or by fusion of vegetative structures giving rise to new species or leaving traces of introgression in the genome. Furthermore, gene exchange can occur by horizontal gene transfer between species and can even include organisms outside the kingdom Fungi. In several cases, interspecific gene exchange has been instrumental in rapid adaptive evolution of fungal species and has notably played a role in the emergence of new pathogens. Here we summarize mechanisms and examples of gene exchange in fungi with a particular focus on the genomic context. We emphasize the need for and potential of applying population genetic approaches to better understand the processes and the impact of interspecific gene exchange in rapid adaptive evolution and species diversification. The broad occurrence of gene exchange among fungal species challenges our species concepts in the kingdom Fungi.
Collapse
Affiliation(s)
- Alice Feurtey
- Environmental Genomics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany;,
| | - Eva H. Stukenbrock
- Environmental Genomics, Max Planck Institute for Evolutionary Biology, 24306 Plön, Germany;,
- Environmental Genomics, Christian-Albrechts University of Kiel, 24118 Kiel, Germany
| |
Collapse
|
46
|
Lawson DJ, van Dorp L, Falush D. A tutorial on how not to over-interpret STRUCTURE and ADMIXTURE bar plots. Nat Commun 2018; 9:3258. [PMID: 30108219 PMCID: PMC6092366 DOI: 10.1038/s41467-018-05257-7] [Citation(s) in RCA: 239] [Impact Index Per Article: 39.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2017] [Accepted: 06/21/2018] [Indexed: 01/05/2023] Open
Abstract
Genetic clustering algorithms, implemented in programs such as STRUCTURE and ADMIXTURE, have been used extensively in the characterisation of individuals and populations based on genetic data. A successful example is the reconstruction of the genetic history of African Americans as a product of recent admixture between highly differentiated populations. Histories can also be reconstructed using the same procedure for groups that do not have admixture in their recent history, where recent genetic drift is strong or that deviate in other ways from the underlying inference model. Unfortunately, such histories can be misleading. We have implemented an approach, badMIXTURE, to assess the goodness of fit of the model using the ancestry "palettes" estimated by CHROMOPAINTER and apply it to both simulated data and real case studies. Combining these complementary analyses with additional methods that are designed to test specific hypotheses allows a richer and more robust analysis of recent demographic history.
Collapse
Affiliation(s)
- Daniel J Lawson
- University of Bristol, Integrative Epidemiology Unit, Population Health Sciences, Bristol, BS8 1TH, UK
| | - Lucy van Dorp
- University College London Genetics Institute (UGI), University College London, London, WC1E 6BT, UK
- Centre for Mathematics and Physics in the Life Sciences and Experimental Biology (CoMPLEX), University College London, London, WC1E 6BT, UK
| | - Daniel Falush
- Milner Centre for Evolution, University of Bath, Bath, BA2 7AY, UK.
| |
Collapse
|
47
|
Barbosa S, Mestre F, White TA, Paupério J, Alves PC, Searle JB. Integrative approaches to guide conservation decisions: Using genomics to define conservation units and functional corridors. Mol Ecol 2018; 27:3452-3465. [PMID: 30030869 DOI: 10.1111/mec.14806] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2017] [Revised: 07/01/2018] [Accepted: 07/05/2018] [Indexed: 01/13/2023]
Abstract
Climate change and increasing habitat loss greatly impact species survival, requiring range shifts, phenotypic plasticity and/or evolutionary change for long-term persistence, which may not readily occur unaided in threatened species. Therefore, defining conservation actions requires a detailed assessment of evolutionary factors. Existing genetic diversity needs to be thoroughly evaluated and spatially mapped to define conservation units (CUs) in an evolutionary context, and we address that here. We also propose a multidisciplinary approach to determine corridors and functional connectivity between CUs by including genetic diversity in the modelling while controlling for isolation by distance and phylogeographic history. We evaluate our approach on a Near Threatened Iberian endemic rodent by analysing genotyping-by-sequencing (GBS) genomic data from 107 Cabrera voles (Microtus cabrerae), screening the entire species distribution to define categories of CUs and their connectivity: We defined six management units (MUs) which can be grouped into four evolutionarily significant units (ESUs) and three (putatively) adaptive units (AUs). We demonstrate that the three different categories of CU can be objectively defined using genomic data, and their characteristics and connectivity can inform conservation decision-making. In particular, we show that connectivity of the Cabrera vole is very limited in eastern Iberia and that the pre-Pyrenean and part of the Betic geographic nuclei contribute the most to the species genetic diversity. We argue that a multidisciplinary framework for CU definition is essential and that this framework needs a strong evolutionary basis.
Collapse
Affiliation(s)
- Soraia Barbosa
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto/InBIO Laboratório Associado, Vairão, Portugal.,Departamento de Biologia, Faculdade de Ciências da Universidade do Porto, Porto, Portugal.,Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York
| | - Frederico Mestre
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade de Évora/InBIO Laboratório Associado, Évora, Portugal
| | - Thomas A White
- Lancaster Environment Centre, Lancaster University, Lancaster, UK
| | - Joana Paupério
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto/InBIO Laboratório Associado, Vairão, Portugal
| | - Paulo C Alves
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto/InBIO Laboratório Associado, Vairão, Portugal.,Departamento de Biologia, Faculdade de Ciências da Universidade do Porto, Porto, Portugal
| | - Jeremy B Searle
- CIBIO-InBIO, Centro de Investigação em Biodiversidade e Recursos Genéticos, Universidade do Porto/InBIO Laboratório Associado, Vairão, Portugal.,Departamento de Biologia, Faculdade de Ciências da Universidade do Porto, Porto, Portugal.,Department of Ecology and Evolutionary Biology, Cornell University, Ithaca, New York
| |
Collapse
|
48
|
|
49
|
Pan X, Wang Y, Wong EHM, Telenti A, Venter JC, Jin L. Fine population structure analysis method for genomes of many. Sci Rep 2017; 7:12608. [PMID: 28974706 PMCID: PMC5626719 DOI: 10.1038/s41598-017-12319-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2017] [Accepted: 09/01/2017] [Indexed: 12/22/2022] Open
Abstract
Fine population structure can be examined through the clustering of individuals into subpopulations. The clustering of individuals in large sequence datasets into subpopulations makes the calculation of subpopulation specific allele frequency possible, which may shed light on selection of candidate variants for rare diseases. However, as the magnitude of the data increases, computational burden becomes a challenge in fine population structure analysis. To address this issue, we propose fine population structure analysis (FIPSA), which is an individual-based non-parametric method for dissecting fine population structure. FIPSA maximizes the likelihood ratio of the contingency table of the allele counts multiplied by the group. We demonstrated that its speed and accuracy were superior to existing non-parametric methods when the simulated sample size was up to 5,000 individuals. When applied to real data, the method showed high resolution on the Human Genome Diversity Project (HGDP) East Asian dataset. FIPSA was independently validated on 11,257 human genomes. The group assignment given by FIPSA was 99.1% similar to those assigned based on supervised learning. Thus, FIPSA provides high resolution and is compatible with a real dataset of more than ten thousand individuals.
Collapse
Affiliation(s)
- Xuedong Pan
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | - Yi Wang
- Ministry of Education Key Laboratory of Contemporary Anthropology, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China
| | | | | | | | - Li Jin
- State Key Laboratory of Genetic Engineering, Collaborative Innovation Center for Genetics and Development, School of Life Sciences, Fudan University, Shanghai, China.
| |
Collapse
|
50
|
Janes JK, Miller JM, Dupuis JR, Malenfant RM, Gorrell JC, Cullingham CI, Andrew RL. The
K
= 2 conundrum. Mol Ecol 2017; 26:3594-3602. [DOI: 10.1111/mec.14187] [Citation(s) in RCA: 355] [Impact Index Per Article: 50.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2016] [Revised: 03/22/2017] [Accepted: 05/03/2017] [Indexed: 12/29/2022]
Affiliation(s)
- Jasmine K. Janes
- School of Environmental and Rural Sciences The University of New England Armidale NSW Australia
- Biology Department Vancouver Island University Nanaimo BC Canada
| | - Joshua M. Miller
- Department of Ecology and Evolutionary Biology Yale University New Haven CT USA
| | - Julian R. Dupuis
- Department of Plant and Environmental Protection Sciences University of Hawai'i at Mãnoa Honolulu HI USA
| | - René M. Malenfant
- Department of Biology University of New Brunswick Fredericton NB Canada
| | - Jamieson C. Gorrell
- School of Environmental and Rural Sciences The University of New England Armidale NSW Australia
- Biology Department Vancouver Island University Nanaimo BC Canada
| | | | - Rose L. Andrew
- School of Environmental and Rural Sciences The University of New England Armidale NSW Australia
| |
Collapse
|