1
|
You H, Lee SD, Cho S. A machine learning approach for estimating Eastern Asian origins from massive screening of Y chromosomal short tandem repeats polymorphisms. Int J Legal Med 2025; 139:531-540. [PMID: 39775035 PMCID: PMC11850560 DOI: 10.1007/s00414-024-03406-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Accepted: 12/26/2024] [Indexed: 01/11/2025]
Abstract
Inferring the ancestral origin of DNA evidence recovered from crime scenes is crucial in forensic investigations, especially in the absence of a direct suspect match. Ancestry informative markers (AIMs) have been widely researched and commercially developed into panels targeting multiple continental regions. However, existing forensic ancestry inference panels typically group East Asian individuals into a homogenous category without further differentiation. In this study, we screened Y chromosomal short tandem repeat (Y-STR) haplotypes from 10,154 Asian individuals to explore their genetic structure and generate an ancestry inference tool through a machine learning (ML) approach. Our research identified distinct genetic separations between East Asians and their neighboring Southwest Asians, with tendencies of northern and southern differentiation observed within East Asian populations. All machine learning models developed in this study demonstrated high accuracy, with the Asian classification model achieving an optimal performance of 82.92% and the East Asian classification model reaching 84.98% accuracy. This work not only deepens the understanding of genetic substructures within Asian populations but also showcases the potential of ML in forensic ancestry inference using extensive Y-STR data. By employing computational methods to analyze intricate genetic datasets, we can enhance the resolution of ancestry in forensic contexts involving Asian populations.
Collapse
Affiliation(s)
- Haeun You
- Department of Forensic Medicine, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
| | - Soong Deok Lee
- Department of Forensic Medicine, Seoul National University College of Medicine, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
- Institute of Forensic and Anthropological Science, Seoul National University Medical Research Center, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
| | - Sohee Cho
- Institute of Forensic and Anthropological Science, Seoul National University Medical Research Center, 103 Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea.
| |
Collapse
|
2
|
Pedroza Matute S, Iyavoo S. Applications and Performance of Precision ID GlobalFiler NGS STR, Identity, and Ancestry Panels in Forensic Genetics. Genes (Basel) 2024; 15:1133. [PMID: 39336724 PMCID: PMC11431077 DOI: 10.3390/genes15091133] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/07/2024] [Revised: 08/19/2024] [Accepted: 08/23/2024] [Indexed: 09/30/2024] Open
Abstract
Short Tandem Repeat (STR) testing via capillary electrophoresis is undoubtedly the most popular forensic genetic testing method. However, its low multiplexing capabilities and limited performance with challenging samples are among the factors pushing scientists towards new technologies. Next-generation sequencing (NGS) methods overcome some of these limitations while also enabling the testing of Single-Nucleotide Polymorphisms (SNPs). Nonetheless, these methods are still under optimization, and their adoption into practice is limited. Among the available kits, Thermo Fisher Scientific (Waltham, MA, USA) produces three Precision ID Panels: GlobalFiler NGS STR, Identity, and Ancestry. A clear review of these kits, providing information useful for the promotion of their use, is, however, lacking. To close the gap, a literature review was performed to investigate the popularity, applications, and performance of these kits. Following the PRISMA guidelines, 89 publications produced since 2015 were identified. China was the most active country in the field, and the Identity Panel was the most researched. All kits appeared robust and useful for low-quality and low-quantity samples, while performance with mixtures varied. The need for more population data was highlighted, as well as further research surrounding variables affecting the quality of the sequencing results.
Collapse
Affiliation(s)
- Sharlize Pedroza Matute
- School of Natural Sciences, University of Lincoln, Brayford Pool, Lincoln LN6 7TS, UK
- AttoGroup Limited, Scottow Enterprise Park, Badersfield, Norwich NR10 5FB, UK
| | - Sasitaran Iyavoo
- School of Natural Sciences, University of Lincoln, Brayford Pool, Lincoln LN6 7TS, UK
- AttoGroup Limited, Scottow Enterprise Park, Badersfield, Norwich NR10 5FB, UK
| |
Collapse
|
3
|
Kasai F, Fukushima M, Miyagi Y, Nakamura Y. Genetic diversity among the present Japanese population: evidence from genotyping of human cell lines established in Japan. Hum Cell 2024; 37:944-950. [PMID: 38639832 PMCID: PMC11194210 DOI: 10.1007/s13577-024-01055-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/02/2023] [Accepted: 03/12/2024] [Indexed: 04/20/2024]
Abstract
Japan is often assumed to have a highly homogeneous ethnic population, because it is an island country. This is evident in human cell lines collected from cell banks; however, these genotypes have not been thoroughly characterized. To examine the population genotypes of human cell lines established in Japan, we conducted SNP genotyping on 57 noncancerous cell lines and 43 lung cancer cell lines. Analysis of biogeographic ancestry revealed that 58 cell lines had non-admixed Japanese genotypes, 21 cell lines had an admixture of Japanese and East Asian genotypes, and the remaining 21 cell lines had East Asian genotypes. The proportion of non-admixed Japanese genotypes was similar between lung cancer and noncancerous cell lines, suggesting that patients in Japan may not exclusively have Japanese genotypes. This could influence the incidence of inherited diseases and should be taken into account in personalized medicine tailored to genetic background. The genetic makeup of the present-day Japanese population cannot be fully explained by the ancestral Jomon and Yayoi lineages. Instead, it is necessary to consider a certain level of genetic admixture between Japanese and neighboring Asian populations. Our study revealed genetic variation among human cell lines derived from Japanese individuals, reflecting the diversity present within the Japanese population.
Collapse
Affiliation(s)
- Fumio Kasai
- Cell Engineering Division, BioResource Research Center, RIKEN Cell Bank, Tsukuba, Japan.
| | - Makoto Fukushima
- Cell Engineering Division, BioResource Research Center, RIKEN Cell Bank, Tsukuba, Japan
| | - Yohei Miyagi
- Molecular Pathology and Genetics Division, Kanagawa Cancer Center Research Institute, Yokohama, Japan
| | - Yukio Nakamura
- Cell Engineering Division, BioResource Research Center, RIKEN Cell Bank, Tsukuba, Japan
| |
Collapse
|
4
|
Wen Y, Liu J, Su Y, Chen X, Hou Y, Liao L, Wang Z. Forensic biogeographical ancestry inference: recent insights and current trends. Genes Genomics 2023; 45:1229-1238. [PMID: 37081293 DOI: 10.1007/s13258-023-01387-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 04/01/2023] [Indexed: 04/22/2023]
Abstract
BACKGROUND As a powerful complement to the paradigmatic DNA profiling strategy, biogeographical ancestry inference (BGAI) plays a significant part in human forensic investigation especially when a database hit or eyewitness testimony are not available. It indicates one's biogeographical profile based on known population-specific genetic variations, and thus is crucial for guiding authority investigations to find unknown individuals. Forensic biogeographical ancestry testing exploits much of the recent advances in the understanding of human genomic variation and improving of molecular biology. OBJECTIVE In this review, recent development of prospective ancestry informative markers (AIMs) and the statistical approaches of inferring biogeographic ancestry from AIMs are elucidated and discussed. METHODS We highlight the research progress of three potential AIMs (i.e., single nucleotide polymorphisms, microhaplotypes, and Y or mtDNA uniparental markers) and discuss the prospects and challenges of two methods that are commonly used in BGAI. CONCLUSION While BGAI for forensic purposes has been thriving in recent years, important challenges, such as ethics and responsibilities, data completeness, and ununified standards for evaluation, remain for the use of biogeographical ancestry information in human forensic investigations. To address these issues and fully realize the value of BGAI in forensic investigation, efforts should be made not only by labs/institutions around the world independently, but also by inter-lab/institution collaborations.
Collapse
Affiliation(s)
- Yufeng Wen
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
- School of Life Sciences, Jilin University, Changchun, 130012, China
| | - Jing Liu
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yonglin Su
- Department of Rehabilitation Medicine, West China Hospital Sichuan University, Chengdu, 610041, China
| | - Xiacan Chen
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yiping Hou
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Linchuan Liao
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| | - Zheng Wang
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China.
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| |
Collapse
|
5
|
Felkl AB, Avila E, Gastaldo AZ, Lindholz CG, Dorn M, Alho CS. Ancestry resolution of South Brazilians by forensic 165 ancestry-informative SNPs panel. Forensic Sci Int Genet 2023; 64:102838. [PMID: 36736201 DOI: 10.1016/j.fsigen.2023.102838] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2022] [Revised: 01/15/2023] [Accepted: 01/22/2023] [Indexed: 01/24/2023]
Abstract
Forensic DNA phenotyping (FDP) includes biogeographic ancestry (BGA) inference and externally visible characteristics (EVCs) prediction directly from an evidential DNA sample as alternatives to provide valuable intelligence when conventional DNA profiling fails to achieve identification. In this context, the application of Massively Parallel Sequencing (MPS) methodologies, which enables simultaneous typing of multiple samples and hundreds of forensic markers, has been gradually implemented in forensic genetic casework. The Precision ID Ancestry Panel (Thermo Fisher Scientific, Waltham, USA) is a forensic multiplex assay consisting of 165 autosomal SNPs designed to provide biogeographic ancestry information. In this work, a sample of 250 individuals from Rio Grande do Sul (RS) State, southern Brazil, apportioned into four main population groups (African-, European-, Amerindian-, and Admixed-derived Gauchos), was evaluated with this panel, to assess the feasibility of this approach in a highly heterogeneous population. Forensic descriptive parameters estimated for each population group revealed that this panel has enough polymorphic and informative SNPs to be used as a supplementary instrument in forensic individual identification and kinship testing regardless of ethnicity. No statistically significant deviation from Hardy-Weinberg equilibrium was observed after Bonferroni correction. However, seven loci pairs displayed linkage disequilibrium in pairwise LD testing (p < 3.70 × 10-6). Interpopulation comparisons by FST analysis, MDS plot, and STRUCTURE analysis among the four RS population groups apart and along with 89 reference worldwide populations demonstrated that Admixed- and African-derived Gauchos present the highest levels of admixture and population stratification, whereas European- and Amerindian-derived exhibit a more homogeneous genetic conformation.
Collapse
Affiliation(s)
- Aline Brugnera Felkl
- Forensic Genetics Laboratory, School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil; National Institute of Science and Technology - Forensic Science, Porto Alegre, RS, Brazil.
| | - Eduardo Avila
- Forensic Genetics Laboratory, School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil; Technical Scientific Section, Federal Police Department in Rio Grande do Sul State, Porto Alegre, RS, Brazil; National Institute of Science and Technology - Forensic Science, Porto Alegre, RS, Brazil
| | - André Zoratto Gastaldo
- Forensic Genetics Laboratory, School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil; National Institute of Science and Technology - Forensic Science, Porto Alegre, RS, Brazil
| | - Catieli Gobetti Lindholz
- Forensic Genetics Laboratory, School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Márcio Dorn
- Forensic Genetics Laboratory, School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil; National Institute of Science and Technology - Forensic Science, Porto Alegre, RS, Brazil; Institute of Informatics, Federal University of Rio Grande do Sul, Porto Alegre, RS, Brazil
| | - Clarice Sampaio Alho
- Forensic Genetics Laboratory, School of Health and Life Sciences, Pontifical Catholic University of Rio Grande do Sul, Porto Alegre, RS, Brazil; National Institute of Science and Technology - Forensic Science, Porto Alegre, RS, Brazil
| |
Collapse
|
6
|
Carratto TMT, Moraes VMS, Recalde TSF, Oliveira MLGD, Teixeira Mendes-Junior C. Applications of massively parallel sequencing in forensic genetics. Genet Mol Biol 2022; 45:e20220077. [PMID: 36121926 PMCID: PMC9514793 DOI: 10.1590/1678-4685-gmb-2022-0077] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 07/15/2022] [Indexed: 11/22/2022] Open
Abstract
Massively parallel sequencing, also referred to as next-generation sequencing, has positively changed DNA analysis, allowing further advances in genetics. Its capability of dealing with low quantity/damaged samples makes it an interesting instrument for forensics. The main advantage of MPS is the possibility of analyzing simultaneously thousands of genetic markers, generating high-resolution data. Its detailed sequence information allowed the discovery of variations in core forensic short tandem repeat loci, as well as the identification of previous unknown polymorphisms. Furthermore, different types of markers can be sequenced in a single run, enabling the emergence of DIP-STRs, SNP-STR haplotypes, and microhaplotypes, which can be very useful in mixture deconvolution cases. In addition, the multiplex analysis of different single nucleotide polymorphisms can provide valuable information about identity, biogeographic ancestry, paternity, or phenotype. DNA methylation patterns, mitochondrial DNA, mRNA, and microRNA profiling can also be analyzed for different purposes, such as age inference, maternal lineage analysis, body-fluid identification, and monozygotic twin discrimination. MPS technology also empowers the study of metagenomics, which analyzes genetic material from a microbial community to obtain information about individual identification, post-mortem interval estimation, geolocation inference, and substrate analysis. This review aims to discuss the main applications of MPS in forensic genetics.
Collapse
Affiliation(s)
- Thássia Mayra Telles Carratto
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| | - Vitor Matheus Soares Moraes
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| | | | | | - Celso Teixeira Mendes-Junior
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| |
Collapse
|
7
|
Truelsen D, Tvedebrink T, Mogensen HS, Farzad MS, Shan MA, Morling N, Pereira V, Børsting C. Assessment of the effectiveness of the EUROFORGEN NAME and Precision ID Ancestry panel markers for ancestry investigations. Sci Rep 2021; 11:18595. [PMID: 34545122 PMCID: PMC8452675 DOI: 10.1038/s41598-021-97654-0] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 08/11/2021] [Indexed: 11/08/2022] Open
Abstract
The EUROFORGEN NAME panel is a regional ancestry panel designed to differentiate individuals from the Middle East, North Africa, and Europe. The first version of the panel was developed for the MassARRAY system and included 111 SNPs. Here, a custom AmpliSeq EUROFORGEN NAME panel with 102 of the original 111 loci was used to sequence 1098 individuals from 14 populations from Europe, the Middle East, North Africa, North-East Africa, and South-Central Asia. These samples were also sequenced with a global ancestry panel, the Precision ID Ancestry Panel. The GenoGeographer software was used to assign the AIM profiles to reference populations and calculate the weight of the evidence as likelihood ratios. The combination of the EUROFORGEN NAME and Precision ID Ancestry panels led to fewer ambiguous assignments, especially for individuals from the Middle East and South-Central Asia. The likelihood ratios showed that North African individuals could be separated from European and Middle Eastern individuals using the Precision ID Ancestry Panel. The separation improved with the addition of the EUROFORGEN NAME panel. The analyses also showed that the separation of Middle Eastern populations from European and South-Central Asian populations was challenging even when both panels were applied.
Collapse
Affiliation(s)
- D Truelsen
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark.
| | - T Tvedebrink
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
- Department of Mathematical Sciences, Aalborg University, 9220, Aalborg, Denmark
| | - H S Mogensen
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
| | - M S Farzad
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
| | - M A Shan
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
| | - N Morling
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
- Department of Mathematical Sciences, Aalborg University, 9220, Aalborg, Denmark
| | - V Pereira
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
| | - C Børsting
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, 2100, Copenhagen, Denmark
| |
Collapse
|
8
|
Pereira V, Santangelo R, Børsting C, Tvedebrink T, Almeida APF, Carvalho EF, Morling N, Gusmão L. Evaluation of the Precision of Ancestry Inferences in South American Admixed Populations. Front Genet 2020; 11:966. [PMID: 32973885 PMCID: PMC7472784 DOI: 10.3389/fgene.2020.00966] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2020] [Accepted: 07/31/2020] [Indexed: 11/13/2022] Open
Abstract
Ancestry informative markers (AIMs) are used in forensic genetics to infer biogeographical ancestry (BGA) of individuals and may also have a prominent role in future police and identification investigations. In the last few years, many studies have been published reporting new AIM sets. These sets include markers (usually around 100 or less) selected with different purposes and different population resolutions. Regardless of the ability of these sets to separate populations from different continents or regions, the uncertainty associated with the estimates provided by these panels and their capacity to accurately report the different ancestral contributions in individuals of admixed populations has rarely been investigated. This issue is addressed in this study by evaluating different AIM sets. Ancestry inference was carried out in admixed South American populations, both at population and individual levels. The results of ancestry inferences using AIM sets with different numbers of markers among admixed reference populations were compared. To evaluate the performance of the different ancestry panels at the individual level, expected and observed estimates among families and their offspring were compared, considering that (1) the apportionment of ancestry in the offspring should be closer to the average ancestry of the parents, and (2) full siblings should present similar ancestry values. The results obtained illustrate the importance of having a good balance/compromise between not only the number of markers and their ability to differentiate ancestral populations, but also a balanced differentiation among reference groups, to obtain more precise values of genetic ancestry. This work also highlights the importance of estimating errors associated with the use of a limited number of markers. We demonstrate that although these errors have a moderate effect at the population level, they may have an important impact at the individual level. Considering that many AIM-sets are being described for inferences at the individual level and not at the population level, e.g., in association studies or the determination of a suspect's BGA, the results of this work point to the need of a more careful evaluation of the uncertainty associated with the ancestry estimates in admixed populations, when small AIM-sets are used.
Collapse
Affiliation(s)
- Vania Pereira
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Roberta Santangelo
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Claus Børsting
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Torben Tvedebrink
- Department of Mathematical Sciences, Aalborg University, Aalborg, Denmark
| | - Ana Paula F Almeida
- DNA Diagnostic Laboratory, State University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Elizeu F Carvalho
- DNA Diagnostic Laboratory, State University of Rio de Janeiro, Rio de Janeiro, Brazil
| | - Niels Morling
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Leonor Gusmão
- DNA Diagnostic Laboratory, State University of Rio de Janeiro, Rio de Janeiro, Brazil.,Instituto de Investigação e Inovação em Saúde, i3S, Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
| |
Collapse
|
9
|
Wendt FR, Novroski NM. Identity informative SNP associations in the UK Biobank. Forensic Sci Int Genet 2019; 42:45-48. [PMID: 31226582 DOI: 10.1016/j.fsigen.2019.06.007] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2019] [Revised: 06/01/2019] [Accepted: 06/13/2019] [Indexed: 10/26/2022]
Abstract
Single nucleotide polymorphisms (SNPs) are amenable to genotyping DNA from degraded, inhibited, and/or ancient substrates due to their relatively small amplicon size. Though they have clear advantages over traditional short tandem repeat (STR) typing for specific casework scenarios, the advances in massively parallel sequencing (MPS) have drastically increased the utility of this marker type. The biallelic nature of SNPs makes them individually less informative than STRs due to limited heterozygosity; however, in sufficiently large multiplexes, identity informative SNPs (iiSNPs) may produce combined random match probabilities comparable to STR typing. Multiple MPS library preparation kits now include iiSNPs and similar to STRs, these loci have been rigorously characterized during multiplex development. The relative accessibility of genome-wide association study (GWAS) summary statistics enables re-investigation of forensically relevant targets in high-quality datasets. Here, 4085 GWASs from the UK Biobank European datasets (UKB; 787 ≤ N ≤ 361,194) were mined for iiSNPs typed by the ForenSeq DNA Signature Prep Kit (Verogen). Seven iiSNPs had genome-wide association (p ≤ 5 × 10-8) with 17 phenotypes in UKB Europeans. Most notably, these relationships involve two outwardly visible characteristics: standing height (rs907100; β = 0.011, p = 1.35 × 10-10) and hair/balding patterns (rs2399332; β = -0.009, p = 3.83 × 10-8). The remaining associations involve red blood cell characteristics and measures of lung function. Though these traits are highly polygenic and the individual SNP effects described here have been refuted empirically, we describe the importance and ease of exploring high-quality, freely accessible data to continuously and robustly characterize new and existing forensically relevant loci.
Collapse
Affiliation(s)
- Frank R Wendt
- Department of Psychiatry, Yale School of Medicine and VA CT Healthcare Center, West Haven, CT 06516, USA.
| | - Nicole Mm Novroski
- Forensic Science Program, Department of Anthropology, University of Toronto Mississauga, Mississauga, ON L5L 1C6, Canada
| |
Collapse
|