1
|
Ralf A, Zandstra D, van Wersch B, Köksal Z, Larmuseau MHD, Rosa A, Jobling MA, D'Amato ME, Courts C, Gysi M, Haas C, Flores R, Neis M, Wetton JH, Kiesler K, Ameur A, Azonbakin S, Bôžiková A, Choma A, De Ungria MC, Corradini B, Cruz C, Dunkelmann B, Ferri G, Fleckhaus J, Fragou D, Gaens N, Gonçalves R, Havaš Auguštin D, Helm K, Hölzl-Müller P, Kaliszan M, Kasu M, Kovatsi L, Lesaoana M, Mizuno N, Neuhuber F, Nováčková J, Ňuňuková A, Pamjav H, Parson W, Ramankulov Y, Rangel Villalobos H, Rębała K, Rootsi S, Salvador J, Šarac J, Steffen CR, Stenzl V, Török T, Villems R, Watahiki H, Zhabagin M, Schneider PM, Kayser M. UYSD: a novel data repository accessible via public website for worldwide population frequencies of Y-SNP haplogroups. Eur J Hum Genet 2025:10.1038/s41431-025-01854-5. [PMID: 40341774 DOI: 10.1038/s41431-025-01854-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2024] [Revised: 03/31/2025] [Accepted: 04/17/2025] [Indexed: 05/11/2025] Open
Abstract
For decades, there has been scientific interest in the variation and geographic distribution of paternal lineages associated with the human Y chromosome. However, the relevant data have been dispersed across numerous publications, making it difficult to consolidate. Additionally, understanding the relationships between different variants, and the tools used to analyze them, have evolved over time, further complicating efforts to harmonize this information. The Universal Y-SNP Database (UYSD) marks a substantial advancement by providing a comprehensive and accessible platform for Y-SNP and haplogroup data from populations around the world. UYSD harmonizes diverse datasets into a unified repository, facilitating the exploration of global Y-chromosomal variation. The platform handles data generated with both high- and low-throughput technology and is compatible with the automated analysis software tool, Yleaf v3. Key functionalities include the ability to: i) visualize haplogroup distributions on an interactive world map, ii) estimate haplogroup frequencies in geographic regions with sparse data through interpolation, and iii) display detailed phylogenetic trees of Y-chromosomal haplogroups. Currently, UYSD encompasses data from over 6,600 males across 27 populations. This dataset largely aligns with known global Y-haplogroup patterns, but also reveals unexplored finer-scale geographic variations. While the present dataset is largely European-centered, UYSD is designed for ongoing expansion by the scientific community, aiming to include more global data and higher-resolution population sequencing data. The platform thus offers valuable insights into human genetic diversity and migration patterns, serving several fields of research such as: human population genetics, genetic anthropology, ancient DNA analysis and forensic genetics.
Collapse
Affiliation(s)
- Arwin Ralf
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
- Department of Pathology and Clinical Bioinformatics, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| | - Dion Zandstra
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Bram van Wersch
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Zehra Köksal
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Maarten H D Larmuseau
- Laboratory of Human Genetic Genealogy, Department Human Genetics, KU Leuven, Leuven, Belgium
| | | | | | | | - Cornelius Courts
- University Hospital of Cologne, Institute of Legal Medicine, Cologne, Germany
| | - Mario Gysi
- University of Zurich, Zurich, Switzerland
| | | | | | - Maximilian Neis
- University Hospital of Cologne, Institute of Legal Medicine, Cologne, Germany
| | - Jon H Wetton
- University of Leicester, Leicester, United Kingdom
| | - Kevin Kiesler
- National Institute of Standards and Technology, Gaithersburg, MD, USA
| | - Adam Ameur
- Department of Immunology, Genetics and Pathology, Uppsala University, Uppsala, Sweden
- SciLifeLab, Uppsala University, Uppsala, Sweden
| | | | | | - Andrej Choma
- Institute of Forensic Science, Bratislava, Slovak Republic
| | | | - Beatrice Corradini
- Institute of Legal Medicine, University of Modena and Reggio Emilia, Modena, Italy
| | | | | | - Gianmarco Ferri
- Institute of Legal Medicine, University of Modena and Reggio Emilia, Modena, Italy
| | - Jan Fleckhaus
- Ludwig Maximilian University, Institute of Legal Medicine, Munich, Germany
| | - Domniki Fragou
- Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Noah Gaens
- Laboratory of Human Genetic Genealogy, Department Human Genetics, KU Leuven, Leuven, Belgium
| | | | | | | | - Petra Hölzl-Müller
- Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
| | | | - Mohaimin Kasu
- University of the Western Cape, Western Cape, South Africa
| | - Leda Kovatsi
- Aristotle University of Thessaloniki, Thessaloniki, Greece
| | - Mpasi Lesaoana
- University of the Western Cape, Western Cape, South Africa
- Lesotho Mounted Police, Maseru, Lesotho
| | - Natsuko Mizuno
- National Research Institute of Police Science, Kashiwa, Japan
| | | | | | - Alena Ňuňuková
- Institute of Forensic Science, Bratislava, Slovak Republic
| | - Horolma Pamjav
- Hungarian Institute for Forensic Sciences, Institute of Forensic Genetics, Budapest, Hungary
| | - Walther Parson
- Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria
- Forensic Science Program, The Pennsylvania State University, University Park, PA, USA
| | | | | | | | | | | | - Jelena Šarac
- Institute for Anthropological Research, Zagreb, Croatia
| | - Carolyn R Steffen
- National Institute of Standards and Technology, Gaithersburg, MD, USA
| | | | | | | | | | - Maxat Zhabagin
- Nazarbayev University, Astana, Kazakhstan
- National Center for Biotechnology, Astana, Kazakhstan
| | - Peter M Schneider
- University Hospital of Cologne, Institute of Legal Medicine, Cologne, Germany
| | - Manfred Kayser
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
- Department of Pathology and Clinical Bioinformatics, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| |
Collapse
|
2
|
Terrado-Ortuño N, May P. Forensic DNA phenotyping: a review on SNP panels, genotyping techniques, and prediction models. Forensic Sci Res 2025; 10:owae013. [PMID: 39990695 PMCID: PMC11843099 DOI: 10.1093/fsr/owae013] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 10/16/2023] [Indexed: 02/25/2025] Open
Abstract
In the past few years, forensic DNA phenotyping has attracted a strong interest in the forensic research. Among the increasing publications, many have focused on testing the available panels to infer biogeographical ancestry on less represented populations and understanding the genetic mechanisms underlying externally visible characteristics. However, there are currently no publications that gather all the existing panels limited to forensic DNA phenotyping and discuss the main technical limitations of the technique. In this review, we performed a bibliographic search in Scopus database of phenotyping-related literature, which resulted in a total of 48, 43, and 15 panels for biogeographical ancestry, externally visible characteristics, and both traits inference, respectively. Here we provide a list of commercial and non-commercial panels and the limitations regarding the lack of harmonization in terms of terminology (i.e., categorization and measurement of traits) and reporting, the lack of genetic knowledge and environment influence to select markers and develop panels, and the debate surrounding the selection of genotyping technologies and prediction models and algorithms. In conclusion, this review aims to be an updated guide and to present an overview of the current related literature.
Collapse
Affiliation(s)
- Nuria Terrado-Ortuño
- Luxembourg Centre for Systems Biomedicine, Genome Analysis, Bioinformatics Core, Esch-sur-Alzette, Luxembourg
| | - Patrick May
- Luxembourg Centre for Systems Biomedicine, Genome Analysis, Bioinformatics Core, Esch-sur-Alzette, Luxembourg
| |
Collapse
|
3
|
Wang M, Duan S, Sun Q, Liu K, Liu Y, Wang Z, Li X, Wei L, Liu Y, Nie S, Zhou K, Ma Y, Yuan H, Liu B, Hu L, Liu C, He G. YHSeqY3000 panel captures all founding lineages in the Chinese paternal genomic diversity database. BMC Biol 2025; 23:18. [PMID: 39838386 PMCID: PMC11752814 DOI: 10.1186/s12915-025-02122-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2024] [Accepted: 01/07/2025] [Indexed: 01/23/2025] Open
Abstract
BACKGROUND The advancements in second-/third-generation sequencing technologies, alongside computational innovations, have significantly enhanced our understanding of the genomic structure of Y-chromosomes and their unique phylogenetic characteristics. These researches, despite the challenges posed by the lack of population-scale genomic databases, have the potential to revolutionize our approach to high-resolution, population-specific Y-chromosome panels and databases for anthropological and forensic applications. OBJECTIVES This study aimed to develop the highest-resolution Y-targeted sequencing panel, utilizing time-stamped, core phylogenetic informative mutations identified from high-coverage sequences in the YanHuang cohort. This panel is intended to provide a new tool for forensic complex pedigree search and paternal biogeographical ancestry inference, as well as explore the general patterns of the fine-scale paternal evolutionary history of ethnolinguistically diverse Chinese populations. RESULTS The sequencing performance of the East Asian-specific Y-chromosomal panel, including 2999-core SNP variants, was found to be robust and reliable. The YHSeqY3000 panel was designed to capture the genetic diversity of Chinese paternal lineages from 3500 years ago, identifying 408 terminal lineages in 2097 individuals across 41 genetically and geographically distinct populations. We identified a fine-scale paternal substructure that was correlating with ancient population migrations and expansions. New evidence was provided for extensive gene flow events between minority ethnic groups and Han Chinese people, based on the integrative Chinese Paternal Genomic Diversity Database. CONCLUSIONS This work successfully integrated Y-chromosome-related basic genomic science with forensic and anthropological translational applications, emphasizing the necessity of comprehensively characterizing Y-chromosome genomic diversity from genomically under-representative populations. This is particularly important in the second phase of our population-specific medical or anthropological genomic cohorts, where dense sampling strategies are employed.
Collapse
Affiliation(s)
- Mengge Wang
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China.
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China.
- Anti-Drug Technology Center of Guangdong Province, Guangzhou, 510230, China.
- Department of Oto-Rhino-Laryngology, West China Hospital of Sichuan University, Chengdu, 610000, China.
| | - Shuhan Duan
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, 637100, China
- Department of Oto-Rhino-Laryngology, West China Hospital of Sichuan University, Chengdu, 610000, China
| | - Qiuxia Sun
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
- Department of Forensic Medicine, College of Basic Medicine, Chongqing Medical University, Chongqing, 400331, China
| | - Kaijun Liu
- School of International Tourism and Culture, Guizhou Normal University, Guiyang, 550025, China
- MoFang Human Genome Research Institute, Tianfu Software Park, Chengdu, 610042, Sichuan, China
| | - Yan Liu
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, 637100, China
| | - Zhiyong Wang
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| | - Xiangping Li
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| | - Lanhai Wei
- School of Ethnology and Anthropology, Inner Mongolia Normal University, Hohhot, 010028, Inner Mongolia, China
| | - Yunhui Liu
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
- Department of Forensic Medicine, College of Basic Medicine, Chongqing Medical University, Chongqing, 400331, China
| | - Shengjie Nie
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| | - Kun Zhou
- MoFang Human Genome Research Institute, Tianfu Software Park, Chengdu, 610042, Sichuan, China
| | - Yongxin Ma
- Department of Medical Genetics, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610041, Sichuan, China
| | - Huijun Yuan
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
| | - Bing Liu
- Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Lan Hu
- Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Chao Liu
- Anti-Drug Technology Center of Guangdong Province, Guangzhou, 510230, China.
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, 510515, China.
| | - Guanglin He
- Institute of Rare Diseases, Frontiers Science Center for Disease-Related Molecular Network, West China Hospital, Sichuan University, Chengdu, 610000, Sichuan, China.
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China.
- Anti-Drug Technology Center of Guangdong Province, Guangzhou, 510230, China.
| |
Collapse
|
4
|
McNevin D, Watson J, Grisedale K, Dahal A, Goodwin C, Ward J. Comparison of commercial targeted amplicon sequencing assays for human remains identification casework. Int J Legal Med 2025; 139:49-60. [PMID: 39404865 PMCID: PMC11732929 DOI: 10.1007/s00414-024-03335-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2024] [Accepted: 09/13/2024] [Indexed: 01/15/2025]
Abstract
Targeted amplicon sequencing (TAS) facilitates the genotyping of forensically informative single nucleotide polymorphisms (SNPs) using massively parallel sequencing (MPS). For human remains identification, where any extracted DNA is likely to be degraded, TAS may succeed when short tandem repeat (STR) profiling using capillary electrophoresis fails. Further, as well as yielding identity information, SNPs can provide information about ancestry, phenotype, kinship and paternal lineage (Y chromosome haplotypes). Two TAS platforms were compared in this study: Ion AmpliSeq™ panels coupled with Ion Torrent sequencing on an Ion GeneStudio™ S5 Plus System, manufactured by Thermo Fisher Scientific, and the ForenSeq® Kintelligence Kit coupled with Illumina sequencing on the MiSeq FGx® Sequencing System, manufactured by QIAGEN. Four Ion AmpliSeq™ panels (Precision ID Identity, Precision ID Ancestry, DNA Phenotyping and HID Y-SNP) share 177 SNPs with the ForenSeq® Kintelligence Kit and all five were used to profile the DNA extracted from the petrous part of the temporal bone from six skeletonised cadavers. Of the 6 × 177 = 1,062 SNP genotype comparisons, 1,055 (99%) were concordant between the Ion AmpliSeq™ panels and Kintelligence Kit. Of the seven (< 1%) non-concordant SNPs, only three of them (0.3%) would have resulted in erroneous genotypes being reported as a result of allele dropout by either assay, using our optimised relative variant frequency windows for allele calling. We conclude that both the Ion AmpliSeq™ panels and the ForenSeq® Kintelligence Kit were suitable for TAS applied to the human remains in this study.
Collapse
Affiliation(s)
- Dennis McNevin
- National DNA Program for Unidentified and Missing Persons, Australian Federal Police, Majura, ACT, Australia.
- Centre for Forensic Science, School of Mathematical & Physical Sciences, Faculty of Science, University of Technology Sydney, Ultimo, NSW, Australia.
| | - Jessica Watson
- National DNA Program for Unidentified and Missing Persons, Australian Federal Police, Majura, ACT, Australia
- Centre for Forensic Science, School of Mathematical & Physical Sciences, Faculty of Science, University of Technology Sydney, Ultimo, NSW, Australia
| | - Kelly Grisedale
- National DNA Program for Unidentified and Missing Persons, Australian Federal Police, Majura, ACT, Australia
| | - Ayusha Dahal
- Centre for Forensic Science, School of Mathematical & Physical Sciences, Faculty of Science, University of Technology Sydney, Ultimo, NSW, Australia
| | | | - Jodie Ward
- National DNA Program for Unidentified and Missing Persons, Australian Federal Police, Majura, ACT, Australia
- Centre for Forensic Science, School of Mathematical & Physical Sciences, Faculty of Science, University of Technology Sydney, Ultimo, NSW, Australia
| |
Collapse
|
5
|
Liu L, Li S, Cui W, Fang Y, Mei S, Chen M, Xu H, Bai X, Zhu B. Ancestry analysis using a self-developed 56 AIM-InDel loci and machine learning methods. Forensic Sci Int 2024; 361:112065. [PMID: 38889603 DOI: 10.1016/j.forsciint.2024.112065] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Revised: 12/11/2023] [Accepted: 05/16/2024] [Indexed: 06/20/2024]
Abstract
Insertion/deletion (InDel) polymorphisms can be used as one of the ancestry-informative markers in ancestry analysis. In this study, a self-developed panel consisting of 56 ancestry-informative InDels was used to investigate the genetic structures and genetic relationships between Chinese Inner Mongolia Manchu group and 26 reference populations. The Inner Mongolia Manchu group was closely related in genetic background to East Asian populations, especially the Han Chinese in Beijing. Moreover, populations from northern and southern East Asia displayed obvious variations in ancestral components, suggesting the potential value of this panel in distinguishing the populations from northern and southern East Asia. Subsequently, four machine learning models were performed based on the 56 AIM-InDel loci to evaluate the performance of this panel in ancestry prediction. The random forest model presented better performance in ancestry prediction, with 91.87% and 99.73% accuracy for the five and three continental populations, respectively. The individuals of the Inner Mongolia Manchu group were assigned to the East Asian populations by the random forest model, and they exhibited closer genetic affinities with northern East Asian populations. Furthermore, the random forest model distinguished 87.18% of the Inner Mongolia Manchus from the East Asian populations, suggesting that the random forest model based on the 56 ancestry-informative InDels could be a potential tool for ancestry analysis.
Collapse
Affiliation(s)
- Liu Liu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China
| | - Shuanglin Li
- Department of Anatomy and Histology, School of Basic Medical Sciences, Shenzhen University Medical School, Shenzhen University, 1066 Xueyuan Avenue, Shenzhen, Guangdong, China
| | - Wei Cui
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China
| | - Yating Fang
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China
| | - Shuyan Mei
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China
| | - Man Chen
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China
| | - Hui Xu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China
| | - Xiaole Bai
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China
| | - Bofeng Zhu
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, 1838 Guangzhou Avenue North, Guangzhou, Guangdong, PR China; Key Laboratory of Shaanxi Province for Craniofacial Precision Medicine Research, College of Stomatology, Xi'anJiaotong University, 99 Yanxiang Road, Xi'an, Shaanxi, PR China.
| |
Collapse
|
6
|
Köksal Z, Børsting C, Bailliet G, Burgos G, Carvalho E, Casas-Vargas A, Castillo A, Gomes MB, Martínez B, Ossa H, Parolin ML, Quiroz A, Toscanini U, Usaquén W, Velázquez IF, Vullo C, Gusmão L, Pereira V. Application of Targeted Y-Chromosomal Capture Enrichment to Increase the Resolution of Native American Haplogroup Q. Hum Mutat 2024; 2024:3046495. [PMID: 40225924 PMCID: PMC11918922 DOI: 10.1155/2024/3046495] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Revised: 04/17/2024] [Accepted: 07/15/2024] [Indexed: 04/15/2025]
Abstract
Y-chromosomal haplogroups and the Y-SNPs defining them are relevant for the exploration of male lineages, inference of paternal ancestry, and reconstruction of migration pathways, to name a few. Currently, over 300,000 Y-SNPs have been reported, defining 20 main haplogroups. However, ascertainment bias in the investigations has led to some haplogroups being overlooked, which hinders a representative depiction of certain populations and their migration events. For migration pattern analyses of the first settlers of the Americas, the Native American main founding lineage Q-M3 needs to be further investigated to allow clear genetic differentiation of individuals of different ethnogeographic origins. To increase the resolution within this haplogroup, a total of 7.45 Mb of the Y chromosome of 59 admixed South Americans of haplogroup Q was targeted for sequencing using hybridization capture enrichment. Data were combined with 218 publicly available sequences of Central and South Americans of haplogroup Q. After rigorous data processing, variants not meeting the quality criteria were excluded and 4128 reliable Y-SNPs were reported. A total of 2224 Y-SNPs had previously unknown positions in the phylogenetic tree, and 1291 of these are novel. The phylogenetic relationships between the Y-SNPs were established using the software SNPtotree in order to report a redesigned phylogenetic tree containing 300 branches, defined by 3400 Y-SNPs. The new tree introduces 117 previously undescribed branches and is the most comprehensive phylogenetic tree of the Native American haplogroup Q lineages to date. The 214 sequences were assigned to 135 different low- to high-resolution branches, while in the previous phylogenetic tree, only 195 sequences could be sorted into 14 low-resolution branches with the same quality criteria. The improved genetic differentiation of subhaplogroup Q-M3 has a great potential to resolve migration patterns of Native Americans.
Collapse
Affiliation(s)
- Zehra Köksal
- Section of Forensic GeneticsDepartment of Forensic MedicineFaculty of Health and Medical SciencesUniversity of Copenhagen, Copenhagen, Denmark
| | - Claus Børsting
- Section of Forensic GeneticsDepartment of Forensic MedicineFaculty of Health and Medical SciencesUniversity of Copenhagen, Copenhagen, Denmark
| | - Graciela Bailliet
- Instituto Multidisciplinario de Biología CelularUniversidad Nacional de La PlataCCT-CONICET-La PlataCIC, La Plata, Argentina
| | - Germán Burgos
- One Health Global Research GroupFacultad de MedicinaUniversidad de Las Américas (UDLA), Quito, Ecuador
- Grupo de Medicina XenómicaUniversidad de Santiago de Compostela, Santiago de Compostela, Spain
| | - Elizeu Carvalho
- DNA Diagnostic Laboratory (LDD)State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Andrea Casas-Vargas
- Grupo de Genética de Poblaciones e IdentificaciónInstituto de GenéticaUniversidad Nacional de Colombia, Bogotá, Colombia
| | - Adriana Castillo
- Department of Basic SciencesUniversidad Industrial de Santander (UIS), Bucaramanga, Colombia
| | - Marilia Brito Gomes
- Department of Internal MedicineDiabetes UnitState University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Beatriz Martínez
- Instituto de Investigaciones InmunológicasUniversidad de Cartagena, Cartagena, Colombia
| | - Humberto Ossa
- Department of HematologyInstituto de PrevisiónLaboratório de Genética y Biología Molecular, Asunción, Paraguay
- Facultad de CienciasPontificia Universidad Javeriana, Bogotá, Colombia
| | - María Laura Parolin
- Instituto de Diversidad y Evolución Austral (IDEAus)Centro Nacional PatagónicoCONICET, Puerto Madryn, Argentina
| | | | - Ulises Toscanini
- Primer Centro Argentino de Inmunogenética (PRICAI)Fundación Favaloro, Buenos Aires, Argentina
| | - William Usaquén
- Grupo de Genética de Poblaciones e IdentificaciónInstituto de GenéticaUniversidad Nacional de Colombia, Bogotá, Colombia
| | - Irina F. Velázquez
- Instituto de Diversidad y Evolución Austral (IDEAus)Centro Nacional PatagónicoCONICET, Puerto Madryn, Argentina
| | - Carlos Vullo
- DNA Forensic LaboratoryEquipo Argentino de Antropología Forense (EAAF), Córdoba, Argentina
| | - Leonor Gusmão
- DNA Diagnostic Laboratory (LDD)State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Vania Pereira
- Section of Forensic GeneticsDepartment of Forensic MedicineFaculty of Health and Medical SciencesUniversity of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
7
|
Fan H, Xu Y, Zhao Y, Feng K, Hong L, Zhao Q, Lu X, Shi M, Li H, Wang L, Wen S. Development and validation of YARN: A novel SE-400 MPS kit for East Asian paternal lineage analysis. Forensic Sci Int Genet 2024; 71:103029. [PMID: 38518712 DOI: 10.1016/j.fsigen.2024.103029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 02/12/2024] [Accepted: 03/03/2024] [Indexed: 03/24/2024]
Abstract
Y-chromosomal short tandem repeat polymorphisms (Y-STRs) and Y-chromosomal single nucleotide polymorphisms (Y-SNPs) are valuable genetic markers used in paternal lineage identification and population genetics. Currently, there is a lack of an effective panel that integrates Y-STRs and Y-SNPs for studying paternal lineages, particularly in East Asian populations. Hence, we developed a novel Y-chromosomal targeted panel called YARN (Y-chromosome Ancestry and Region Network) based on multiplex PCR and a single-end 400 massive parallel sequencing (MPS) strategy, consisting of 44 patrilineage Y-STRs and 260 evolutionary Y-SNPs. A total of 386 reactions were validated for the effectiveness and applicability of YARN according to SWGDAM validation guidelines, including sensitivity (with a minimum input gDNA of 0.125 ng), mixture identification (ranging from 1:1-1:10), PCR inhibitor testing (using substances such as 50 μM hematin, 100 μM hemoglobin, 100 μM humic acid, and 2.5 mM indigo dye), species specificity (successfully distinguishing humans from other animals), repeatability study (achieved 100% accuracy), and concordance study (with 99.91% accuracy for 1121 Y-STR alleles). Furthermore, we conducted a pilot study using YARN in a cohort of 484 Han Chinese males from Huaiji County, Zhaoqing City, Guangdong, China (GDZQHJ cohort). In this cohort, we identified 52 different Y-haplogroups and 73 different surnames. We found weak to moderate correlations between the Y-haplogroups, Chinese surnames, and geographical locations of the GDZQHJ cohort (with λ values ranging from 0.050 to 0.340). However, when we combined two different categories into a new independent variable, we observed stronger correlations (with λ values ranging from 0.617 to 0.754). Overall, the YARN panel, which combines Y-STR and Y-SNP genetic markers, meets forensic DNA quality assurance guidelines and holds potential for East Asian geographical origin inference and paternal lineage analysis.
Collapse
Affiliation(s)
- Haoliang Fan
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China; School of Forensic Medicine, Shanxi Medical University, Taiyuan 030001, China.
| | - Yiran Xu
- Institute of Archaeological Science, Fudan University, Shanghai 200433, China.
| | - Yutao Zhao
- Public Security Bureau of Zhaoqing Municipality, Zhaoqing 526000, China.
| | - Kai Feng
- Duanzhou Branch of Zhaoqing Public Security Bureau, Zhaoqing 526060, China.
| | - Liuxi Hong
- Sihui Public Security Bureau of Guangdong Province, Zhaoqing 526299, China.
| | - Qiancheng Zhao
- Public Security Bureau of Zhaoqing Municipality, Zhaoqing 526000, China.
| | - Xiaoyu Lu
- Deepreads Biotech Company Limited, Guangzhou 510663, China.
| | - Meisen Shi
- Criminal Justice College of China University of Political Science and Law, Beijing 100088, China.
| | - Haiyan Li
- Criminal Technology Center of Guangdong Provincial Public Security Department, Guangzhou 510050, China.
| | - Lingxiang Wang
- MOE Laboratory for National Development and Intelligent Governance, Fudan University, Shanghai 200433, China.
| | - Shaoqing Wen
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China; Institute of Archaeological Science, Fudan University, Shanghai 200433, China; MOE Laboratory for National Development and Intelligent Governance, Fudan University, Shanghai 200433, China.
| |
Collapse
|
8
|
Guo F, Jing G, Lang Y, Liu Z, Yu S. Simultaneous sequencing of 102 Y-STRs on Ion Torrent ™ GeneStudio ™ S5 System. Forensic Sci Int Genet 2024; 71:103059. [PMID: 38749212 DOI: 10.1016/j.fsigen.2024.103059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 05/02/2024] [Accepted: 05/05/2024] [Indexed: 06/18/2024]
Abstract
The Precision ID NGS System from Thermo Fisher Scientific is a mainstream next-generation sequencing (NGS) platform used in forensic laboratories to detect almost all commonly used forensic markers, except for Y-chromosomal short tandem repeats (Y-STRs). This study aimed to: 1) develop a Y-STR panel compatible with the automatic workflow of the NGS system using Ion AmpliSeq Technology, 2) evaluate the panel performance following the SWGDAM guidelines, and 3) explore the possibility of using a combination workflow to detect autosomal STRs and Y-STRs (AY-STR NGS workflow). The GrandFiler Y-STR Panel was successfully designed using the 'separating' and 'merging' strategies, including 102 Y-STRs and Amelogenin with an average amplicon length of 133 bp. It is a mega Y-STR multiplex system in which up to 16 samples can be sequenced simultaneously on an Ion 530 ™ Chip. Developmental validation studies of the performance of the NGS platform, species specificity, reproducibility, concordance, sensitivity, degraded samples, case-type samples, and mixtures were conducted to unequivocally determine whether the GrandFiler Y-STR Panel is suitable for real scenarios. The newly developed Y-STR panel showed compelling run metrics and NGS performance, including 92.47% bases with ≥ Q20, 91.80% effective reads, 2106 × depth of coverage (DoC), and 97.09% inter-locus balance. Additionally, it showed high specificity for human males and 99.40% methodological and bioinformatical concordance, generated complete profiles at ≥ 0.1 ng input DNA, and recovered more genetic information from severely degraded and diverse case samples. Although the outcome when used on mixtures was not as expected, more genetic information was obtained compared to that from capillary electrophoresis (CE) methods. The AY-STR NGS workflow was established by combining the GrandFiler Y-STR Panel with the Precision ID GlobalFiler ™ NGS STR Panel v2 at a 2:1 concentration ratio. The combination workflow on NGS performance, reproducibility, concordance, and sensitivity was as stable as the single Y-STR NGS workflow, providing more options for forensic scientists when dealing with different case scenarios. Overall, the GrandFiler Y-STR Panel was confirmed as the first to effectively detect a large number of Y-STR markers on the Precision ID NGS System, which is compatible with 51 Y-STRs in commercial CE kits and 51 Y-STRs in commercial NGS kits and the STRBase. The panel is as robust, reliable, and sensitive as current CE/NGS kits, and is suitable for solving real cases, especially for severely degraded samples (degradation index > 10).
Collapse
Affiliation(s)
- Fei Guo
- Shenyang Medical College, Shenyang, Liaoning 110034, PR China; Key Laboratory of Human Ethnic Specificity and Phenomics of Critical Illness in Liaoning Province, Shenyang, Liaoning 110034, PR China; Key Laboratory of Phenomics in Shenyang City, Shenyang, Liaoning 110034, PR China.
| | - Guangxin Jing
- Heyuan Public Security Forensic Science Center, Heyuan, Guangdong 517000, PR China
| | - Yubo Lang
- School of Public Security Information Technology and Intelligence, Criminal Investigation Police University of China, Shenyang, Liaoning 110854, PR China
| | - Ze Liu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Shaobo Yu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China.
| |
Collapse
|
9
|
Wang Z, Wang M, Hu L, He G, Nie S. Evolutionary profiles and complex admixture landscape in East Asia: New insights from modern and ancient Y chromosome variation perspectives. Heliyon 2024; 10:e30067. [PMID: 38756579 PMCID: PMC11096704 DOI: 10.1016/j.heliyon.2024.e30067] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2023] [Revised: 04/08/2024] [Accepted: 04/18/2024] [Indexed: 05/18/2024] Open
Abstract
Human Y-chromosomes are characterized by nonrecombination and uniparental inheritance, carrying traces of human history evolution and admixture. Large-scale population-specific genomic sources based on advanced sequencing technologies have revolutionized our understanding of human Y chromosome diversity and its anthropological and forensic applications. Here, we reviewed and meta-analyzed the Y chromosome genetic diversity of modern and ancient people from China and summarized the patterns of founding lineages of spatiotemporally different populations associated with their origin, expansion, and admixture. We emphasized the strong association between our identified founding lineages and language-related human dispersal events correlated with the Sino-Tibetan, Altaic, and southern Chinese multiple-language families related to the Hmong-Mien, Tai-Kadai, Austronesian, and Austro-Asiatic languages. We subsequently summarize the recent advances in translational applications in forensic and anthropological science, including paternal biogeographical ancestry inference (PBGAI), surname investigation, and paternal history reconstruction. Whole-Y sequencing or high-resolution panels with high coverage of terminal Y chromosome lineages are essential for capturing the genomic diversity of ethnolinguistically diverse East Asians. Generally, we emphasized the importance of including more ethnolinguistically diverse, underrepresented modern and spatiotemporally different ancient East Asians in human genetic research for a comprehensive understanding of the paternal genetic landscape of East Asians with a detailed time series and for the reconstruction of a reference database in the PBGAI, even including new technology innovations of Telomere-to-Telomere (T2T) for new genetic variation discovery.
Collapse
Affiliation(s)
- Zhiyong Wang
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610000, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
| | - Mengge Wang
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610000, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510275, China
| | - Liping Hu
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| | - Guanglin He
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610000, China
- Center for Archaeological Science, Sichuan University, Chengdu, 610000, China
| | - Shengjie Nie
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| |
Collapse
|
10
|
Goli RC, Chishi KG, Ganguly I, Singh S, Dixit S, Rathi P, Diwakar V, Sree C C, Limbalkar OM, Sukhija N, Kanaka K. Global and Local Ancestry and its Importance: A Review. Curr Genomics 2024; 25:237-260. [PMID: 39156729 PMCID: PMC11327809 DOI: 10.2174/0113892029298909240426094055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2024] [Revised: 03/02/2024] [Accepted: 03/11/2024] [Indexed: 08/20/2024] Open
Abstract
The fastest way to significantly change the composition of a population is through admixture, an evolutionary mechanism. In animal breeding history, genetic admixture has provided both short-term and long-term advantages by utilizing the phenomenon of complementarity and heterosis in several traits and genetic diversity, respectively. The traditional method of admixture analysis by pedigree records has now been replaced greatly by genome-wide marker data that enables more precise estimations. Among these markers, SNPs have been the popular choice since they are cost-effective, not so laborious, and automation of genotyping is easy. Certain markers can suggest the possibility of a population's origin from a sample of DNA where the source individual is unknown or unwilling to disclose their lineage, which are called Ancestry-Informative Markers (AIMs). Revealing admixture level at the locus-specific level is termed as local ancestry and can be exploited to identify signs of recent selective response and can account for genetic drift. Considering the importance of genetic admixture and local ancestry, in this mini-review, both concepts are illustrated, encompassing basics, their estimation/identification methods, tools/software used and their applications.
Collapse
Affiliation(s)
| | - Kiyevi G. Chishi
- ICAR-National Dairy Research Institute, Karnal, 132001, Haryana, India
| | - Indrajit Ganguly
- ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - Sanjeev Singh
- ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - S.P. Dixit
- ICAR-National Bureau of Animal Genetic Resources, Karnal, 132001, Haryana, India
| | - Pallavi Rathi
- ICAR-National Dairy Research Institute, Karnal, 132001, Haryana, India
| | - Vikas Diwakar
- ICAR-National Dairy Research Institute, Karnal, 132001, Haryana, India
| | - Chandana Sree C
- ICAR-National Dairy Research Institute, Karnal, 132001, Haryana, India
| | | | - Nidhi Sukhija
- ICAR-National Dairy Research Institute, Karnal, 132001, Haryana, India
- Central Tasar Research and Training Institute, Ranchi, 835303, Jharkhand, India
| | - K.K Kanaka
- ICAR- Indian Institute of Agricultural Biotechnology, Ranchi, 834010, Jharkhand, India
| |
Collapse
|
11
|
Fu D, Adnan A, Yao J, Aldayan NH, Wang CC, Hongyi C. Unraveling the paternal genetic structure and forensic traits of the Hui population in Liaoning Province, China using Y-chromosome analysis. BMC Genomics 2023; 24:691. [PMID: 37978341 PMCID: PMC10655310 DOI: 10.1186/s12864-023-09774-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/01/2023] [Accepted: 10/30/2023] [Indexed: 11/19/2023] Open
Abstract
The Hui people are the second-largest ethnic minority in China, and they are distributed throughout the country. A previous study explored the paternal genetic structure of the Hui population in nine different regions of China, but it overlooked the Liaoning province. In this study, we examined the paternal genetic makeup and forensic traits of the Hui population in Liaoning province by analyzing 157 Y-chromosome single nucleotide polymorphisms (Y-SNPs) and 26 short tandem repeats (Y-STRs). We successfully genotyped 282 unrelated male individuals from the Hui population of Liaoning province using the SNaPshot® single base extension assay and Goldeneye™ Y26 system kit (PEOPLESPOT R&D, Beijing, China). The results revealed high haplotypic diversity (0.9998) and identified 46 terminal haplogroups for the Hui population. Additional analyses, such as heat maps, principal component analysis (PCA), genetic distance (FST), Multidimensional scaling (MDS) analysis, and median-joining network (MJ) analysis, showed that the Hui population could be classified into three groups: Northwest Hui populations (NWH), including Liaoning, Xinjiang, Qinghai, Gansu, Ningxia, Shaanxi, and Henan; Hui populations from Sichuan and Shandong (SSH); and Yunnan Hui populations (YNH). Pairwise genetic distance (Rst) comparisons with other Chinese populations revealed that the Hui population displayed genetic affinity with the Han population. The comprehensive understanding of the Hui population in Liaoning province, explored by Y-SNPs and Y-STRs, can be utilized to interpret their genetic structure and enhance the accuracy of forensic databases.
Collapse
Affiliation(s)
- Dazhi Fu
- First Affiliated Hospital of China Medical University, 155 Heping District, Shenyang, 110001, China
| | - Atif Adnan
- Department of Forensic Sciences, Collage of Criminal Justice, Naif Arab University for Security Sciences, Riyadh, Saudi Arabia.
| | - Jun Yao
- Department of Forensic Biology and Genetics, School of Forensic Medicine, China Medical University, Shenyang, 110001, China
| | - Noura H Aldayan
- Department of Medical Laboratory Sciences, College of Applied Medical Sciences, Prince Sattam Ibn Abdulaziz University, Al-Kharj, 16273, Saudi Arabia
| | - Chuan-Chao Wang
- Department of Anthropology and Ethnology, Institute of Anthropology, School of Sociology and Anthropology, Xiamen University, Xiamen, Fujian, People's Republic of China.
| | - Cao Hongyi
- First Affiliated Hospital of China Medical University, 155 Heping District, Shenyang, 110001, China.
- Department of Pathology, School of Basic Medical Sciences, China Medical University, Shenyang, 110001, China.
| |
Collapse
|
12
|
Zhao GB, Miao L, Wang M, Yuan JH, Wei LH, Feng YS, Zhao J, Kang KL, Zhang C, Ji AQ, He G, Wang L. Developmental validation of a high-resolution panel genotyping 639 Y-chromosome SNP and InDel markers and its evolutionary features in Chinese populations. BMC Genomics 2023; 24:611. [PMID: 37828453 PMCID: PMC10568895 DOI: 10.1186/s12864-023-09709-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2023] [Accepted: 10/02/2023] [Indexed: 10/14/2023] Open
Abstract
Uniparental-inherited haploid genetic marker of Y-chromosome single nucleotide polymorphisms (Y-SNP) have the power to provide a deep understanding of the human evolutionary past, forensic pedigree, and bio-geographical ancestry information. Several international cross-continental or regional Y-panels instead of Y-whole sequencing have recently been developed to promote Y-tools in forensic practice. However, panels based on next-generation sequencing (NGS) explicitly developed for Chinese populations are insufficient to represent the Chinese Y-chromosome genetic diversity and complex population structures, especially for Chinese-predominant haplogroup O. We developed and validated a 639-plex panel including 633 Y-SNPs and 6 Y-Insertion/deletions, which covered 573 Y haplogroups on the Y-DNA haplogroup tree. In this panel, subgroups from haplogroup O accounted for 64.4% of total inferable haplogroups. We reported the sequencing metrics of 354 libraries sequenced with this panel, with the average sequencing depth among 226 individuals being 3,741×. We illuminated the high level of concordance, accuracy, reproducibility, and specificity of the 639-plex panel and found that 610 loci were genotyped with as little as 0.03 ng of genomic DNA in the sensitivity test. 94.05% of the 639 loci were detectable in male-female mixed DNA samples with a mix ratio of 1:500. Nearly all of the loci were genotyped correctly when no more than 25 ng/μL tannic acid, 20 ng/μL humic acid, or 37.5 μM hematin was added to the amplification mixture. More than 80% of genotypes were obtained from degraded DNA samples with a degradation index of 11.76. Individuals from the same pedigree shared identical genotypes in 11 male pedigrees. Finally, we presented the complex evolutionary history of 183 northern Chinese Hans and six other Chinese populations, and found multiple founding lineages that contributed to the northern Han Chinese gene pool. The 639-plex panel proved an efficient tool for Chinese paternal studies and forensic applications.
Collapse
Affiliation(s)
- Guang-Bin Zhao
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Lei Miao
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| | - Mengge Wang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-sen University, Guangzhou, 510275, China
| | - Jia-Hui Yuan
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| | - Lan-Hai Wei
- School of Ethnology and Anthropology, Inner Mongolia Normal University, Inner Mongolia, 010028, China
| | - Yao-Sen Feng
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Jie Zhao
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Ke-Lai Kang
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Chi Zhang
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - An-Quan Ji
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China.
| | - Guanglin He
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China.
| | - Le Wang
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China.
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China.
| |
Collapse
|
13
|
Wen Y, Liu J, Su Y, Chen X, Hou Y, Liao L, Wang Z. Forensic biogeographical ancestry inference: recent insights and current trends. Genes Genomics 2023; 45:1229-1238. [PMID: 37081293 DOI: 10.1007/s13258-023-01387-5] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Accepted: 04/01/2023] [Indexed: 04/22/2023]
Abstract
BACKGROUND As a powerful complement to the paradigmatic DNA profiling strategy, biogeographical ancestry inference (BGAI) plays a significant part in human forensic investigation especially when a database hit or eyewitness testimony are not available. It indicates one's biogeographical profile based on known population-specific genetic variations, and thus is crucial for guiding authority investigations to find unknown individuals. Forensic biogeographical ancestry testing exploits much of the recent advances in the understanding of human genomic variation and improving of molecular biology. OBJECTIVE In this review, recent development of prospective ancestry informative markers (AIMs) and the statistical approaches of inferring biogeographic ancestry from AIMs are elucidated and discussed. METHODS We highlight the research progress of three potential AIMs (i.e., single nucleotide polymorphisms, microhaplotypes, and Y or mtDNA uniparental markers) and discuss the prospects and challenges of two methods that are commonly used in BGAI. CONCLUSION While BGAI for forensic purposes has been thriving in recent years, important challenges, such as ethics and responsibilities, data completeness, and ununified standards for evaluation, remain for the use of biogeographical ancestry information in human forensic investigations. To address these issues and fully realize the value of BGAI in forensic investigation, efforts should be made not only by labs/institutions around the world independently, but also by inter-lab/institution collaborations.
Collapse
Affiliation(s)
- Yufeng Wen
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
- School of Life Sciences, Jilin University, Changchun, 130012, China
| | - Jing Liu
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yonglin Su
- Department of Rehabilitation Medicine, West China Hospital Sichuan University, Chengdu, 610041, China
| | - Xiacan Chen
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yiping Hou
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Linchuan Liao
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| | - Zheng Wang
- Key Laboratory of Evidence Science (China University of Political Science and Law), Ministry of Education, Beijing, 100088, China.
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, 610041, China.
| |
Collapse
|
14
|
García-Olivares V, Muñoz-Barrera A, Rubio-Rodríguez LA, Jáspez D, Díaz-de Usera A, Iñigo-Campos A, Veeramah KR, Alonso S, Thomas MG, Lorenzo-Salazar JM, González-Montelongo R, Flores C. Benchmarking of human Y-chromosomal haplogroup classifiers with whole-genome and whole-exome sequence data. Comput Struct Biotechnol J 2023; 21:4613-4618. [PMID: 37817776 PMCID: PMC10560978 DOI: 10.1016/j.csbj.2023.09.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Revised: 09/12/2023] [Accepted: 09/12/2023] [Indexed: 10/12/2023] Open
Abstract
In anthropological, medical, and forensic studies, the nonrecombinant region of the human Y chromosome (NRY) enables accurate reconstruction of pedigree relationships and retrieval of ancestral information. Using high-throughput sequencing (HTS) data, we present a benchmarking analysis of command-line tools for NRY haplogroup classification. The evaluation was performed using paired Illumina data from whole-genome sequencing (WGS) and whole-exome sequencing (WES) experiments from 50 unrelated donors. Additionally, as a validation, we also used paired WGS/WES datasets of 54 individuals from the 1000 Genomes Project. Finally, we evaluated the tools on data from third-generation HTS obtained from a subset of donors and one reference sample. Our results show that WES, despite typically offering less genealogical resolution than WGS, is an effective method for determining the NRY haplogroup. Y-LineageTracker and Yleaf showed the highest accuracy for WGS data, classifying precisely 98% and 96% of the samples, respectively. Yleaf outperforms all benchmarked tools in the WES data, classifying approximately 90% of the samples. Yleaf, Y-LineageTracker, and pathPhynder can correctly classify most samples (88%) sequenced with third-generation HTS. As a result, Yleaf provides the best performance for applications that use WGS and WES. Overall, our study offers researchers with a guide that allows them to select the most appropriate tool to analyze the NRY region using both second- and third-generation HTS data.
Collapse
Affiliation(s)
- Víctor García-Olivares
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
- Plataforma Genómica de Alto Rendimiento para el Estudio de la Biodiversidad, Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas, San Cristóbal de La Laguna, Spain
| | - Adrián Muñoz-Barrera
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
| | - Luis A. Rubio-Rodríguez
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
| | - David Jáspez
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
| | - Ana Díaz-de Usera
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
| | - Antonio Iñigo-Campos
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
| | - Krishna R. Veeramah
- Department of Ecology and Evolution, Stony Brook University, Stony Brook, NY 11794-5245, United States
| | - Santos Alonso
- Department of Genetics, Physical Anthropology and Animal Physiology, University of the Basque Country UPV/EHU, Leioa, Bizkaia, Spain
- María Goyri Building, Biotechnology Center, Human Molecular Evolution Lab 2.08 UPV/EHU Science Park, 48940 Leioa, Bizkaia, Spain
| | - Mark G. Thomas
- UCL Genetics Institute, University College London (UCL), Gower Street, London WC1E 6BT, United Kingdom
- Research Department of Genetics, Evolution & Environment, University College London (UCL), Darwin Building, Gower Street, London WC1E 6BT, United Kingdom
| | - José M. Lorenzo-Salazar
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
| | - Rafaela González-Montelongo
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
- Plataforma Genómica de Alto Rendimiento para el Estudio de la Biodiversidad, Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas, San Cristóbal de La Laguna, Spain
| | - Carlos Flores
- Genomics Division, Instituto Tecnológico y de Energías Renovables (ITER), Santa Cruz de Tenerife, Spain
- Plataforma Genómica de Alto Rendimiento para el Estudio de la Biodiversidad, Instituto de Productos Naturales y Agrobiología (IPNA), Consejo Superior de Investigaciones Científicas, San Cristóbal de La Laguna, Spain
- Research Unit, Hospital Universitario Nuestra Señora de Candelaria, Santa Cruz de Tenerife, Spain
- CIBER de Enfermedades Respiratorias (CIBERES), Instituto de Salud Carlos III, Madrid, Spain
- Facultad de Ciencias de la Salud, Universidad Fernando de Pessoa Canarias, Las Palmas de Gran Canaria, Spain
| |
Collapse
|
15
|
Xavier C, Sutter C, Amory C, Niederstätter H, Parson W. NuMY-A qPCR Assay Simultaneously Targeting Human Autosomal, Y-Chromosomal, and Mitochondrial DNA. Genes (Basel) 2023; 14:1645. [PMID: 37628695 PMCID: PMC10454206 DOI: 10.3390/genes14081645] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2023] [Revised: 08/13/2023] [Accepted: 08/14/2023] [Indexed: 08/27/2023] Open
Abstract
The accurate quantification of DNA in forensic samples is of utmost importance. These samples are often present in limited amounts; therefore, it is indicated to use the appropriate analysis route with the optimum DNA amount (when possible). Also, DNA quantification can inform about the degradation stage and therefore support the decision on which downstream genotyping method to use. Consequently, DNA quantification aids in getting the best possible results from a forensic sample, considering both its DNA quantity and quality limitations. Here, we introduce NuMY, a new quantitative real-time PCR (qPCR) method for the parallel quantification of human nuclear (n) and mitochondrial (mt) DNA, assessing the male portion in mixtures of both sexes and testing for possible PCR inhibition. NuMY is based on previous work and follows the MIQE guidelines whenever applicable. Although quantification of nuclear (n)DNA by simultaneously analyzing autosomal and male-specific targets is available in commercial qPCR kits, tools that include the quantification of mtDNA are sparse. The quantification of mtDNA has proven relevant for samples with low nDNA content when conventional DNA fingerprinting techniques cannot be followed. Furthermore, the development and use of new massively parallel sequencing assays that combine multiple marker types, i.e., autosomal, Y-chromosomal, and mtDNA, can be optimized when precisely knowing the amount of each DNA component present in the input sample. For high-quality DNA extracts, NuMY provided nDNA results comparable to those of another quantification technique and has also proven to be a reliable tool for challenging, forensically relevant samples such as mixtures, inhibited, and naturally degraded samples.
Collapse
Affiliation(s)
- Catarina Xavier
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria; (C.S.); (C.A.); (H.N.)
- i3S—Institute for Research and Innovation in Health, University of Porto, 4099-002 Porto, Portugal
| | - Charlotte Sutter
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria; (C.S.); (C.A.); (H.N.)
| | - Christina Amory
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria; (C.S.); (C.A.); (H.N.)
| | - Harald Niederstätter
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria; (C.S.); (C.A.); (H.N.)
| | - Walther Parson
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria; (C.S.); (C.A.); (H.N.)
- Forensic Science Program, The Pennsylvania State University, University Park, PA 16801, USA
| |
Collapse
|
16
|
Kayser M, Sajantila A, Butler JM, Parson W, Salas A, Gill P, Parsons T, Phillips C, Egeland T, Marshall C. Special issue: Forensic Genetics: Unde venisti et quo vadis? Forensic Sci Int Genet 2023:102881. [PMID: 37173159 DOI: 10.1016/j.fsigen.2023.102881] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2023] [Accepted: 04/26/2023] [Indexed: 05/15/2023]
Affiliation(s)
- Manfred Kayser
- Department of Genetic Identification, Erasmus MC, University Medical Center Rotterdam, the Netherland.
| | - Antti Sajantila
- Department of Forensic Medicine, University of Helsinki and Finnish Institute for Health and Welfare, Helsinki, Finland
| | - John M Butler
- National Institute of Standards and Technology, Special Programs Office, Gaithersburg, MD, USA
| | - Walther Parson
- Institute of Legal Medicine, Medical University of Innsbruck, Austria and Forensic Science Program, The Pennsylvania State University, University Park, PA, USA
| | - Antonio Salas
- Forensic Genetics Unit, Institute of Forensic Sciences and GenPoB Research Group, Institute of Health Research, Clinic Hospital, University of Santiago de Compostela, Spain
| | - Peter Gill
- Forensic Genetics Research Group, Oslo University Hospital and Department of Clinical Medicine, University of Oslo, Norway
| | - Thomas Parsons
- International Commission on Missing Persons, Den Haag, the Netherlands
| | - Chris Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - Thore Egeland
- Faculty of Chemistry, Biotechnology and Food Sciences, Norwegian University of Life Sciences, Aas, Norway
| | - Charla Marshall
- Emerging Technologies Section, Armed Forces Medical Examiner System's Armed Forces DNA Identification Laboratory (AFMES-AFDIL), Dover Air Force Base, DE, USA
| |
Collapse
|
17
|
Elwick K, Rydzak P, Robertson JM. Evaluation of Library Preparation Workflows and Applications to Different Sample Types Using the PowerSeq ® 46GY System with Massively Parallel Sequencing. Genes (Basel) 2023; 14:genes14050977. [PMID: 37239337 DOI: 10.3390/genes14050977] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 04/21/2023] [Accepted: 04/24/2023] [Indexed: 05/28/2023] Open
Abstract
This project evaluated the prototype PowerSeq® 46GY System using donor DNA and casework-type samples. The goal of this study was to determine whether modifications to the manufacturer's protocol could increase read coverage and improve sample results. Buccal and casework-type libraries were prepared using the TruSeq® DNA PCR-Free HT kit or the KAPA HyperPrep kit. Both kits were evaluated unmodified, and by substituting AMPure® XP beads for the beads of the most optimal kit. Two qPCR kits, the PowerSeq® Quant MS System and KAPA Library Quantification Kit, were also evaluated along with a KAPA size-adjustment workbook, which was compared as a third quantification method. Libraries were sequenced using the MiSeq® FGx and data were analyzed with STRait Razor. Results suggested that all three quantification methods overestimated library concentration, but the PowerSeq kit was most accurate. Samples prepared with the TruSeq library kit provided the highest coverage and the fewest instances of dropout and below-threshold alleles compared with the KAPA kit. Additionally, all bone and hair samples demonstrated full profile completeness, with bone samples yielding a higher average coverage than hair samples. Overall, our study demonstrated that the 46GY manufacturer's protocol produced the best quality results compared to alternative library preparation options.
Collapse
Affiliation(s)
- Kyleen Elwick
- Visiting Scientist Program, Research & Support Unit, Laboratory Division, Federal Bureau of Investigation, 2501 Investigation Parkway, Quantico, VA 22135, USA
| | - Patrick Rydzak
- Visiting Scientist Program, Research & Support Unit, Laboratory Division, Federal Bureau of Investigation, 2501 Investigation Parkway, Quantico, VA 22135, USA
| | - James M Robertson
- Research & Support Unit, Laboratory Division, Federal Bureau of Investigation, Quantico, VA 22135, USA
| |
Collapse
|
18
|
Kayser M, Branicki W, Parson W, Phillips C. Recent advances in Forensic DNA Phenotyping of appearance, ancestry and age. Forensic Sci Int Genet 2023; 65:102870. [PMID: 37084623 DOI: 10.1016/j.fsigen.2023.102870] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2023] [Accepted: 04/04/2023] [Indexed: 04/09/2023]
Abstract
Forensic DNA Phenotyping (FDP) comprises the prediction of a person's externally visible characteristics regarding appearance, biogeographic ancestry and age from DNA of crime scene samples, to provide investigative leads to help find unknown perpetrators that cannot be identified with forensic STR-profiling. In recent years, FDP has advanced considerably in all of its three components, which we summarize in this review article. Appearance prediction from DNA has broadened beyond eye, hair and skin color to additionally comprise other traits such as eyebrow color, freckles, hair structure, hair loss in men, and tall stature. Biogeographic ancestry inference from DNA has progressed from continental ancestry to sub-continental ancestry detection and the resolving of co-ancestry patterns in genetically admixed individuals. Age estimation from DNA has widened beyond blood to more somatic tissues such as saliva and bones as well as new markers and tools for semen. Technological progress has allowed forensically suitable DNA technology with largely increased multiplex capacity for the simultaneous analysis of hundreds of DNA predictors with targeted massively parallel sequencing (MPS). Forensically validated MPS-based FDP tools for predicting from crime scene DNA i) several appearance traits, ii) multi-regional ancestry, iii) several appearance traits together with multi-regional ancestry, and iv) age from different tissue types, are already available. Despite recent advances that will likely increase the impact of FDP in criminal casework in the near future, moving reliable appearance, ancestry and age prediction from crime scene DNA to the level of detail and accuracy police investigators may desire, requires further intensified scientific research together with technical developments and forensic validations as well as the necessary funding.
Collapse
Affiliation(s)
- Manfred Kayser
- Department of Genetic Identification, Erasmus MC, University Medical Center Rotterdam, Rotterdam, the Netherlands.
| | - Wojciech Branicki
- Institute of Zoology and Biomedical Research, Jagiellonian University, Kraków, Poland,; Institute of Forensic Research, Kraków, Poland
| | - Walther Parson
- Institute of Legal Medicine, Medical University of Innsbruck, Innsbruck, Austria; Forensic Science Program, The Pennsylvania State University, PA, USA
| | - Christopher Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| |
Collapse
|
19
|
He G, Wang M, Miao L, Chen J, Zhao J, Sun Q, Duan S, Wang Z, Xu X, Sun Y, Liu Y, Liu J, Wang Z, Wei L, Liu C, Ye J, Wang L. Multiple founding paternal lineages inferred from the newly-developed 639-plex Y-SNP panel suggested the complex admixture and migration history of Chinese people. Hum Genomics 2023; 17:29. [PMID: 36973821 PMCID: PMC10045532 DOI: 10.1186/s40246-023-00476-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2023] [Accepted: 03/15/2023] [Indexed: 03/29/2023] Open
Abstract
BACKGROUND Non-recombining regions of the Y-chromosome recorded the evolutionary traces of male human populations and are inherited haplotype-dependently and male-specifically. Recent whole Y-chromosome sequencing studies have identified previously unrecognized population divergence, expansion and admixture processes, which promotes a better understanding and application of the observed patterns of Y-chromosome genetic diversity. RESULTS Here, we developed one highest-resolution Y-chromosome single nucleotide polymorphism (Y-SNP) panel targeted for uniparental genealogy reconstruction and paternal biogeographical ancestry inference, which included 639 phylogenetically informative SNPs. We genotyped these loci in 1033 Chinese male individuals from 33 ethnolinguistically diverse populations and identified 256 terminal Y-chromosomal lineages with frequency ranging from 0.0010 (singleton) to 0.0687. We identified six dominant common founding lineages associated with different ethnolinguistic backgrounds, which included O2a2b1a1a1a1a1a1a1-M6539, O2a1b1a1a1a1a1a1-F17, O2a2b1a1a1a1a1b1a1b-MF15397, O2a2b2a1b1-A16609, O1b1a1a1a1b2a1a1-F2517, and O2a2b1a1a1a1a1a1-F155. The AMOVA and nucleotide diversity estimates revealed considerable differences and high genetic diversity among ethnolinguistically different populations. We constructed one representative phylogenetic tree among 33 studied populations based on the haplogroup frequency spectrum and sequence variations. Clustering patterns in principal component analysis and multidimensional scaling results showed a genetic differentiation between Tai-Kadai-speaking Li, Mongolic-speaking Mongolian, and other Sinitic-speaking Han Chinese populations. Phylogenetic topology inferred from the BEAST and Network relationships reconstructed from the popART further showed the founding lineages from culturally/linguistically diverse populations, such as C2a/C2b was dominant in Mongolian people and O1a/O1b was dominant in island Li people. We also identified many lineages shared by more than two ethnolinguistically different populations with a high proportion, suggesting their extensive admixture and migration history. CONCLUSIONS Our findings indicated that our developed high-resolution Y-SNP panel included major dominant Y-lineages of Chinese populations from different ethnic groups and geographical regions, which can be used as the primary and powerful tool for forensic practice. We should emphasize the necessity and importance of whole sequencing of more ethnolinguistically different populations, which can help identify more unrecognized population-specific variations for the promotion of Y-chromosome-based forensic applications.
Collapse
Affiliation(s)
- Guanglin He
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China.
| | - Mengge Wang
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, 510275, China
| | - Lei Miao
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Jing Chen
- School of Forensic Medicine, Shanxi Medical University, Jinzhong, 030001, China
| | - Jie Zhao
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China
| | - Qiuxia Sun
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China
- Department of Forensic Medicine, College of Basic Medicine, Chongqing Medical University, Chongqing, 400331, China
| | - Shuhan Duan
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, 637000, China
| | - Zhiyong Wang
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China
- School of Forensic Medicine, Kunming Medical University, Kunming, 650500, China
| | - Xiaofei Xu
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China
| | - Yuntao Sun
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China
- Institute of Forensic Medicine, West China School of Basic Science and Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Yan Liu
- Institute of Rare Diseases, West China Hospital of Sichuan University, Sichuan University, Chengdu, 610041, China
- School of Basic Medical Sciences, North Sichuan Medical College, Nanchong, 637000, China
| | - Jing Liu
- Institute of Forensic Medicine, West China School of Basic Science and Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Zheng Wang
- Institute of Forensic Medicine, West China School of Basic Science and Forensic Medicine, Sichuan University, Chengdu, 610041, China
| | - Lanhai Wei
- School of Ethnology and Anthropology, Inner Mongolia Normal University, Hohhot, 010028, Inner Mongolia, China
| | - Chao Liu
- Faculty of Forensic Medicine, Zhongshan School of Medicine, Sun Yat-Sen University, Guangzhou, 510275, China
- Guangzhou Key Laboratory of Forensic Multi-Omics for Precision Identification, School of Forensic Medicine, Southern Medical University, Guangzhou, 510515, China
| | - Jian Ye
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China.
| | - Le Wang
- National Engineering Laboratory for Forensic Science, Key Laboratory of Forensic Genetics of Ministry of Public Security, Institute of Forensic Science, Ministry of Public Security, Beijing, 100038, China.
| |
Collapse
|
20
|
Ruiz-Ramírez J, de la Puente M, Xavier C, Ambroa-Conde A, Álvarez-Dios J, Freire-Aradas A, Mosquera-Miguel A, Ralf A, Amory C, Katsara MA, Khellaf T, Nothnagel M, Cheung EYY, Gross TE, Schneider PM, Uacyisrael J, Oliveira S, Klautau-Guimarães MDN, Carvalho-Gontijo C, Pośpiech E, Branicki W, Parson W, Kayser M, Carracedo A, Lareu MV, Phillips C. Development and evaluations of the ancestry informative markers of the VISAGE Enhanced Tool for Appearance and Ancestry. Forensic Sci Int Genet 2023; 64:102853. [PMID: 36917866 DOI: 10.1016/j.fsigen.2023.102853] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2022] [Revised: 02/15/2023] [Accepted: 03/02/2023] [Indexed: 03/07/2023]
Abstract
The VISAGE Enhanced Tool for Appearance and Ancestry (ET) has been designed to combine markers for the prediction of bio-geographical ancestry plus a range of externally visible characteristics into a single massively parallel sequencing (MPS) assay. We describe the development of the ancestry panel markers used in ET, and the enhanced analyses they provide compared to previous MPS-based forensic ancestry assays. As well as established autosomal single nucleotide polymorphisms (SNPs) that differentiate sub-Saharan African, European, East Asian, South Asian, Native American, and Oceanian populations, ET includes autosomal SNPs able to efficiently differentiate populations from Middle East regions. The ability of the ET autosomal ancestry SNPs to distinguish Middle East populations from other continentally defined population groups is such that characteristic patterns for this region can be discerned in genetic cluster analysis using STRUCTURE. Joint cluster membership estimates showing individual co-ancestry that signals North African or East African origins were detected, or cluster patterns were seen that indicate origins from central and Eastern regions of the Middle East. In addition to an augmented panel of autosomal SNPs, ET includes panels of 85 Y-SNPs, 16 X-SNPs and 21 autosomal Microhaplotypes. The Y- and X-SNPs provide a distinct method for obtaining extra detail about co-ancestry patterns identified in males with admixed backgrounds. This study used the 1000 Genomes admixed African and admixed American sample sets to fully explore these enhancements to the analysis of individual co-ancestry. Samples from urban and rural Brazil with contrasting distributions of African, European, and Native American co-ancestry were also studied to gauge the efficiency of combining Y- and X-SNP data for this purpose. The small panel of Microhaplotypes incorporated in ET were selected because they showed the highest levels of haplotype diversity amongst the seven population groups we sought to differentiate. Microhaplotype data was not formally combined with single-site SNP genotypes to analyse ancestry. However, the haplotype sequence reads obtained with ET from these loci creates an effective system for de-convoluting two-contributor mixed DNA. We made simple mixture experiments to demonstrate that when the contributors have different ancestries and the mixture ratios are imbalanced (i.e., not 1:1 mixtures) the ET Microhaplotype panel is an informative system to infer ancestry when this differs between the contributors.
Collapse
Affiliation(s)
- J Ruiz-Ramírez
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - M de la Puente
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain.
| | - C Xavier
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - A Ambroa-Conde
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - J Álvarez-Dios
- Faculty of Mathematics, University of Santiago de Compostela, 15705 Santiago de Compostela, Spain
| | - A Freire-Aradas
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - A Mosquera-Miguel
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - A Ralf
- Department of Genetic Identification, Erasmus MC, University Medical Center Rotterdam, 3015 CN Rotterdam, South Holland, the Netherlands
| | - C Amory
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria
| | - M A Katsara
- Cologne Center for Genomics, University of Cologne, 50823 Cologne, Germany
| | - T Khellaf
- Cologne Center for Genomics, University of Cologne, 50823 Cologne, Germany
| | - M Nothnagel
- Cologne Center for Genomics, University of Cologne, 50823 Cologne, Germany; University Hospital Cologne, 50937 Cologne, Germany
| | - E Y Y Cheung
- Institute of Legal Medicine, Faculty of Medicine and University Clinic, University of Cologne, 50823 Cologne, Germany
| | - T E Gross
- Institute of Legal Medicine, Faculty of Medicine and University Clinic, University of Cologne, 50823 Cologne, Germany
| | - P M Schneider
- Institute of Legal Medicine, Faculty of Medicine and University Clinic, University of Cologne, 50823 Cologne, Germany
| | - J Uacyisrael
- Fiji Police Forensic Biology and DNA Laboratory, Nasova, Suva, Fiji
| | - S Oliveira
- Departamento Genética e Morfologia, Universidade de Brasília, Brasília, DF, Brazil
| | | | - C Carvalho-Gontijo
- Departamento Genética e Morfologia, Universidade de Brasília, Brasília, DF, Brazil
| | - E Pośpiech
- Malopolska Centre of Biotechnology, Jagiellonian University, 30-387 Kraków, Poland
| | - W Branicki
- Institute of Zoology and Biomedical Research, Jagiellonian University, 30-387 Kraków, Poland
| | - W Parson
- Institute of Legal Medicine, Medical University of Innsbruck, 6020 Innsbruck, Austria; Forensic Science Program, The Pennsylvania State University, University Park, State College, PA 16802, USA
| | - M Kayser
- Department of Genetic Identification, Erasmus MC, University Medical Center Rotterdam, 3015 CN Rotterdam, South Holland, the Netherlands
| | - A Carracedo
- Fundación Pública Galega de Medicina Xenómica (FPGMX), Instituto de Investigación Sanitaria (IDIS),15706 Santiago de Compostela, Spain; Genomics Group, CIBERER, CIMUS, University of Santiago de Compostela, Spain
| | - M V Lareu
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain
| | - C Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, 15782 Santiago de Compostela, Spain.
| |
Collapse
|
21
|
Resutik P, Aeschbacher S, Krützen M, Kratzer A, Haas C, Phillips C, Arora N. Comparative evaluation of the MAPlex, Precision ID Ancestry Panel, and VISAGE Basic Tool for biogeographical ancestry inference. Forensic Sci Int Genet 2023; 64:102850. [PMID: 36924679 DOI: 10.1016/j.fsigen.2023.102850] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2022] [Revised: 02/21/2023] [Accepted: 02/23/2023] [Indexed: 02/27/2023]
Abstract
Biogeographical ancestry (BGA) inference from ancestry-informative markers (AIMs) has strong potential to support forensic investigations. Over the past two decades, several forensic panels composed of AIMs have been developed to predict ancestry at a continental scale. These panels typically comprise fewer than 200 AIMs and have been designed and tested with a limited set of populations. How well these panels recover patterns of genetic diversity relative to larger sets of markers, and how accurately they infer ancestry of individuals and populations not included in their design remains poorly understood. The lack of comparative studies addressing these aspects makes the selection of appropriate panels for forensic laboratories difficult. In this study, the model-based genetic clustering tool STRUCTURE was used to compare three popular forensic BGA panels: MAPlex, Precision ID Ancestry Panel (PIDAP), and VISAGE Basic Tool (VISAGE BT) relative to a genome-wide reference set of 10k SNPs. The genotypes for all these markers were obtained for a comprehensive set of 3957 individuals from 228 worldwide human populations. Our results indicate that at the broad continental scale (K=6) typically examined in forensic studies, all forensic panels produced similar genetic structure patterns compared to the reference set (G'≈90%) and had high classification performance across all regions (average AUC-PR > 97%). However, at K= 7 and K= 8, the forensic panels displayed some region-specific clustering deviations from the reference set, particularly in Europe and the region of East and South-East Asia, which may be attributed to differences in the design of the respective panels. Overall, the panel with the most consistent performance in all regions was VISAGE BT with an average weighted AUC̅W score of 96.26% across the three scales of geographical resolution investigated.
Collapse
Affiliation(s)
- Peter Resutik
- Zurich Institute of Forensic Medicine, University of Zurich, Switzerland.
| | - Simon Aeschbacher
- Department of Evolutionary Biology and Environmental Studies, University of Zurich, Switzerland
| | - Michael Krützen
- Department of Evolutionary Anthropology, University of Zurich, Switzerland
| | - Adelgunde Kratzer
- Zurich Institute of Forensic Medicine, University of Zurich, Switzerland
| | - Cordula Haas
- Zurich Institute of Forensic Medicine, University of Zurich, Switzerland
| | - Christopher Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| | - Natasha Arora
- Zurich Institute of Forensic Medicine, University of Zurich, Switzerland.
| |
Collapse
|
22
|
Tao R, Li M, Chai S, Xia R, Qu Y, Yuan C, Yang G, Dong X, Bian Y, Zhang S, Li C. Developmental validation of a 381 Y-chromosome SNP panel for haplogroup analysis in the Chinese populations. Forensic Sci Int Genet 2023; 62:102803. [PMID: 36368220 DOI: 10.1016/j.fsigen.2022.102803] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Revised: 09/19/2022] [Accepted: 10/26/2022] [Indexed: 01/15/2023]
Abstract
Y-chromosome single nucleotide polymorphism (Y-SNP) shows great variation in geographical distribution and population heterogeneity and can be used to map population genetics around the world. Massive parallel sequencing (MPS) methodology enables high-resolution Y-SNP haplogrouping for a certain male and is widely used in forensic genetics and evolutionary studies. In this present study, we used MPS to develop a customized 381 Y-SNP panel (SifaMPS 381 Y-SNP panel) to investigate the basic structure and subbranches of the haplogroup tree of the Chinese populations. The SifaMPS 381 Y-SNP panel covers all the Y-SNPs from our previously designed 183 Y-SNP panel and additional SNPs under the predominant haplogroups in the Chinese populations based on certain criteria. We also evaluated the sequencing matrix, concordance, sensitivity, repeatability of this panel and the ability to analyze mixed and case-type samples based on the Illumina MiSeq System. The results demonstrated that the novel MPS Y-SNP panel possessed good sequencing performance and generated accurate Y-SNP genotyping results. Although the recommended DNA input was greater than 1.25 ng, we observed that a lower DNA amount could still be used to analyze haplogroups correctly. In addition, this panel could handle mixed samples and common case-type samples and had higher resolution among Chinese Han males than previously reported. In conclusion, the SifaMPS 381 Y-SNP panel showed an overall good performance and offers a better choice for Y-SNP haplogrouping of the Chinese population, thereby facilitating paternal lineage classification, familial searching and other forensic applications.
Collapse
Affiliation(s)
- Ruiyang Tao
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China
| | - Min Li
- School of Basic Medical Sciences, Shandong First Medical University & Shandong Academy of Medical Sciences, Jinan 250117, Shandong, China
| | - Siyu Chai
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China; Department of Forensic Medicine, Zunyi Medical University, Zunyi 563099, Guizhou, China
| | - Ruocheng Xia
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China
| | - Yiling Qu
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China; Department of Forensic Science, Medical School of Soochow University, Suzhou 215123, China
| | - Chunyan Yuan
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China; Department of Forensic Medicine, Inner Mongolia Medical University, Hohhot 010110, China
| | - Guangyuan Yang
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China; Department of Forensic Medicine, Inner Mongolia Medical University, Hohhot 010110, China
| | - Xinyu Dong
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China; School of Forensic Medicine, Shanxi Medical University, Jinzhong 030619, Shanxi, China
| | - Yingnan Bian
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China
| | - Suhua Zhang
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China.
| | - Chengtao Li
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Science, Ministry of Justice, P.R. China, Shanghai 200063, China.
| |
Collapse
|
23
|
Multiplex Snapshot minisequencing for the detection of common PAH gene mutations in Iranian patients with Phenylketonuria. IRANIAN BIOMEDICAL JOURNAL 2023; 27:46-57. [PMID: 36624928 PMCID: PMC9971712 DOI: 10.52547/ibj.3856] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/11/2023]
Abstract
Background Phenylketonuria is a common inborn defect of amino acid metabolism in the world. This failure is caused by an autosomal recessive insufficiency of the hepatic enzyme hyperphenylalaninemia (PAH), which catalyzes the irreversible hydroxylation of phenylalanine to tyrosine. More than 1,040 different disease-causing mutations have already been identified in the PAH gene. The most prominent complication of Phenylketonuria, if not diagnosed and treated, is severe mental retardation. Hence, early diagnosis and initiation of nutritional therapy are the most significant measures in preventing this mental disorder. Given these data, we developed a simple and rapid molecular test to detect the most frequent PAH mutations. Methods Multiplex assay was developed based on the SNaPshot minisequencing approach to simultaneously perform genotyping of the 10 mutations at the PAH gene. We optimized detection of these mutations in one multiplex PCR, followed by 10 single-nucleotide extension reactions. DNA sequencing assay was also used to verify genotyping results obtained by SNaPshot minisequencing. Result All 10 genotypes were determined based on the position and the fluorescent color of the peaks in a single electropherogram. Sequencing results of these frequent mutations showed that by using this method, a 100% detection rate could be achieved in the Iranian population. Conclusion SNaPshot minisequencing can be useful as a secondary test in neonatal screening for HPA in neonates with a positive screening test, and it is also suitable for carrier screening. The assay can be easily applied for accurate and time- and cost-efficient genotyping of the selected SNPs in various population.
Collapse
|
24
|
Butler JM. Recent advances in forensic biology and forensic DNA typing: INTERPOL review 2019-2022. Forensic Sci Int Synerg 2022; 6:100311. [PMID: 36618991 PMCID: PMC9813539 DOI: 10.1016/j.fsisyn.2022.100311] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
This review paper covers the forensic-relevant literature in biological sciences from 2019 to 2022 as a part of the 20th INTERPOL International Forensic Science Managers Symposium. Topics reviewed include rapid DNA testing, using law enforcement DNA databases plus investigative genetic genealogy DNA databases along with privacy/ethical issues, forensic biology and body fluid identification, DNA extraction and typing methods, mixture interpretation involving probabilistic genotyping software (PGS), DNA transfer and activity-level evaluations, next-generation sequencing (NGS), DNA phenotyping, lineage markers (Y-chromosome, mitochondrial DNA, X-chromosome), new markers and approaches (microhaplotypes, proteomics, and microbial DNA), kinship analysis and human identification with disaster victim identification (DVI), and non-human DNA testing including wildlife forensics. Available books and review articles are summarized as well as 70 guidance documents to assist in quality control that were published in the past three years by various groups within the United States and around the world.
Collapse
Affiliation(s)
- John M. Butler
- National Institute of Standards and Technology, Special Programs Office, 100 Bureau Drive, Mail Stop 4701, Gaithersburg, MD, USA
| |
Collapse
|
25
|
Liu J, Jiang L, Zhao M, Du W, Wen Y, Li S, Zhang S, Fang F, Shen J, He G, Wang M, Dai H, Hou Y, Wang Z. Development and validation of a custom panel including 256 Y-SNPs for Chinese Y-chromosomal haplogroups dissection. Forensic Sci Int Genet 2022; 61:102786. [DOI: 10.1016/j.fsigen.2022.102786] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2022] [Revised: 08/25/2022] [Accepted: 10/04/2022] [Indexed: 11/04/2022]
|
26
|
Carratto TMT, Moraes VMS, Recalde TSF, Oliveira MLGD, Teixeira Mendes-Junior C. Applications of massively parallel sequencing in forensic genetics. Genet Mol Biol 2022; 45:e20220077. [PMID: 36121926 PMCID: PMC9514793 DOI: 10.1590/1678-4685-gmb-2022-0077] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Accepted: 07/15/2022] [Indexed: 11/22/2022] Open
Abstract
Massively parallel sequencing, also referred to as next-generation sequencing, has positively changed DNA analysis, allowing further advances in genetics. Its capability of dealing with low quantity/damaged samples makes it an interesting instrument for forensics. The main advantage of MPS is the possibility of analyzing simultaneously thousands of genetic markers, generating high-resolution data. Its detailed sequence information allowed the discovery of variations in core forensic short tandem repeat loci, as well as the identification of previous unknown polymorphisms. Furthermore, different types of markers can be sequenced in a single run, enabling the emergence of DIP-STRs, SNP-STR haplotypes, and microhaplotypes, which can be very useful in mixture deconvolution cases. In addition, the multiplex analysis of different single nucleotide polymorphisms can provide valuable information about identity, biogeographic ancestry, paternity, or phenotype. DNA methylation patterns, mitochondrial DNA, mRNA, and microRNA profiling can also be analyzed for different purposes, such as age inference, maternal lineage analysis, body-fluid identification, and monozygotic twin discrimination. MPS technology also empowers the study of metagenomics, which analyzes genetic material from a microbial community to obtain information about individual identification, post-mortem interval estimation, geolocation inference, and substrate analysis. This review aims to discuss the main applications of MPS in forensic genetics.
Collapse
Affiliation(s)
- Thássia Mayra Telles Carratto
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| | - Vitor Matheus Soares Moraes
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| | | | | | - Celso Teixeira Mendes-Junior
- Universidade de São Paulo, Faculdade de Filosofia, Ciências e Letras de Ribeirão Preto, Departamento de Química, Laboratório de Pesquisas Forenses e Genômicas, Ribeirão Preto, SP, Brazil
| |
Collapse
|
27
|
Targeted Y chromosome capture enrichment in admixed South American samples with haplogroup Q. FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES 2022. [DOI: 10.1016/j.fsigss.2022.09.034] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
28
|
Steffen CR, Romsos EL, Kiesler KM, Borsuk LA, Gettings KB, Vallone PM. Make it "SNPPY" - Updates to SRM 2391d: PCR-Based DNA Profiling Standard. FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES 2022. [DOI: 10.1016/j.fsigss.2022.09.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/01/2022]
|
29
|
HIV-1 Drug Resistance Assay Using Ion Torrent Next Generation Sequencing and On-Instrument End-to-End Analysis Software. J Clin Microbiol 2022; 60:e0025322. [PMID: 35699434 DOI: 10.1128/jcm.00253-22] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
HIV-1 antiretroviral therapy management requires sequencing the protease, reverse transcriptase, and integrase portions of the HIV-1 pol gene. Most resistance testing is performed with Sanger sequencing, which has limited ability to detect minor variants. Next generation sequencing (NGS) platforms enable variant detection at frequencies as low as 1% allowing for earlier detection of resistance and modification of therapy. Implementation of NGS assays in the clinical laboratory is hindered by complicated assay design, cumbersome wet bench procedures, and the complexity of data analysis and bioinformatics. We developed a complete NGS protocol and companion analysis and reporting pipeline using AmpliSeq multiplex PCR, Ion Torrent S5 XL sequencing, and Stanford's HIVdb resistance algorithm. Implemented as a Torrent Suite software plugin, the pipeline runs automatically after sequencing. An optimum variant frequency threshold of 10% was determined by comparing Sanger sequences of archived samples from ViroSeq testing, resulting in a sensitivity of 98.2% and specificity of 99.0%. The majority (91%) of drug resistance mutations were detected by both Sanger and NGS, with 1.7% only by Sanger and 7.3% only by NGS. Variant calls were highly reproducible and there was no cross-reactivity to VZV, HBV, CMV, EBV, and HCV. The limit of detection was 500 copies/mL. The NGS assay performance was comparable to ViroSeq Sanger sequencing and has several advantages, including a publicly available end-to-end analysis and reporting plugin. The assay provides a straightforward path for implementation of NGS for HIV drug resistance testing in the laboratory setting without additional investment in bioinformatics infrastructure and resources.
Collapse
|
30
|
de Knijff P. On the Forensic Use of Y-Chromosome Polymorphisms. Genes (Basel) 2022; 13:genes13050898. [PMID: 35627283 PMCID: PMC9141910 DOI: 10.3390/genes13050898] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 05/09/2022] [Accepted: 05/13/2022] [Indexed: 12/26/2022] Open
Abstract
Nowadays, the use of Y-chromosome polymorphisms forms an essential part of many forensic DNA investigations. However, this was not always the case. Only since 1992 have we seen that some forensic scientists started to have an interest in this chromosome. In this review, I will sketch a brief history focusing on the forensic use of Y-chromosome polymorphisms. Before describing the various applications of short-tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) on the Y-chromosome, I will discuss a few often ignored aspects influencing proper use and interpretation of Y-chromosome information: (i) genotyping Y-SNPs and Y-STRs, (ii) Y-STR haplotypes shared identical by state (IBS) or identical by descent (IBD), and (iii) Y-haplotype database frequencies.
Collapse
Affiliation(s)
- Peter de Knijff
- Department of Human Genetics, Leiden University Medical Center, P.O. Box 9600, 2300 RC Leiden, The Netherlands
| |
Collapse
|
31
|
Testing the Ion AmpliSeq™ HID Y-SNP Research Panel v1 for performance and resolution in admixed South Americans of haplogroup Q. Forensic Sci Int Genet 2022; 59:102708. [DOI: 10.1016/j.fsigen.2022.102708] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2022] [Revised: 04/04/2022] [Accepted: 04/11/2022] [Indexed: 11/18/2022]
|
32
|
Y-SNP Haplogroup Hierarchy Finder: a web tool for Y-SNP haplogroup assignment. J Hum Genet 2022; 67:487-493. [DOI: 10.1038/s10038-022-01033-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/01/2021] [Revised: 03/03/2022] [Accepted: 03/09/2022] [Indexed: 11/08/2022]
|
33
|
A multi-dimensional evaluation of the 'NIST 1032' sample set across four forensic Y-STR multiplexes. Forensic Sci Int Genet 2022; 57:102655. [PMID: 35007854 PMCID: PMC9901497 DOI: 10.1016/j.fsigen.2021.102655] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 12/09/2021] [Accepted: 12/12/2021] [Indexed: 02/08/2023]
Abstract
This manuscript reports Y-chromosomal short tandem repeat (Y-STR) haplotypes for 1032 male U.S. population samples across 30 Y-STR loci characterized by three capillary electrophoresis (CE) length-based kits (PowerPlex Y23 System, Yfiler Plus PCR Amplification Kit, and Investigator Argus Y-28 QS Kit) and one sequence-based kit (ForenSeq DNA Signature Prep Kit): DYF387S1, DYS19, DYS385 a/b, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS449, DYS456, DYS458, DYS460, DYS481, DYS505, DYS518, DYS522, DYS533, DYS549, DYS570, DYS576, DYS612, DYS627, DYS635, DYS643, and Y-GATA-H4. The length-based Y-STR haplotypes include six loci that are not reported in the sequence-based kit (DYS393, DYS449, DYS456, DYS458, DYS518, and DYS627), whereas three loci included in the sequence-based kit are not present in length-based kits (DYS505, DYS522, and DYS612). For the latter, a custom multiplex was used to generate CE length-based data, allowing 1032 samples to be evaluated for concordance across the 30 Y-STR loci included in these four commercial Y-STR typing kits. Discordances between typing methods were analyzed further to assess underlying causes such as primer binding site mutations and flanking region insertions/deletions. Allele-level frequency and statistical information is provided for sequenced loci, excluding the multi-copy loci DYF387S1 and DYS385 a/b, for which locus-specific haplotype-level frequencies are provided instead. The resulting data reveals the degree of information gained through sequencing: 88% of sequenced Y-STR loci contain additional sequence-based alleles compared to length-based data, with the DYS389II locus containing the most additional alleles (51) observed by sequencing. Despite these allelic increases, only minimal improvement was observed in haplotype resolution by sequence, with all four commercial kits providing a similar ability to differentiate length-based haplotypes in this sample set. Finally, a subset of 369 male samples were compared to their corresponding additionally sequenced father samples, revealing the sequence basis for the 50 length-based changes observed, and no additional sequence-based mutations. GenBank accession numbers are reported for each unique sequence, and associated records are available in the STRSeq Y-Chromosomal STR Loci National Center for Biotechnology Information (NCBI) BioProject, accession PRJNA380347. Haplotype data is updated in the Y-STR Haplotype Reference Database (YHRD) for the 'NIST 1032' data set to now achieve the level of maximal haplotype of YHRD. All supplementary files including revisions to previously published Y-STR data are available in the NIST Public Data Repository: U.S. population data for human identification markers, DOI 10.18434/t4/1500024.
Collapse
|
34
|
Improving the regional Y-STR haplotype resolution utilizing haplogroup-determining Y-SNPs and the application of machine learning in Y-SNP haplogroup prediction in a forensic Y-STR database: A pilot study on male Chinese Yunnan Zhaoyang Han population. Forensic Sci Int Genet 2021; 57:102659. [PMID: 35007855 DOI: 10.1016/j.fsigen.2021.102659] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 12/14/2021] [Accepted: 12/27/2021] [Indexed: 11/23/2022]
Abstract
Improving the resolution of the current widely used Y-chromosomal short tandem repeat (Y-STR) dataset is of great importance for forensic investigators, and the current approach is limited, except for the addition of more Y-STR loci. In this research, a regional Y-DNA database was investigated to improve the Y-STR haplotype resolution utilizing a Y-SNP Pedigree Tagging System that includes 24 Y-chromosomal single nucleotide polymorphism (Y-SNP) loci. This pilot study was conducted in the Chinese Yunnan Zhaoyang Han population, and 3473 unrelated male individuals were enrolled. Based on data on the male haplogroups under different panels, the matched or near-matching (NM) Y-STR haplotype pairs from different haplogroups indicated the critical roles of haplogroups in improving the regional Y-STR haplotype resolution. A classic median-joining network analysis was performed using Y-STR or Y-STR/Y-SNP data to reconstruct population substructures, which revealed the ability of Y-SNPs to correct misclassifications from Y-STRs. Additionally, population substructures were reconstructed using multiple unsupervised or supervised dimensionality reduction methods, which indicated the potential of Y-STR haplotypes in predicting Y-SNP haplogroups. Haplogroup prediction models were built based on nine publicly accessible machine-learning (ML) approaches. The results showed that the best prediction accuracy score could reach 99.71% for major haplogroups and 98.54% for detailed haplogroups. Potential influences on prediction accuracy were assessed by adjusting the Y-STR locus numbers, selecting Y-STR loci with various mutabilities, and performing data processing. ML-based predictors generally presented a better prediction accuracy than two available predictors (Nevgen and EA-YPredictor). Three tree models were developed based on the Yfiler Plus panel with unprocessed input data, which showed their strong generalization ability in classifying various Chinese Han subgroups (validation dataset). In conclusion, this study revealed the significance and application prospects of Y-SNP haplogroups in improving regional Y-STR databases. Y-SNP haplogroups can be used to discriminate NM Y-STR haplotype pairs, and it is important for forensic Y-STR databases to develop haplogroup prediction tools to improve the accuracy of biogeographic ancestry inferences.
Collapse
|
35
|
The FORCE Panel: An All-in-One SNP Marker Set for Confirming Investigative Genetic Genealogy Leads and for General Forensic Applications. Genes (Basel) 2021; 12:genes12121968. [PMID: 34946917 PMCID: PMC8702142 DOI: 10.3390/genes12121968] [Citation(s) in RCA: 32] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/29/2021] [Revised: 12/07/2021] [Accepted: 12/08/2021] [Indexed: 12/03/2022] Open
Abstract
The FORensic Capture Enrichment (FORCE) panel is an all-in-one SNP panel for forensic applications. This panel of 5422 markers encompasses common, forensically relevant SNPs (identity, ancestry, phenotype, X- and Y-chromosomal SNPs), a novel set of 3931 autosomal SNPs for extended kinship analysis, and no clinically relevant/disease markers. The FORCE panel was developed as a custom hybridization capture assay utilizing ~20,000 baits to target the selected SNPs. Five non-probative, previously identified World War II (WWII) cases were used to assess the kinship panel. Each case included one bone sample and associated family reference DNA samples. Additionally, seven reference quality samples, two 200-year-old bone samples, and four control DNAs were processed for kit performance and concordance assessments. SNP recovery after capture resulted in a mean of ~99% SNPs exceeding 10X coverage for reference and control samples, and 44.4% SNPs for bone samples. The WWII case results showed that the FORCE panel could predict first to fifth degree relationships with strong statistical support (likelihood ratios over 10,000 and posterior probabilities over 99.99%). To conclude, SNPs will be important for further advances in forensic DNA analysis. The FORCE panel shows promising results and demonstrates the utility of a 5000 SNP panel for forensic applications.
Collapse
|
36
|
An Introductory Overview of Open-Source and Commercial Software Options for the Analysis of Forensic Sequencing Data. Genes (Basel) 2021; 12:genes12111739. [PMID: 34828345 PMCID: PMC8618049 DOI: 10.3390/genes12111739] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 10/27/2021] [Accepted: 10/27/2021] [Indexed: 12/30/2022] Open
Abstract
The top challenges of adopting new methods to forensic DNA analysis in routine laboratories are often the capital investment and the expertise required to implement and validate such methods locally. In the case of next-generation sequencing, in the last decade, several specifically forensic commercial options became available, offering reliable and validated solutions. Despite this, the readily available expertise to analyze, interpret and understand such data is still perceived to be lagging behind. This review gives an introductory overview for the forensic scientists who are at the beginning of their journey with implementing next-generation sequencing locally and because most in the field do not have a bioinformatics background may find it difficult to navigate the new terms and analysis options available. The currently available open-source and commercial software for forensic sequencing data analysis are summarized here to provide an accessible starting point for those fairly new to the forensic application of massively parallel sequencing.
Collapse
|
37
|
Claerhout S, Verstraete P, Warnez L, Vanpaemel S, Larmuseau M, Decorte R. CSYseq: The first Y-chromosome sequencing tool typing a large number of Y-SNPs and Y-STRs to unravel worldwide human population genetics. PLoS Genet 2021; 17:e1009758. [PMID: 34491993 PMCID: PMC8423258 DOI: 10.1371/journal.pgen.1009758] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/19/2020] [Accepted: 08/05/2021] [Indexed: 11/26/2022] Open
Abstract
Male-specific Y-chromosome (chrY) polymorphisms are interesting components of the DNA for population genetics. While single nucleotide polymorphisms (Y-SNPs) indicate distant evolutionary ancestry, short tandem repeats (Y-STRs) are able to identify close familial kinships. Detailed chrY analysis provides thus both biogeographical background information as paternal lineage identification. The rapid advancement of high-throughput massive parallel sequencing (MPS) technology in the past decade has revolutionized genetic research. Using MPS, single-base information of both Y-SNPs as Y-STRs can be analyzed in a single assay typing multiple samples at once. In this study, we present the first extensive chrY-specific targeted resequencing panel, the 'CSYseq', which simultaneously identifies slow mutating Y-SNPs as evolution markers and rapid mutating Y-STRs as patrilineage markers. The panel was validated by paired-end sequencing of 130 males, distributed over 65 deep-rooted pedigrees covering 1,279 generations. The CSYseq successfully targets 15,611 Y-SNPs including 9,014 phylogenetic informative Y-SNPs to identify 1,443 human evolutionary Y-subhaplogroup lineages worldwide. In addition, the CSYseq properly targets 202 Y-STRs, including 81 slow, 68 moderate, 27 fast and 26 rapid mutating Y-STRs to individualize close paternal relatives. The targeted chrY markers cover a high average number of reads (Y-SNP = 717, Y-STR = 150), easy interpretation, powerful discrimination capacity and chrY specificity. The CSYseq is interesting for research on different time scales: to identify evolutionary ancestry, to find distant family and to discriminate closely related males. Therefore, this panel serves as a unique tool valuable for a wide range of genetic-genealogical applications in interdisciplinary research within evolutionary, population, molecular, medical and forensic genetics.
Collapse
Affiliation(s)
- Sofie Claerhout
- Forensic Biomedical Sciences, Department of Imaging & Pathology, KU Leuven, Leuven, Belgium
| | - Paulien Verstraete
- Forensic Biomedical Sciences, Department of Imaging & Pathology, KU Leuven, Leuven, Belgium
| | - Liesbeth Warnez
- Forensic Biomedical Sciences, Department of Imaging & Pathology, KU Leuven, Leuven, Belgium
| | - Simon Vanpaemel
- KU Leuven, Department of Mechanical Engineering, Noise and Vibration Engineering, Leuven, Belgium
- DMMS Lab, Flanders Make, Heverlee, Belgium
| | - Maarten Larmuseau
- Histories vzw, Mechelen, Belgium
- Department of Human Genetics, KU Leuven, Leuven, Belgium
| | - Ronny Decorte
- Forensic Biomedical Sciences, Department of Imaging & Pathology, KU Leuven, Leuven, Belgium
- Laboratory of Forensic genetics and Molecular Archaeology, UZ Leuven, Leuven, Belgium
| |
Collapse
|
38
|
The analysis of ancestry with small-scale forensic panels of genetic markers. Emerg Top Life Sci 2021; 5:443-453. [PMID: 33949669 DOI: 10.1042/etls20200327] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/03/2021] [Revised: 04/07/2021] [Accepted: 04/19/2021] [Indexed: 11/17/2022]
Abstract
In the last 10 years, forensic genetic analysis has been extended beyond identification tests that link a suspect to crime scene evidence using standard DNA profiling, to new supplementary tests that can provide information to investigators about a suspect in the absence of a database hit or eyewitness testimony. These tests now encompass the prediction of physical appearance, ancestry and age. In this review, we give a comprehensive overview of the full range of DNA-based ancestry inference tests designed to work with forensic contact traces, when the level of DNA is often very low or highly degraded. We outline recent developments in the design of ancestry-informative marker sets, forensic assays that use capillary electrophoresis or massively parallel sequencing, and the statistical analysis frameworks that examine the test profile and compares it to reference population variation. Three casework ancestry analysis examples are described which were successfully accomplished in the authors' laboratory, where the ancestry information obtained was critical to the outcome of the DNA analyses made.
Collapse
|
39
|
Chen Y, Branicki W, Walsh S, Nothnagel M, Kayser M, Liu F. The impact of correlations between pigmentation phenotypes and underlying genotypes on genetic prediction of pigmentation traits. Forensic Sci Int Genet 2020; 50:102395. [PMID: 33070049 DOI: 10.1016/j.fsigen.2020.102395] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2020] [Revised: 08/25/2020] [Accepted: 09/15/2020] [Indexed: 12/31/2022]
Abstract
Predicting appearance phenotypes from genotypes is relevant for various areas of human genetic research and applications such as genetic epidemiology, human history, anthropology, and particularly in forensics. Many appearance phenotypes, and thus their underlying genotypes, are highly correlated, with pigmentation traits serving as primary examples. However, all available genetic prediction models, including those for pigmentation traits currently used in forensic DNA phenotyping, ignore phenotype correlations. Here, we investigated the impact of appearance phenotype correlations on genetic appearance prediction in the exemplary case of three pigmentation traits. We used data for categorical eye, hair and skin colour as well as 41 DNA markers utilized in the recently established HIrisPlex-S system from 762 individuals with complete phenotype and genotype information. Based on these data, we performed genetic prediction modelling of eye, hair and skin colour via three different strategies, namely the established approach of predicting phenotypes solely based on genotypes while not considering phenotype correlations, and two novel approaches that considered phenotype correlations, either incorporating truly observed correlated phenotypes or DNA-predicted correlated phenotypes in addition to the DNA predictors. We found that using truly observed correlated pigmentation phenotypes as additional predictors increased the DNA-based prediction accuracies for almost all eye, hair and skin colour categories, with the largest increase for intermediate eye colour, brown hair colour, dark to black skin colour, and particularly for dark skin colour. Outcomes of dedicated computer simulations suggest that this prediction accuracy increase is due to the additional genetic information that is implicitly provided by the truly observed correlated pigmentation phenotypes used, yet not covered by the DNA predictors applied. In contrast, considering DNA-predicted correlated pigmentation phenotypes as additional predictors did not improve the performance of the genetic prediction of eye, hair and skin colour, which was in line with the results from our computer simulations. Hence, in practical applications of DNA-based appearance prediction where no phenotype knowledge is available, such as in forensic DNA phenotyping, it is not advised to use DNA-predicted correlated phenotypes as predictors in addition to the DNA predictors. In the very least, this is not recommended for the pigmentation traits and the established pigmentation DNA predictors tested here.
Collapse
Affiliation(s)
- Yan Chen
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands; Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China
| | - Wojciech Branicki
- Malopolska Centre of Biotechnology, Jagiellonian University, Krakow, Poland
| | - Susan Walsh
- Department of Biology, Indiana University Purdue University Indianapolis (IUPUI), Indianapolis, IN, USA
| | - Michael Nothnagel
- Cologne Center for Genomics, University of Cologne, Cologne, Germany; University Hospital Cologne, Cologne, Germany
| | - Manfred Kayser
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands
| | - Fan Liu
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands; Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing, China; University of Chinese Academy of Sciences, Beijing, China.
| | | |
Collapse
|
40
|
Li M, Zhang Y, Luo L, Bian Y, Li C. Development and validation of a custom panel including 183 Y-SNPs for Chinese Y-chromosomal haplogroups dissection using a MALDI-TOF MS system. Electrophoresis 2020; 41:2047-2054. [PMID: 32854146 DOI: 10.1002/elps.202000145] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2020] [Revised: 08/21/2020] [Accepted: 08/25/2020] [Indexed: 11/12/2022]
Abstract
Y-chromosome SNP haplogroups exhibit geographic structuring in many populations around the world. Therefore, Y-chromosome haplogroups have been widely used to infer paternal biogeographical ancestry and high-resolution paternal lineage classification. In the present study, we designed a customized panel containing 183 Y-SNPs based on previous studies and evaluated the genotyping performance and repeatability, concordance, sensitivity, and ability of analysing case-type samples using a MALDI-TOF MS platform. The average call rate for duplicate typing of any one SNP in the panel was 97.0%. In the concordance and accuracy study, the results of haplogroup designation obtained from MALDI-TOF MS platform were fully consistent with those obtained from the next-generation sequencing (NGS) platform. The optimal amount of template DNA in the PCR seemed to be 10 ng. However, if less DNA (≥156.25 pg) was available, it was still possible to obtain meaningful haplogroup information. For the application part, this panel could be applied for the detection of blood, semen, and buccal swabs samples. Particularly, blood stain on FTA card samples could be dissected by direct PCR amplification on the MALDI-TOF MS platform. Besides, 371 unrelated male individuals from four Chinese ethnic groups (Han, Hui, Mongolian, and Kazak) were detected using this panel. Total 78 terminal haplogroups were found and the haplogroup diversity was 0.933576. The results demonstrate that this panel could be an accurate, fast, and cost-effective method for database construction where the amount of sample material is less of a concern and when the cost of the assay is taken into consideration.
Collapse
Affiliation(s)
- Min Li
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, P. R. China.,Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, P.R. China, Ministry of Justice, Shanghai, P. R. China
| | - Yilun Zhang
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, P.R. China, Ministry of Justice, Shanghai, P. R. China.,School of Basic Medicine, Inner Mongolia Autonomous Region, Baotou Medical College, Baotou, P. R. China
| | - Li Luo
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, P.R. China, Ministry of Justice, Shanghai, P. R. China.,Department of Forensic Medicine, Zunyi Medical University, Zunyi, Guizhou, P. R. China
| | - Yingnan Bian
- Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, P.R. China, Ministry of Justice, Shanghai, P. R. China
| | - Chengtao Li
- Institute of Forensic Medicine, West China School of Basic Medical Sciences & Forensic Medicine, Sichuan University, Chengdu, P. R. China.,Shanghai Key Laboratory of Forensic Medicine, Shanghai Forensic Service Platform, Academy of Forensic Sciences, P.R. China, Ministry of Justice, Shanghai, P. R. China
| |
Collapse
|
41
|
Bouakaze C, Delehelle F, Saenz-Oyhéréguy N, Moreira A, Schiavinato S, Croze M, Delon S, Fortes-Lima C, Gibert M, Bujan L, Huyghe E, Bellis G, Calderon R, Hernández CL, Avendaño-Tamayo E, Bedoya G, Salas A, Mazières S, Charioni J, Migot-Nabias F, Ruiz-Linares A, Dugoujon JM, Thèves C, Mollereau-Manaute C, Noûs C, Poulet N, King T, D'Amato ME, Balaresque P. Predicting haplogroups using a versatile machine learning program (PredYMaLe) on a new mutationally balanced 32 Y-STR multiplex (CombYplex): Unlocking the full potential of the human STR mutation rate spectrum to estimate forensic parameters. Forensic Sci Int Genet 2020; 48:102342. [PMID: 32818722 DOI: 10.1016/j.fsigen.2020.102342] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 06/10/2020] [Accepted: 06/11/2020] [Indexed: 12/24/2022]
Abstract
We developed a new mutationally well-balanced 32 Y-STR multiplex (CombYplex) together with a machine learning (ML) program PredYMaLe to assess the impact of STR mutability on haplogourp prediction, while respecting forensic community criteria (high DC/HD). We designed CombYplex around two sub-panels M1 and M2 characterized by average and high-mutation STR panels. Using these two sub-panels, we tested how our program PredYmale reacts to mutability when considering basal branches and, moving down, terminal branches. We tested first the discrimination capacity of CombYplex on 996 human samples using various forensic and statistical parameters and showed that its resolution is sufficient to separate haplogroup classes. In parallel, PredYMaLe was designed and used to test whether a ML approach can predict haplogroup classes from Y-STR profiles. Applied to our kit, SVM and Random Forest classifiers perform very well (average 97 %), better than Neural Network (average 91 %) and Bayesian methods (< 90 %). We observe heterogeneity in haplogroup assignation accuracy among classes, with most haplogroups having high prediction scores (99-100 %) and two (E1b1b and G) having lower scores (67 %). The small sample sizes of these classes explain the high tendency to misclassify the Y-profiles of these haplogroups; results were measurably improved as soon as more training data were added. We provide evidence that our ML approach is a robust method to accurately predict haplogroups when it is combined with a sufficient number of markers, well-balanced mutation rate Y-STR panels, and large ML training sets. Further research on confounding factors (such as CNV-STR or gene conversion) and ideal STR panels in regard to the branches analysed can be developed to help classifiers further optimize prediction scores.
Collapse
Affiliation(s)
- Caroline Bouakaze
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Franklin Delehelle
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France; REVA Unit, UMR 5505 - CNRS & Université de Toulouse, Institut de Recherche en Informatique de Toulouse, 31400 Toulouse, France
| | - Nancy Saenz-Oyhéréguy
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Andreia Moreira
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Stéphanie Schiavinato
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Myriam Croze
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Solène Delon
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Cesar Fortes-Lima
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Morgane Gibert
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Louis Bujan
- Equipe d'acceuil EA3694, Hôpital Paule de Viguier, 330 Avenue de Grande Bretagne, TSA 70034, 31059 Toulouse Cedex 9, France
| | - Eric Huyghe
- Equipe d'acceuil EA3694, Hôpital Paule de Viguier, 330 Avenue de Grande Bretagne, TSA 70034, 31059 Toulouse Cedex 9, France
| | - Gil Bellis
- INED Institut National d'Etudes Démographiques, 133 Boulevard Davout, 75980 Paris cedex 20, France
| | - Rosario Calderon
- Department of Biodiversity, Ecology and Evolution, Faculty of Biology, Complutense University. 28040 Madrid, Spain
| | - Candela Lucia Hernández
- Department of Biodiversity, Ecology and Evolution, Faculty of Biology, Complutense University. 28040 Madrid, Spain
| | - Efren Avendaño-Tamayo
- Grupo de Ciencias Básicas Aplicadas del Tecnológico de Antioquia, Tecnológico de Antioquia, Institución Universitaria, Medellín 050034, Colombia
| | - Gabriel Bedoya
- GENMOL (Genética Molecular), Instituto de Biología, Universidad de Antioquia Medellín Colombia, Colombia
| | - Antonio Salas
- Unidade de Xenética, Instituto de Ciencias Forenses (INCIFOR), Facultade de Medicina, Universidade de Santiago de Compostela, GenPoB Research Group, Instituto de Investigaciones, Sanitarias (IDIS), Hospital Clínico Universitario de Santiago (SERGAS), Galicia, Spain
| | | | - Jacques Charioni
- Aix Marseille Univ, CNRS, EFS, ADES, Marseille, France; Etablissement Français du Sang PACA Corse, Marseille, France
| | | | - Andres Ruiz-Linares
- Aix Marseille Univ, CNRS, EFS, ADES, Marseille, France; Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
| | - Jean-Michel Dugoujon
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Catherine Thèves
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Catherine Mollereau-Manaute
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Camille Noûs
- Laboratoire Cogitamous, CNRS & Université Toulouse III, 31000 Toulouse, France
| | - Nicolas Poulet
- Pôle écohydraulique AFB-IMT, allée du Pr Camille Soula, 31400 Toulouse, France
| | - Turi King
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Maria Eugenia D'Amato
- Forensic DNA Laboratory, Department of Biotechnology, Faculty of Natural Sciences, University of Western Cape, Cape Town, South Africa
| | - Patricia Balaresque
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France.
| |
Collapse
|
42
|
The construction and application of a new 17-plex Y-STR system using universal fluorescent PCR. Int J Legal Med 2020; 134:2015-2027. [PMID: 32322984 DOI: 10.1007/s00414-020-02291-3] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/19/2020] [Accepted: 03/30/2020] [Indexed: 01/16/2023]
Abstract
Y-chromosomal short tandem repeat (Y-STR) polymorphisms are useful in forensic identification, population genetics, and human structures. However, the current Y-STR systems are limited in discriminating distant relatives in a family with a low discrimination power. Increasing the capacity of detecting Y chromosomal polymorphisms will drastically narrow down the matching number of genealogy populations or pedigrees. In this study, we developed a system containing 17 Y-STRs that are complementary to the current commercially available Y-STR kits. This system was constructed by multiplex PCR with expected sizes of 126-400 bp labeled by different fluorescence molecules (DYS715, DYS709, DYS716, DYS713, and DYS607 labeled by FAM; DYS718, DYS723, DYS708, and DYS714 labeled by JOE; DYS712, DYS717, DYS721, and DYS605 labeled by TAMRA; and DYS719, DYS726, DYS598, and DYS722 labeled by ROX). The system was extensively tested for sensitivity, male specificity, species specificity, mixture, population genetics, and mutation rates following the Scientific Working Group on DNA Analysis Methods (SWGDAM) guidelines. The genetic data were obtained from eight populations with a total of 1260 individuals. Our results showed that all the 17 Y-STRs are human- and male-specific and include only one copy of the Y-chromosome. The 17 Y-STR system detects 143 alleles and has a high discrimination power (0.996031746). Mutation rates were different among the 17 Y-STRs, ranging from 0.30 to 3.03%. In conclusion, our study provides a robust, sensitive, and cost-effective genotyping method for human identification, which will be beneficial for narrowing the search scope when applied to genealogy searching with the Y-STR DNA databank.
Collapse
|
43
|
The MASTiFF panel-a versatile multiple-allele SNP test for forensics. Int J Legal Med 2019; 134:441-450. [PMID: 31863187 DOI: 10.1007/s00414-019-02233-8] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2019] [Accepted: 12/11/2019] [Indexed: 10/25/2022]
Abstract
Forensic identification tests often need recourse to markers that can successfully type highly degraded DNA, and binary single nucleotide polymorphisms (SNPs) have become the variants of choice for such analyses because of their short amplified fragment lengths. The two main drawbacks of SNPs are their reduced power of discrimination per marker compared with mainstream forensic STRs and an inability to robustly detect mixed DNA-particularly using capillary electrophoresis genotyping systems such as SNaPshot™, where the dye signals are much more imbalanced than those of STR profiles. This study compiled a compact set of multiple-allele SNPs consisting of loci that had three or four nucleotide variants at the same site in order to address the lack of mixture detection capability with binary SNP tests, as well as improving levels of polymorphism per SNP by transitioning to a maximum of six or ten genotypes per locus. We report the development and optimisation of a SNaPshot-based forensic test comprising 27 tri-allelic and 2 tetra-allelic SNPs, which we named MASTiFF: a multiple-allele SNP test for forensics. Assessments of the MASTiFF panel's levels of discrimination power in the five main population groups indicate random match probabilities ranging from 10-15 down to 10-20-improving the levels possible from an equivalent number of binary SNPs. The SNaPshot test was able to detect simple mixtures successfully with more than two alleles observed in 30% of SNPs. From allele frequency data, it is estimated that more than two alleles will be present in at least one MASTiFF SNP in 99.8% of two-person mixtures, making this panel an ideal supplementary test when SNPs are chosen for the analysis of degraded forensic DNA.
Collapse
|
44
|
Liu F, Zhong K, Jing X, Uitterlinden AG, Hendriks AEJ, Drop SLS, Kayser M. Update on the predictability of tall stature from DNA markers in Europeans. Forensic Sci Int Genet 2019; 42:8-13. [PMID: 31207428 DOI: 10.1016/j.fsigen.2019.05.006] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2018] [Revised: 05/08/2019] [Accepted: 05/27/2019] [Indexed: 10/26/2022]
Abstract
Predicting adult height from DNA has important implications in forensic DNA phenotyping. In 2014, we introduced a prediction model consisting of 180 height-associated SNPs based on data from 10,361 Northwestern Europeans enriched with tall individuals (770 > 1.88 standard deviation), which yielded a mid-ranged accuracy (AUC = 0.75 for binary prediction of tall stature and R2 = 0.12 for quantitative prediction of adult height). Here, we provide an update on DNA-based height predictability considering an enlarged list of subsequently-published height-associated SNPs using data from the same set of 10,361 Europeans. A prediction model based on the full set of 689 SNPs showed an improved accuracy relative to previous models for both tall stature (AUC = 0.79) and quantitative height (R2 = 0.21). A feature selection analysis revealed a subset of 412 most informative SNPs while the corresponding prediction model retained most of the accuracy (AUC = 0.76 and R2 = 0.19) achieved with the full model. Over all, our study empirically exemplifies that the accuracy for predicting human appearance phenotypes with very complex underlying genetic architectures, such as adult height, can be improved by increasing the number of phenotype-associated DNA variants. Our work also demonstrates that a careful sub-selection allows for a considerable reduction of the number of DNA predictors that achieve similar prediction accuracy as provided by the full set. This is forensically relevant due to restrictions in the number of SNPs simultaneously analyzable with forensically suitable DNA technologies in the current days of targeted massively parallel sequencing in forensic genetics.
Collapse
Affiliation(s)
- Fan Liu
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing, China; Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| | - Kaiyin Zhong
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| | - Xiaoxi Jing
- CAS Key Laboratory of Genomic and Precision Medicine, Beijing Institute of Genomics, University of Chinese Academy of Sciences, Chinese Academy of Sciences, Beijing, China.
| | - André G Uitterlinden
- Department of Internal Medicine, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands; Department of Epidemiology, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| | - A Emile J Hendriks
- Department of Pediatrics, Pediatric Endocrinology and Diabetes, University of Cambridge, United Kingdom.
| | - Stenvert L S Drop
- Department of Pediatrics, Division of Endocrinology, Sophia Children's Hospital, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| | - Manfred Kayser
- Department of Genetic Identification, Erasmus MC University Medical Center Rotterdam, Rotterdam, the Netherlands.
| |
Collapse
|