1
|
Shabalala S, Ghai M, Okpeku M. Analysis of Y-STR diversity and DNA methylation variation among Black and Indian males from KwaZulu-Natal, South Africa. Forensic Sci Int 2023; 348:111682. [PMID: 37094501 DOI: 10.1016/j.forsciint.2023.111682] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2023] [Accepted: 04/05/2023] [Indexed: 04/26/2023]
Abstract
Y-chromosome short tandem repeats (Y-STRs) are essential in understanding genetic structure and diversity of human populations and, most importantly, in identification of male perpetrators in criminal investigations. DNA methylation differences have been reported in human populations and methylation pattern at the CpG sites found within or flanking the Y-STR sites could also aid in human identification. Studies based on DNA methylation (DNAm) at Y-STRs are currently limited. The current study aimed to analyze the Y-STR diversity in South African Black and Indian individuals living in KwaZulu-Natal, Durban, South Africa, with the Yfiler™ Plus Kit and to analyze DNAm patterns in Y-STR markers CpG sites. DNA from 247 stored saliva samples were isolated and quantified. Across the 27 Y-STR loci in the Yfiler™ Plus Kit, 253 alleles were observed in 113 South African Black and Indian males, 112 unique haplotypes were observed, and one haplotype appeared twice (two Black individuals). No statistically significant differences were observed in the genetic diversity between the two population groups (Fst = 0.028, p-value ≥ 0.05). The kit showed a high discrimination capacity (DC) of 0.9912 and an overall haplotype diversity (HD) = 0.9995 among the sampled population groups. DYS438 and DYS448 markers displayed 2 and 3 CpG sites, respectively. Based on the two-tailed Fisher's Exact test, there were no statistically significant differences in the DNAm levels at DYS438 CpGs of Black and Indian males (p > 0.05). The Yfiler™ Plus Kit can be considered highly discriminatory among South African Black and Indian males. Studies on the South African population using Yfiler™ Plus Kit are scarce. Hence, accumulating Y-STR data on the diverse South African population will enhance the representation of South Africa in STR databases. Knowing which Y-STR markers are significantly informative for South Africa is essential for developing Y-STR kits better suited for the different ethnic groups. And to the best of our knowledge, DNA methylation analysis in Y-STR for different ethnic groups has never been done before. Complementing Y-STR data with methylation knowledge could provide population-specific information for forensic identification.
Collapse
Affiliation(s)
- Sthabile Shabalala
- School of Life Sciences, University of KwaZulu-Natal, Private Bag X54001, Westville, Durban 4000, South Africa
| | - Meenu Ghai
- School of Life Sciences, University of KwaZulu-Natal, Private Bag X54001, Westville, Durban 4000, South Africa.
| | - Moses Okpeku
- School of Life Sciences, University of KwaZulu-Natal, Private Bag X54001, Westville, Durban 4000, South Africa
| |
Collapse
|
2
|
Bouakaze C, Delehelle F, Saenz-Oyhéréguy N, Moreira A, Schiavinato S, Croze M, Delon S, Fortes-Lima C, Gibert M, Bujan L, Huyghe E, Bellis G, Calderon R, Hernández CL, Avendaño-Tamayo E, Bedoya G, Salas A, Mazières S, Charioni J, Migot-Nabias F, Ruiz-Linares A, Dugoujon JM, Thèves C, Mollereau-Manaute C, Noûs C, Poulet N, King T, D'Amato ME, Balaresque P. Predicting haplogroups using a versatile machine learning program (PredYMaLe) on a new mutationally balanced 32 Y-STR multiplex (CombYplex): Unlocking the full potential of the human STR mutation rate spectrum to estimate forensic parameters. Forensic Sci Int Genet 2020; 48:102342. [PMID: 32818722 DOI: 10.1016/j.fsigen.2020.102342] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/19/2019] [Revised: 06/10/2020] [Accepted: 06/11/2020] [Indexed: 12/24/2022]
Abstract
We developed a new mutationally well-balanced 32 Y-STR multiplex (CombYplex) together with a machine learning (ML) program PredYMaLe to assess the impact of STR mutability on haplogourp prediction, while respecting forensic community criteria (high DC/HD). We designed CombYplex around two sub-panels M1 and M2 characterized by average and high-mutation STR panels. Using these two sub-panels, we tested how our program PredYmale reacts to mutability when considering basal branches and, moving down, terminal branches. We tested first the discrimination capacity of CombYplex on 996 human samples using various forensic and statistical parameters and showed that its resolution is sufficient to separate haplogroup classes. In parallel, PredYMaLe was designed and used to test whether a ML approach can predict haplogroup classes from Y-STR profiles. Applied to our kit, SVM and Random Forest classifiers perform very well (average 97 %), better than Neural Network (average 91 %) and Bayesian methods (< 90 %). We observe heterogeneity in haplogroup assignation accuracy among classes, with most haplogroups having high prediction scores (99-100 %) and two (E1b1b and G) having lower scores (67 %). The small sample sizes of these classes explain the high tendency to misclassify the Y-profiles of these haplogroups; results were measurably improved as soon as more training data were added. We provide evidence that our ML approach is a robust method to accurately predict haplogroups when it is combined with a sufficient number of markers, well-balanced mutation rate Y-STR panels, and large ML training sets. Further research on confounding factors (such as CNV-STR or gene conversion) and ideal STR panels in regard to the branches analysed can be developed to help classifiers further optimize prediction scores.
Collapse
Affiliation(s)
- Caroline Bouakaze
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Franklin Delehelle
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France; REVA Unit, UMR 5505 - CNRS & Université de Toulouse, Institut de Recherche en Informatique de Toulouse, 31400 Toulouse, France
| | - Nancy Saenz-Oyhéréguy
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Andreia Moreira
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Stéphanie Schiavinato
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Myriam Croze
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Solène Delon
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Cesar Fortes-Lima
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Morgane Gibert
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Louis Bujan
- Equipe d'acceuil EA3694, Hôpital Paule de Viguier, 330 Avenue de Grande Bretagne, TSA 70034, 31059 Toulouse Cedex 9, France
| | - Eric Huyghe
- Equipe d'acceuil EA3694, Hôpital Paule de Viguier, 330 Avenue de Grande Bretagne, TSA 70034, 31059 Toulouse Cedex 9, France
| | - Gil Bellis
- INED Institut National d'Etudes Démographiques, 133 Boulevard Davout, 75980 Paris cedex 20, France
| | - Rosario Calderon
- Department of Biodiversity, Ecology and Evolution, Faculty of Biology, Complutense University. 28040 Madrid, Spain
| | - Candela Lucia Hernández
- Department of Biodiversity, Ecology and Evolution, Faculty of Biology, Complutense University. 28040 Madrid, Spain
| | - Efren Avendaño-Tamayo
- Grupo de Ciencias Básicas Aplicadas del Tecnológico de Antioquia, Tecnológico de Antioquia, Institución Universitaria, Medellín 050034, Colombia
| | - Gabriel Bedoya
- GENMOL (Genética Molecular), Instituto de Biología, Universidad de Antioquia Medellín Colombia, Colombia
| | - Antonio Salas
- Unidade de Xenética, Instituto de Ciencias Forenses (INCIFOR), Facultade de Medicina, Universidade de Santiago de Compostela, GenPoB Research Group, Instituto de Investigaciones, Sanitarias (IDIS), Hospital Clínico Universitario de Santiago (SERGAS), Galicia, Spain
| | | | - Jacques Charioni
- Aix Marseille Univ, CNRS, EFS, ADES, Marseille, France; Etablissement Français du Sang PACA Corse, Marseille, France
| | | | - Andres Ruiz-Linares
- Aix Marseille Univ, CNRS, EFS, ADES, Marseille, France; Ministry of Education Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai, China
| | - Jean-Michel Dugoujon
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Catherine Thèves
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Catherine Mollereau-Manaute
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France
| | - Camille Noûs
- Laboratoire Cogitamous, CNRS & Université Toulouse III, 31000 Toulouse, France
| | - Nicolas Poulet
- Pôle écohydraulique AFB-IMT, allée du Pr Camille Soula, 31400 Toulouse, France
| | - Turi King
- Department of Genetics, University of Leicester, Leicester, United Kingdom
| | - Maria Eugenia D'Amato
- Forensic DNA Laboratory, Department of Biotechnology, Faculty of Natural Sciences, University of Western Cape, Cape Town, South Africa
| | - Patricia Balaresque
- Laboratoire d´Anthropologie Moléculaire et Imagerie de Synthèse (AMIS), UMR5288 - CNRS & Université Toulouse III, 37 allées Jules Guesde, 31073 Toulouse Cedex 3, France.
| |
Collapse
|