1
|
Antão-Sousa S, Gusmão L, Modesti NM, Feliziani S, Faustino M, Marcucci V, Sarapura C, Ribeiro J, Carvalho E, Pereira V, Tomas C, de Pancorbo MM, Baeta M, Alghafri R, Almheiri R, Builes JJ, Gouveia N, Burgos G, Pontes MDL, Ibarra A, da Silva CV, Parveen R, Benitez M, Amorim A, Pinto N. Microsatellites' mutation modeling through the analysis of the Y-chromosomal transmission: Results of a GHEP-ISFG collaborative study. Forensic Sci Int Genet 2024; 69:102999. [PMID: 38181588 DOI: 10.1016/j.fsigen.2023.102999] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 10/25/2023] [Accepted: 12/10/2023] [Indexed: 01/07/2024]
Abstract
The Spanish and Portuguese Speaking Working Group of the International Society for Forensic Genetics (GHEP-ISFG) organized a collaborative study on mutations of Y-chromosomal short tandem repeats (Y-STRs). New data from 2225 father-son duos and data from 44 previously published reports, corresponding to 25,729 duos, were collected and analyzed. Marker-specific mutation rates were estimated for 33 Y-STRs. Although highly dependent on the analyzed marker, mutations compatible with the gain or loss of a single repeat were 23.2 times more likely than those involving a greater number of repeats. Longer alleles (relatively to the modal one) showed to be nearly twice more mutable than the shorter ones. Within the subset of longer alleles, the loss of repeats showed to be nearly twice more likely than the gain. Conversely, shorter alleles showed a symmetrical trend, with repeat gains being twofold more frequent than reductions. A positive correlation between the paternal age and the mutation rate was observed, strengthening previous findings. The results of a machine learning approach, via logistic regression analyses, allowed the establishment of algebraic formulas for estimating the probability of mutation depending on paternal age and allele length for DYS389I, DYS393 and DYS627. Algebraic formulas could also be established considering only the allele length as predictor for DYS19, DYS389I, DYS389II-I, DYS390, DYS391, DYS393, DYS437, DYS439, DYS449, DYS456, DYS458, DYS460, DYS481, DYS518, DYS533, DYS576, DYS626 and DYS627 loci. For the remaining Y-STRs, a lack of statistical significance was observed, probably as a consequence of the small effective size of the subsets available, a common difficulty in the modeling of rare events as is the case of mutations. The amount of data used in the different analyses varied widely, depending on how the data were reported in the publications analyzed. This shows a regrettable waste of produced data, due to inadequate communication of the results, supporting an urgent need of publication guidelines for mutation studies.
Collapse
Affiliation(s)
- Sofia Antão-Sousa
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), Porto, Portugal; Faculty of Sciences of the University of Porto (FCUP), Porto, Portugal; DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Leonor Gusmão
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Nidia M Modesti
- Centro de Genética Forense, Poder Judicial de Córdoba, Argentina
| | - Sofía Feliziani
- Centro de Genética Forense, Poder Judicial de Córdoba, Argentina
| | - Marisa Faustino
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Faculty of Sciences of the University of Porto (FCUP), Porto, Portugal
| | - Valeria Marcucci
- Laboratorio Regional de Investigación Forense, Tribunal Superior de Justicia de Santa Cruz, Argentina
| | - Claudia Sarapura
- Laboratorio Regional de Investigación Forense, Tribunal Superior de Justicia de Santa Cruz, Argentina
| | - Julyana Ribeiro
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Elizeu Carvalho
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| | - Vania Pereira
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Carmen Tomas
- Section of Forensic Genetics, Department of Forensic Medicine, Faculty of Health and Medical Sciences, University of Copenhagen, Denmark
| | - Marian M de Pancorbo
- BIOMICs Research Group, Lascaray Research Center, Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Vitoria-Gasteiz, Spain
| | - Miriam Baeta
- BIOMICs Research Group, Lascaray Research Center, Department of Zoology and Animal Cell Biology, University of the Basque Country UPV/EHU, Vitoria-Gasteiz, Spain
| | - Rashed Alghafri
- International Center for Forensic Sciences, Dubai Police G.H.Q., Dubai, United Arab Emirates
| | - Reem Almheiri
- International Center for Forensic Sciences, Dubai Police G.H.Q., Dubai, United Arab Emirates
| | - Juan José Builes
- GENES SAS Laboratory, Medellín, Colombia; Institute of Biology, University of Antioquia, Medellín, Colombia
| | - Nair Gouveia
- Instituto Nacional de Medicina Legal e Ciências Forenses, I.P. / Serviço de Genética e Biologia Forenses, Delegação do Centro, Portugal
| | - German Burgos
- One Health Global Research Group, Facultad de Medicina, Universidad de Las Américas (UDLA), Quito, Ecuador; Grupo de Medicina Xenómica, Universidad de Santiago de Compostela, Santiago de Compostela, Spain
| | - Maria de Lurdes Pontes
- Instituto Nacional de Medicina Legal e Ciências Forenses, I.P. / Serviço de Genética e Biologia Forenses, Delegação do Norte, Portugal
| | - Adriana Ibarra
- Laboratorio IDENTIGEN, Universidad de Antioquia, Colombia
| | - Claudia Vieira da Silva
- Instituto Nacional de Medicina Legal e Ciências Forenses, I.P. / Serviço de Genética e Biologia Forenses, Delegação do Sul, Portugal
| | - Rukhsana Parveen
- Forensic Services Laboratory, Centre for Applied Molecular Biology, University of the Punjab, Lahore, Pakistan
| | - Marc Benitez
- Policia de la Generalitat de Catalunya - Mossos d'Esquadra. Unitat Central del Laboratori Biològic, Barcelona, Spain
| | - António Amorim
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), Porto, Portugal; Faculty of Sciences of the University of Porto (FCUP), Porto, Portugal
| | - Nadia Pinto
- Instituto de Investigação e Inovação em Saúde (i3S), Porto, Portugal; Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), Porto, Portugal; Centre of Mathematics of the University of Porto, Porto, Portugal.
| |
Collapse
|
2
|
Antão-Sousa S, Pinto N, Rende P, Amorim A, Gusmão L. The sequence of the repetitive motif influences the frequency of multistep mutations in Short Tandem Repeats. Sci Rep 2023; 13:10251. [PMID: 37355683 PMCID: PMC10290632 DOI: 10.1038/s41598-023-32137-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/28/2022] [Accepted: 03/23/2023] [Indexed: 06/26/2023] Open
Abstract
Microsatellites, or Short Tandem Repeats (STRs), are subject to frequent length mutations that involve the loss or gain of an integer number of repeats. This work aimed to investigate the correlation between STRs' specific repetitive motif composition and mutational dynamics, specifically the occurrence of single- or multistep mutations. Allelic transmission data, comprising 323,818 allele transfers and 1,297 mutations, were gathered for 35 Y-chromosomal STRs with simple structure. Six structure groups were established: ATT, CTT, TCTA/GATA, GAAA/CTTT, CTTTT, and AGAGAT, according to the repetitive motif present in the DNA leading strand of the markers. Results show that the occurrence of multistep mutations varies significantly among groups of markers defined by the repetitive motif. The group of markers with the highest frequency of multistep mutations was the one with repetitive motif CTTTT (25% of the detected mutations) and the lowest frequency corresponding to the group with repetitive motifs TCTA/GATA (0.93%). Statistically significant differences (α = 0.05) were found between groups with repetitive motifs with different lengths, as is the case of TCTA/GATA and ATT (p = 0.0168), CTT (p < 0.0001) and CTTTT (p < 0.0001), as well as between GAAA/CTTT and CTTTT (p = 0.0102). The same occurred between the two tetrameric groups GAAA/CTTT and TCTA/GATA (p < 0.0001) - the first showing 5.7 times more multistep mutations than the second. When considering the number of repeats of the mutated paternal alleles, statistically significant differences were found for alleles with 10 or 12 repeats, between GATA and ATT structure groups. These results, which demonstrate the heterogeneity of mutational dynamics across repeat motifs, have implications in the fields of population genetics, epidemiology, or phylogeography, and whenever STR mutation models are used in evolutionary studies in general.
Collapse
Affiliation(s)
- Sofia Antão-Sousa
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal.
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal.
- Department of Biology, Faculty of Sciences of University of Porto (FCUP), Porto, Portugal.
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil.
| | - Nádia Pinto
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
- Center of Mathematics of University of Porto (CMUP), Porto, Portugal
| | - Pablo Rende
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal
- Department of Biology, Faculty of Sciences of University of Porto (FCUP), Porto, Portugal
| | - António Amorim
- Instituto de Investigação e Inovação em Saúde (i3S), University of Porto, Porto, Portugal
- Institute of Molecular Pathology and Immunology of the University of Porto (IPATIMUP), Porto, Portugal
- Department of Biology, Faculty of Sciences of University of Porto (FCUP), Porto, Portugal
| | - Leonor Gusmão
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro, Brazil
| |
Collapse
|
3
|
Antão-Sousa S, Conde-Sousa E, Gusmão L, Amorim A, Pinto N. Estimations of Mutation Rates Depend on Population Allele Frequency Distribution: The Case of Autosomal Microsatellites. Genes (Basel) 2022; 13:genes13071248. [PMID: 35886031 PMCID: PMC9323320 DOI: 10.3390/genes13071248] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2022] [Revised: 06/28/2022] [Accepted: 07/11/2022] [Indexed: 01/27/2023] Open
Abstract
Microsatellites (or short-tandem repeats (STRs)) are widely used in anthropology and evolutionary studies. Their extensive polymorphism and rapid evolution make them the ideal genetic marker for dating events, such as the age of a gene or a population. This usage requires the estimation of mutation rates, which are usually estimated by counting the observed Mendelian incompatibilities in one-generation familial configurations (typically parent(s)–child duos or trios). Underestimations are inevitable when using this approach, due to the occurrence of mutational events that do not lead to incompatibilities with the parental genotypes (‘hidden’ or ‘covert’ mutations). It is known that the likelihood that one mutation event leads to a Mendelian incompatibility depends on the mode of genetic transmission considered, the type of familial configuration (duos or trios) considered, and the genotype(s) of the progenitor(s). In this work, we show how the magnitude of the underestimation of autosomal microsatellite mutation rates varies with the populations’ allele frequency distribution spectrum. The Mendelian incompatibilities approach (MIA) was applied to simulated parent(s)/offspring duos and trios in different populational scenarios. The results showed that the magnitude and type of biases depend on the population allele frequency distribution, whatever the type of familial data considered, and are greater when duos, instead of trios, are used to obtain the estimates. The implications for molecular anthropology are discussed and a simple framework is presented to correct the naïf estimates, along with an informatics tool for the correction of incompatibility rates obtained through the MIA.
Collapse
Affiliation(s)
- Sofia Antão-Sousa
- Instituto de Investigação e Inovação em Saúde (i3S), 4200-135 Porto, Portugal; (E.C.-S.); (A.A.); (N.P.)
- Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), 4200-465 Porto, Portugal
- Faculty of Sciences, University of Porto (FCUP), 4169-007 Porto, Portugal
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro 20550-013, Brazil;
- Correspondence:
| | - Eduardo Conde-Sousa
- Instituto de Investigação e Inovação em Saúde (i3S), 4200-135 Porto, Portugal; (E.C.-S.); (A.A.); (N.P.)
- Instituto de Engenharia Biomédica (INEB), 4200-135 Porto, Portugal
| | - Leonor Gusmão
- DNA Diagnostic Laboratory (LDD), State University of Rio de Janeiro (UERJ), Rio de Janeiro 20550-013, Brazil;
| | - António Amorim
- Instituto de Investigação e Inovação em Saúde (i3S), 4200-135 Porto, Portugal; (E.C.-S.); (A.A.); (N.P.)
- Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), 4200-465 Porto, Portugal
- Faculty of Sciences, University of Porto (FCUP), 4169-007 Porto, Portugal
| | - Nádia Pinto
- Instituto de Investigação e Inovação em Saúde (i3S), 4200-135 Porto, Portugal; (E.C.-S.); (A.A.); (N.P.)
- Institute of Molecular Pathology and Immunology, University of Porto (IPATIMUP), 4200-465 Porto, Portugal
- Center of Mathematics, University of Porto (CMUP), 4169-007 Porto, Portugal
| |
Collapse
|
4
|
Paternal and maternal mutations in X-STRs: A GHEP-ISFG collaborative study. Forensic Sci Int Genet 2020; 46:102258. [PMID: 32066109 DOI: 10.1016/j.fsigen.2020.102258] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2019] [Revised: 10/06/2019] [Accepted: 01/29/2020] [Indexed: 11/22/2022]
Abstract
The GHEP-ISFG organized a collaborative study to estimate mutation rates for the markers included in the Investigator Argus X-12 QS kit Qiagen. A total of 16 laboratories gathered data from 1,612 father/mother/daughter trios, which were used to estimate both maternal and paternal mutation rates, when pooled together with other already published data. Data on fathers and mothers' age at the time of birth of the daughter were also available for ∼93 % of the cases. Population analyses were computed considering the genetic information of a subset of 1,327 unrelated daughters, corresponding to 2,654 haplotypes from residents in several regions of five countries: Argentina, Brazil, Ecuador, Portugal and Spain. Genetic differentiation analyses between the population samples from the same country did not reveal signs of significant stratification, although results from Hardy-Weinberg and linkage disequilibrium tests indicated the need of larger studies for Ecuador and Brazilian populations. The high genetic diversity of the markers resulted in a large number of haplotype combinations, showing the need of huge databases for reliable estimates of their frequencies. It should also be noted the high number of new alleles found, many of them not included in the allelic ladders provided with the kit, as very diverse populations were analyzed. The overall estimates for locus specific average mutation rates varied between 7.5E-04 (for DXS7423) and 1.1E-02 (for DXS10135), the latter being a troublesome figure for kinship analyses. Most of the found mutations (∼92 %) are compatible with the gain or loss of a single repeat. Paternal mutation rates showed to be 5.2 times higher than maternal ones. We also found that older fathers were more prone to transmit mutated alleles, having this trend not been observed in the case of the mothers.
Collapse
|