1
|
Fan H, Xu Y, Zhao Y, Feng K, Hong L, Zhao Q, Lu X, Shi M, Li H, Wang L, Wen S. Development and validation of YARN: A novel SE-400 MPS kit for East Asian paternal lineage analysis. Forensic Sci Int Genet 2024; 71:103029. [PMID: 38518712 DOI: 10.1016/j.fsigen.2024.103029] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2023] [Revised: 02/12/2024] [Accepted: 03/03/2024] [Indexed: 03/24/2024]
Abstract
Y-chromosomal short tandem repeat polymorphisms (Y-STRs) and Y-chromosomal single nucleotide polymorphisms (Y-SNPs) are valuable genetic markers used in paternal lineage identification and population genetics. Currently, there is a lack of an effective panel that integrates Y-STRs and Y-SNPs for studying paternal lineages, particularly in East Asian populations. Hence, we developed a novel Y-chromosomal targeted panel called YARN (Y-chromosome Ancestry and Region Network) based on multiplex PCR and a single-end 400 massive parallel sequencing (MPS) strategy, consisting of 44 patrilineage Y-STRs and 260 evolutionary Y-SNPs. A total of 386 reactions were validated for the effectiveness and applicability of YARN according to SWGDAM validation guidelines, including sensitivity (with a minimum input gDNA of 0.125 ng), mixture identification (ranging from 1:1-1:10), PCR inhibitor testing (using substances such as 50 μM hematin, 100 μM hemoglobin, 100 μM humic acid, and 2.5 mM indigo dye), species specificity (successfully distinguishing humans from other animals), repeatability study (achieved 100% accuracy), and concordance study (with 99.91% accuracy for 1121 Y-STR alleles). Furthermore, we conducted a pilot study using YARN in a cohort of 484 Han Chinese males from Huaiji County, Zhaoqing City, Guangdong, China (GDZQHJ cohort). In this cohort, we identified 52 different Y-haplogroups and 73 different surnames. We found weak to moderate correlations between the Y-haplogroups, Chinese surnames, and geographical locations of the GDZQHJ cohort (with λ values ranging from 0.050 to 0.340). However, when we combined two different categories into a new independent variable, we observed stronger correlations (with λ values ranging from 0.617 to 0.754). Overall, the YARN panel, which combines Y-STR and Y-SNP genetic markers, meets forensic DNA quality assurance guidelines and holds potential for East Asian geographical origin inference and paternal lineage analysis.
Collapse
Affiliation(s)
- Haoliang Fan
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China; School of Forensic Medicine, Shanxi Medical University, Taiyuan 030001, China.
| | - Yiran Xu
- Institute of Archaeological Science, Fudan University, Shanghai 200433, China.
| | - Yutao Zhao
- Public Security Bureau of Zhaoqing Municipality, Zhaoqing 526000, China.
| | - Kai Feng
- Duanzhou Branch of Zhaoqing Public Security Bureau, Zhaoqing 526060, China.
| | - Liuxi Hong
- Sihui Public Security Bureau of Guangdong Province, Zhaoqing 526299, China.
| | - Qiancheng Zhao
- Public Security Bureau of Zhaoqing Municipality, Zhaoqing 526000, China.
| | - Xiaoyu Lu
- Deepreads Biotech Company Limited, Guangzhou 510663, China.
| | - Meisen Shi
- Criminal Justice College of China University of Political Science and Law, Beijing 100088, China.
| | - Haiyan Li
- Criminal Technology Center of Guangdong Provincial Public Security Department, Guangzhou 510050, China.
| | - Lingxiang Wang
- MOE Laboratory for National Development and Intelligent Governance, Fudan University, Shanghai 200433, China.
| | - Shaoqing Wen
- MOE Key Laboratory of Contemporary Anthropology, School of Life Sciences, Fudan University, Shanghai 200433, China; Institute of Archaeological Science, Fudan University, Shanghai 200433, China; MOE Laboratory for National Development and Intelligent Governance, Fudan University, Shanghai 200433, China.
| |
Collapse
|
2
|
Guo F, Jing G, Lang Y, Liu Z, Yu S. Simultaneous sequencing of 102 Y-STRs on Ion Torrent ™ GeneStudio ™ S5 System. Forensic Sci Int Genet 2024; 71:103059. [PMID: 38749212 DOI: 10.1016/j.fsigen.2024.103059] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Revised: 05/02/2024] [Accepted: 05/05/2024] [Indexed: 06/18/2024]
Abstract
The Precision ID NGS System from Thermo Fisher Scientific is a mainstream next-generation sequencing (NGS) platform used in forensic laboratories to detect almost all commonly used forensic markers, except for Y-chromosomal short tandem repeats (Y-STRs). This study aimed to: 1) develop a Y-STR panel compatible with the automatic workflow of the NGS system using Ion AmpliSeq Technology, 2) evaluate the panel performance following the SWGDAM guidelines, and 3) explore the possibility of using a combination workflow to detect autosomal STRs and Y-STRs (AY-STR NGS workflow). The GrandFiler Y-STR Panel was successfully designed using the 'separating' and 'merging' strategies, including 102 Y-STRs and Amelogenin with an average amplicon length of 133 bp. It is a mega Y-STR multiplex system in which up to 16 samples can be sequenced simultaneously on an Ion 530 ™ Chip. Developmental validation studies of the performance of the NGS platform, species specificity, reproducibility, concordance, sensitivity, degraded samples, case-type samples, and mixtures were conducted to unequivocally determine whether the GrandFiler Y-STR Panel is suitable for real scenarios. The newly developed Y-STR panel showed compelling run metrics and NGS performance, including 92.47% bases with ≥ Q20, 91.80% effective reads, 2106 × depth of coverage (DoC), and 97.09% inter-locus balance. Additionally, it showed high specificity for human males and 99.40% methodological and bioinformatical concordance, generated complete profiles at ≥ 0.1 ng input DNA, and recovered more genetic information from severely degraded and diverse case samples. Although the outcome when used on mixtures was not as expected, more genetic information was obtained compared to that from capillary electrophoresis (CE) methods. The AY-STR NGS workflow was established by combining the GrandFiler Y-STR Panel with the Precision ID GlobalFiler ™ NGS STR Panel v2 at a 2:1 concentration ratio. The combination workflow on NGS performance, reproducibility, concordance, and sensitivity was as stable as the single Y-STR NGS workflow, providing more options for forensic scientists when dealing with different case scenarios. Overall, the GrandFiler Y-STR Panel was confirmed as the first to effectively detect a large number of Y-STR markers on the Precision ID NGS System, which is compatible with 51 Y-STRs in commercial CE kits and 51 Y-STRs in commercial NGS kits and the STRBase. The panel is as robust, reliable, and sensitive as current CE/NGS kits, and is suitable for solving real cases, especially for severely degraded samples (degradation index > 10).
Collapse
Affiliation(s)
- Fei Guo
- Shenyang Medical College, Shenyang, Liaoning 110034, PR China; Key Laboratory of Human Ethnic Specificity and Phenomics of Critical Illness in Liaoning Province, Shenyang, Liaoning 110034, PR China; Key Laboratory of Phenomics in Shenyang City, Shenyang, Liaoning 110034, PR China.
| | - Guangxin Jing
- Heyuan Public Security Forensic Science Center, Heyuan, Guangdong 517000, PR China
| | - Yubo Lang
- School of Public Security Information Technology and Intelligence, Criminal Investigation Police University of China, Shenyang, Liaoning 110854, PR China
| | - Ze Liu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China
| | - Shaobo Yu
- DNA Laboratory of Forensic Science Center, Shenyang Public Security Bureau, Shenyang, Liaoning 110002, PR China.
| |
Collapse
|
3
|
Soldati G, Turrina S, Treccani M, Saccardo C, Ausania F, De Leo D. Concordance study on Y-STRs typing between SeqStudio™ genetic analyzer for HID and MiSeq™ FGx forensic genomics system. Mol Biol Rep 2023; 50:9779-9789. [PMID: 37812349 PMCID: PMC10676315 DOI: 10.1007/s11033-023-08808-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 09/08/2023] [Indexed: 10/10/2023]
Abstract
BACKGROUND Massively Parallel Sequencing (MPS) allowed an increased number of information to be retrieved from short tandem repeat (STR) analysis, expanding them not only to the size, as already performed in Capillary Electrophoresis (CE), but also to the sequence. MPS requires constant development and validation of the analytical parameters to ensure that the genotyping results of STRs correspond to those obtained by CE. Given the increased frequency of usage of Y-STRs as supplementary markers to the autosomal STRs analysis, it is urgent to validate the concordance of the typing results between CE and MPS analyses. METHODS AND RESULTS DNA extracted from 125 saliva samples of unrelated males was genotyped using Yfiler™ Plus PCR Amplification Kit and ForenSeq™ DNA Signature Prep Kit, which were analyzed by SeqStudio™ Genetic Analyzer for HID and MiSeq™ FGx Forensic Genomics System, respectively. For each shared Y-STR, allele designation, number of length- and sequence-based alleles per locus, stutter percentage, and the intra-locus balance of multicopy Y-STRs were screened. CONCLUSIONS Although the number of forensic genetics laboratories that are applying the MPS technique in routine analysis is small and does not allow a global assessment of MPS limitations, this comparative study highlights the ability of MPS to produce reliable profiles despite the generation of large amounts of raw data.
Collapse
Affiliation(s)
- Giulia Soldati
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy.
| | - Stefania Turrina
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| | - Mirko Treccani
- GM Lab, Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona, Verona, Italy
| | - Chiara Saccardo
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| | - Francesco Ausania
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| | - Domenico De Leo
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| |
Collapse
|
4
|
Riman S, Ghemrawi M, Borsuk LA, Mahfouz R, Walsh S, Vallone PM. Sequence-based allelic variations and frequencies for 22 autosomal STR loci in the Lebanese population. Forensic Sci Int Genet 2023; 65:102872. [PMID: 37068444 DOI: 10.1016/j.fsigen.2023.102872] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2022] [Revised: 04/06/2023] [Accepted: 04/08/2023] [Indexed: 04/19/2023]
Abstract
This is the first study that characterizes the sequence-based allelic variations of 22 autosomal Short Tandem Repeat (aSTR) loci in a population dataset collected from Lebanon. Genomic DNA extracts from 195 unrelated Lebanese individuals were amplified with PowerSeq 46GY System Prototype. Targeted amplicons were subjected to DNA library preparation and sequenced on the Verogen MiSeq FGx Sequencing System. Raw FASTQ data files were processed by STRait Razor v3. Sequence strings were annotated according to the considerations of the DNA Commission of the International Society for Forensic Genetics (ISFG) and tabulated herein with their respective allelic frequencies and GeneBank accession and version numbers. The sequenced Lebanese dataset resulted in 429 distinct allelic sequences as compared to the 236 alleles identified by length only. The increase in the number of alleles was observed at 18 out of 22 aSTR loci and was attributed to the sequence variations residing in both the STR repeat motifs and flanking regions. The study uncovered 25 novel aSTR allelic sequences across 12 loci for which GenBank records did not previously exist in the STRSeq BioProject, PRJNA380127. For a concordance check, the length-based allelic calls derived from the full sequences were compared to those genotyped using capillary electrophoresis (CE) methods. Population genetic parameters relevant to the evaluation of forensic DNA evidence were assessed for the sequence-based data and compared to the parameters generated from the length-based information. Using the sequence-based data, Analysis of MOlecular VAriance (AMOVA), genetic distances, and population genetic structure were evaluated for 1231 individuals sampled from the Lebanese and four U.S. populations (African American, Asian, Caucasian, and Hispanic). The results were tabulated and visualized in a population tree, multidimensional scaling scatter plots, and bar plots. This newly established sequence-based database for the Lebanese population can be beneficial for extending NGS applicability to casework or paternity testing and assessing the strength of evidence for NGS-STR profiles. The described novel sequence variants at certain loci can further help in the effort to characterize the sequence diversity of STR markers from different populations around the world.
Collapse
Affiliation(s)
- Sarah Riman
- Applied Genetics Group, National Institute of Standards and Technology, Gaithersburg, MD 20899, USA.
| | - Mirna Ghemrawi
- Department of Chemistry and Biochemistry and International Forensic Research Institute, Florida International University, Miami, FL 33199, USA
| | - Lisa A Borsuk
- Applied Genetics Group, National Institute of Standards and Technology, Gaithersburg, MD 20899, USA
| | - Rami Mahfouz
- Department of Pathology and Laboratory Medicine, American University of Beirut Medical Center, Beirut, Lebanon
| | - Susan Walsh
- Department of Biology, Indiana University Purdue University Indianapolis, Indianapolis, IN 46202, USA
| | - Peter M Vallone
- Applied Genetics Group, National Institute of Standards and Technology, Gaithersburg, MD 20899, USA
| |
Collapse
|
5
|
Sala A, Marino M, Guinudinik A, Marcucci V, Cano H, Rey SV, Bobillo C, Castagnola J, Garrigós-Calivares L, Ginart S, Caputo M, Corach D. Detection of a novel 16.3 variant allele at locus DYS533 in R1b males inhabiting southern South America: A 19-nucleotide insertion explains its origin based on Sanger sequencing results. Forensic Sci Int Genet 2023; 62:102789. [PMID: 36252401 DOI: 10.1016/j.fsigen.2022.102789] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2022] [Revised: 08/24/2022] [Accepted: 10/07/2022] [Indexed: 12/14/2022]
Abstract
We typed 1541 Y-STR haplotypes from reference samples along forensic casework investigations. In three haplotypes, we detected a variant allele designed as 16.3 at locus DYS533. This was confirmed by amplification using two commercial kits. Sanger sequencing revealing a novel motif corresponding to [TATC]12 repeats with a 19-bp insertion in the flanking upstream region. We propose its origin as an insertion at - 9.1 upstream of the repeat motifs. We searched other local databases and found this allele in various geographical areas of Argentina and neighbouring countries. The haplotypes share a common core of 10 Y-STRs (DYS389-I/13; DYS389-II/30; DYS19/14; DYS481/22; DYS438/12; DYS437/16; DYS635/23; DYS392/13; DYS393/13; GATA H4/11) and belong to the R1b haplogroup. This 16.3 allele is restricted to southern South America, which allows us to propose a local and relatively recent origin. The sequence described herein constitutes a novelty that could be considered in future criteria for the nomenclature of STRs based on massively parallel sequencing.
Collapse
Affiliation(s)
- Andrea Sala
- Universidad de Buenos Aires - Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética. Cátedra de Genética Forense y Servicio de Huellas Digitales Genéticas (SHDG), Junin 956, Ciudad Autónoma de Buenos Aires 1113, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas-CONICET, Argentina.
| | - Miguel Marino
- Registro Provincial de Huellas Genéticas Digitalizadas, Lab. de Genética Forense-Ministerio Público Fiscal de Mendoza, Argentina
| | - Alejandra Guinudinik
- Servicio de Biología Molecular del Cuerpo de Investigaciones Fiscales, Ministerio Público de Salta, Argentina
| | - Valeria Marcucci
- Laboratorio Regional de Investigación Forense, Tribunal Superior de Justicia de Santa Cruz, Argentina
| | - Hortensia Cano
- Laboratorio Regional de Investigación Forense, Tribunal Superior de Justicia de Santa Cruz, Argentina
| | - Silvia Vannelli Rey
- Laboratorio Regional de Genética Forense, Ministerio Público de Río Negro, Argentina
| | - Cecilia Bobillo
- Laboratorio de Genética Forense, Ministerio Público de La Pampa, Argentina
| | - Josefina Castagnola
- Universidad de Buenos Aires - Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética. Cátedra de Genética Forense y Servicio de Huellas Digitales Genéticas (SHDG), Junin 956, Ciudad Autónoma de Buenos Aires 1113, Argentina
| | - Lucía Garrigós-Calivares
- Universidad de Buenos Aires - Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética. Cátedra de Genética Forense y Servicio de Huellas Digitales Genéticas (SHDG), Junin 956, Ciudad Autónoma de Buenos Aires 1113, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas-CONICET, Argentina
| | - Santiago Ginart
- Universidad de Buenos Aires - Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética. Cátedra de Genética Forense y Servicio de Huellas Digitales Genéticas (SHDG), Junin 956, Ciudad Autónoma de Buenos Aires 1113, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas-CONICET, Argentina
| | - Mariela Caputo
- Universidad de Buenos Aires - Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética. Cátedra de Genética Forense y Servicio de Huellas Digitales Genéticas (SHDG), Junin 956, Ciudad Autónoma de Buenos Aires 1113, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas-CONICET, Argentina
| | - Daniel Corach
- Universidad de Buenos Aires - Facultad de Farmacia y Bioquímica, Departamento de Microbiología, Inmunología, Biotecnología y Genética. Cátedra de Genética Forense y Servicio de Huellas Digitales Genéticas (SHDG), Junin 956, Ciudad Autónoma de Buenos Aires 1113, Argentina; Consejo Nacional de Investigaciones Científicas y Técnicas-CONICET, Argentina
| |
Collapse
|
6
|
Assessing Sequence Variation and Genetic Diversity of Currently Untapped Y-STR Loci. FORENSIC SCIENCE INTERNATIONAL: REPORTS 2022. [DOI: 10.1016/j.fsir.2022.100298] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
|
7
|
de Knijff P. On the Forensic Use of Y-Chromosome Polymorphisms. Genes (Basel) 2022; 13:genes13050898. [PMID: 35627283 PMCID: PMC9141910 DOI: 10.3390/genes13050898] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2022] [Revised: 05/09/2022] [Accepted: 05/13/2022] [Indexed: 12/26/2022] Open
Abstract
Nowadays, the use of Y-chromosome polymorphisms forms an essential part of many forensic DNA investigations. However, this was not always the case. Only since 1992 have we seen that some forensic scientists started to have an interest in this chromosome. In this review, I will sketch a brief history focusing on the forensic use of Y-chromosome polymorphisms. Before describing the various applications of short-tandem repeats (STRs) and single nucleotide polymorphisms (SNPs) on the Y-chromosome, I will discuss a few often ignored aspects influencing proper use and interpretation of Y-chromosome information: (i) genotyping Y-SNPs and Y-STRs, (ii) Y-STR haplotypes shared identical by state (IBS) or identical by descent (IBD), and (iii) Y-haplotype database frequencies.
Collapse
Affiliation(s)
- Peter de Knijff
- Department of Human Genetics, Leiden University Medical Center, P.O. Box 9600, 2300 RC Leiden, The Netherlands
| |
Collapse
|
8
|
Sequence polymorphisms of forensic Y-STRs revealed by a 68-plex in-house massively parallel sequencing panel. Forensic Sci Int Genet 2022; 59:102727. [DOI: 10.1016/j.fsigen.2022.102727] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/30/2022] [Revised: 05/03/2022] [Accepted: 05/23/2022] [Indexed: 11/20/2022]
|
9
|
Huszar TI, Bodmer WF, Hutnik K, Wetton JH, Jobling MA. Sequencing of autosomal, mitochondrial and Y-chromosomal forensic markers in the People of the British Isles cohort detects population structure dominated by patrilineages. Forensic Sci Int Genet 2022; 59:102725. [DOI: 10.1016/j.fsigen.2022.102725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/07/2022] [Revised: 05/08/2022] [Accepted: 05/13/2022] [Indexed: 11/27/2022]
|
10
|
A multi-dimensional evaluation of the 'NIST 1032' sample set across four forensic Y-STR multiplexes. Forensic Sci Int Genet 2022; 57:102655. [PMID: 35007854 PMCID: PMC9901497 DOI: 10.1016/j.fsigen.2021.102655] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2021] [Revised: 12/09/2021] [Accepted: 12/12/2021] [Indexed: 02/08/2023]
Abstract
This manuscript reports Y-chromosomal short tandem repeat (Y-STR) haplotypes for 1032 male U.S. population samples across 30 Y-STR loci characterized by three capillary electrophoresis (CE) length-based kits (PowerPlex Y23 System, Yfiler Plus PCR Amplification Kit, and Investigator Argus Y-28 QS Kit) and one sequence-based kit (ForenSeq DNA Signature Prep Kit): DYF387S1, DYS19, DYS385 a/b, DYS389I, DYS389II, DYS390, DYS391, DYS392, DYS393, DYS437, DYS438, DYS439, DYS448, DYS449, DYS456, DYS458, DYS460, DYS481, DYS505, DYS518, DYS522, DYS533, DYS549, DYS570, DYS576, DYS612, DYS627, DYS635, DYS643, and Y-GATA-H4. The length-based Y-STR haplotypes include six loci that are not reported in the sequence-based kit (DYS393, DYS449, DYS456, DYS458, DYS518, and DYS627), whereas three loci included in the sequence-based kit are not present in length-based kits (DYS505, DYS522, and DYS612). For the latter, a custom multiplex was used to generate CE length-based data, allowing 1032 samples to be evaluated for concordance across the 30 Y-STR loci included in these four commercial Y-STR typing kits. Discordances between typing methods were analyzed further to assess underlying causes such as primer binding site mutations and flanking region insertions/deletions. Allele-level frequency and statistical information is provided for sequenced loci, excluding the multi-copy loci DYF387S1 and DYS385 a/b, for which locus-specific haplotype-level frequencies are provided instead. The resulting data reveals the degree of information gained through sequencing: 88% of sequenced Y-STR loci contain additional sequence-based alleles compared to length-based data, with the DYS389II locus containing the most additional alleles (51) observed by sequencing. Despite these allelic increases, only minimal improvement was observed in haplotype resolution by sequence, with all four commercial kits providing a similar ability to differentiate length-based haplotypes in this sample set. Finally, a subset of 369 male samples were compared to their corresponding additionally sequenced father samples, revealing the sequence basis for the 50 length-based changes observed, and no additional sequence-based mutations. GenBank accession numbers are reported for each unique sequence, and associated records are available in the STRSeq Y-Chromosomal STR Loci National Center for Biotechnology Information (NCBI) BioProject, accession PRJNA380347. Haplotype data is updated in the Y-STR Haplotype Reference Database (YHRD) for the 'NIST 1032' data set to now achieve the level of maximal haplotype of YHRD. All supplementary files including revisions to previously published Y-STR data are available in the NIST Public Data Repository: U.S. population data for human identification markers, DOI 10.18434/t4/1500024.
Collapse
|
11
|
Moon MH, Hong SR, Shin KJ. Sequence Variations of 31 Υ-Chromosomal Short Tandem Repeats Analyzed by Massively Parallel Sequencing in Three U.S. Population Groups and Korean Population. J Korean Med Sci 2022; 37:e40. [PMID: 35166077 PMCID: PMC8845103 DOI: 10.3346/jkms.2022.37.e40] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 10/26/2021] [Accepted: 12/19/2021] [Indexed: 12/02/2022] Open
Abstract
BACKGROUND Rapidly mutating (RM) Y-chromosomal short tandem repeats (Y-STRs) have been demonstrated to increase the possibility of distinguishing between male relatives due to a higher mutation rate than conventional Y-STRs. Massively parallel sequencing (MPS) can be useful for forensic DNA typing as it allows the detection of sequence variants of many forensic markers. Here, we present sequence variations of 31 Y-STRs including nine RM Y-STRs (DYF387S1, DYF399S1, DYF404S1, DYS449, DYS518, DYS570, DYS576, DYS612, and DYS627), their frequencies, distribution, and the gain in the number of alleles using MPS. METHODS We constructed a multiplex MPS assay capable of simultaneously amplifying 32 Y-chromosomal markers, producing amplicons ranging from 85-274 bp. Barcoded libraries from 220 unrelated males from four populations-African Americans, Caucasians, Hispanics, and Koreans-were generated via two-step polymerase chain reaction and sequenced on a MiSeq system. Genotype concordance between the capillary electrophoresis (CE) and MPS method and sequence variation of Y-STRs were investigated. RESULTS In total, 195 alleles were increased by MPS compared to CE-based alleles (261 to 456). The DYS518 marker showed the largest increase due to repeat region variation (a 3.69-fold increase). The highest increase in the number of alleles due to single nucleotide polymorphisms in the flanking region was found in DYF399S1. RM Y-STRs had more diverse sequences than conventional Y-STRs. Furthermore, null alleles were observed in DYS576 due to primer-binding site mutation, and allele drop-outs in DYS449 resulted from low marker coverage of less than the threshold. CONCLUSION The results suggest that the expanded and discriminative MPS assay could provide more genetic information for Y-STRs, especially for RM Y-STRs, and could advance male individualization. Compiling sequence-based Y-STR data for worldwide populations would facilitate the application of MPS in the field of forensic genetics and could be applicable in solving male-related forensic cases.
Collapse
Affiliation(s)
- Mi Hyeon Moon
- Department of Forensic Medicine, Yonsei University College of Medicine, Seoul, Korea
- Graduate School of Medical Science and Brain Korea 21 Project, Yonsei University, Seoul, Korea
| | - Sae Rom Hong
- Department of Forensic Medicine, Yonsei University College of Medicine, Seoul, Korea
| | - Kyoung-Jin Shin
- Department of Forensic Medicine, Yonsei University College of Medicine, Seoul, Korea
- Graduate School of Medical Science and Brain Korea 21 Project, Yonsei University, Seoul, Korea.
| |
Collapse
|
12
|
Tyazhelova TV, Kuznetsova IL, Andreeva TV, Kunizheva SS, Rogaev EI. Application of Massive Parallel Sequencing Technology in Forensics: Comparative Analysis of Sequencing Platforms. RUSS J GENET+ 2021. [DOI: 10.1134/s1022795421120127] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/23/2023]
|
13
|
An Introductory Overview of Open-Source and Commercial Software Options for the Analysis of Forensic Sequencing Data. Genes (Basel) 2021; 12:genes12111739. [PMID: 34828345 PMCID: PMC8618049 DOI: 10.3390/genes12111739] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 10/27/2021] [Accepted: 10/27/2021] [Indexed: 12/30/2022] Open
Abstract
The top challenges of adopting new methods to forensic DNA analysis in routine laboratories are often the capital investment and the expertise required to implement and validate such methods locally. In the case of next-generation sequencing, in the last decade, several specifically forensic commercial options became available, offering reliable and validated solutions. Despite this, the readily available expertise to analyze, interpret and understand such data is still perceived to be lagging behind. This review gives an introductory overview for the forensic scientists who are at the beginning of their journey with implementing next-generation sequencing locally and because most in the field do not have a bioinformatics background may find it difficult to navigate the new terms and analysis options available. The currently available open-source and commercial software for forensic sequencing data analysis are summarized here to provide an accessible starting point for those fairly new to the forensic application of massively parallel sequencing.
Collapse
|
14
|
Massively parallel sequencing and capillary electrophoresis of a novel panel of falcon STRs: Concordance with minisatellite DNA profiles from historical wildlife crime. Forensic Sci Int Genet 2021; 54:102550. [PMID: 34174583 PMCID: PMC8430417 DOI: 10.1016/j.fsigen.2021.102550] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2021] [Revised: 06/03/2021] [Accepted: 06/04/2021] [Indexed: 12/12/2022]
Abstract
Birds of prey have suffered persecution for centuries through trapping, shooting, poisoning and theft from the wild to meet the demand from egg collectors and falconers; they were also amongst the earliest beneficiaries of DNA testing in wildlife forensics. Here we report the identification and characterisation of 14 novel tetramer, pentamer and hexamer short tandem repeat (STR) markers which can be typed either by capillary electrophoresis or massively parallel sequencing (MPS) and apply them to historical casework samples involving 49 peregrine falcons, 30 of which were claimed to be the captively bred offspring of nine pairs. The birds were initially tested in 1994 with a multilocus DNA fingerprinting probe, a sex test and eight single-locus minisatellite probes (SLPs) demonstrating that 23 birds were unrelated to the claimed parents. The multilocus and SLP approaches were highly discriminating but extremely time consuming and required microgram quantities of high molecular weight DNA and the use of radioisotopes. The STR markers displayed between 2 and 21 alleles per locus (mean = 7.6), lengths between 140 and 360 bp, and heterozygosities from 0.4 to 0.93. They produced wholly concordant conclusions with similar discrimination power but in a fraction of the time using a hundred-fold less DNA and with standard forensic equipment. Furthermore, eleven of these STRs were amplified in a single reaction and typed using MPS on the Illumina MiSeq platform revealing eight additional alleles (three with variant repeat structures and five solely due to flanking SNPs) across four loci. This approach gave a random match probability of < 1E-9, and a parental pair false inclusion probability of < 1E-5, with a further ten-fold reduction in the amount of DNA required (~3 ng) and the potential to analyse mixed samples. These STRs will be of value in monitoring wild populations of these key indicator species as well as for testing captive breeding claims and establishing a database of captive raptors. They have the potential to resolve complex cases involving trace, mixed and degraded samples from raptor persecution casework representing a significant advance over the previously applied methods.
Collapse
|
15
|
Bredemeyer S, Roewer L, Willuweit S. Next generation sequencing of Y-STRs in father-son pairs and comparison with traditional capillary electrophoresis. Forensic Sci Res 2021; 7:484-489. [PMID: 36353309 PMCID: PMC9639519 DOI: 10.1080/20961790.2021.1898078] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
To evaluate the promising advantages of massively parallel sequencing (MPS) in our casework, we analysed a total of 33 Y-chromosomal short tandem repeats (Y-STRs) with traditional capillary electrophoresis (CE) and 25 Y-STRs using the newer MPS technology. We studied the outcome of both technologies in 64 father-son pairs using stock and custom-designed kits. Current MPS technology confirmed the 13 mutational events observed with CE and improved our understanding of the complex nature of STR mutations. By detecting isometric sequence variants between unrelated males, we show that sequencing Y-STRs using MPS can boost discrimination power.
Collapse
Affiliation(s)
- Steffi Bredemeyer
- Department of Forensic Genetics, Institute of Legal Medicine and Forensic Sciences, Charité – Universitätsmedizin Berlin, Berlin, Germany
| | - Lutz Roewer
- Department of Forensic Genetics, Institute of Legal Medicine and Forensic Sciences, Charité – Universitätsmedizin Berlin, Berlin, Germany
| | - Sascha Willuweit
- Department of Forensic Genetics, Institute of Legal Medicine and Forensic Sciences, Charité – Universitätsmedizin Berlin, Berlin, Germany
| |
Collapse
|
16
|
Khubrani YM, Jobling MA, Wetton JH. Massively parallel sequencing of sex-chromosomal STRs in Saudi Arabia reveals patrilineage-associated sequence variants. Forensic Sci Int Genet 2020; 49:102402. [PMID: 33035796 DOI: 10.1016/j.fsigen.2020.102402] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 09/18/2020] [Accepted: 09/27/2020] [Indexed: 11/27/2022]
Abstract
Massively parallel sequencing (MPS) of forensic STRs has the potential to reveal additional allele diversity compared to conventional capillary electrophoresis (CE) typing strategies, but population studies are currently relatively few in number. The Verogen ForenSeq™ DNA Signature Prep Kit includes both Y-STRs and X-STRs among its targeted loci, and here we report the sequences of these loci, analysed using Verogen's ForenSeq™ Universal Analysis Software (UAS) v1.3 and STRait Razor v3.0, in a representative sample of 89 Saudi Arabian males. We identified 56 length variants (equivalent to CE alleles) and 75 repeat sequence sub-variants across the six X-STRs analysed; equivalent figures for the set of 24 Y-STRs were 147 and 192 respectively. We also observed two flanking sequence variants for the X-, and six for the Y-STRs. Recovery of sequence data and concordance with CE data (where available) across the tested loci was good, though rare flanking variation affected interpretation and allele calling at DYF387S1 and DXS7132. Examination of flanking sequences of the Y-STRs revealed five SNPs (L255, M4790, BY7692, Z16708 and S17543) previously shown to define specific haplogroups by Y-chromosome sequencing. These define Y-haplogroups in 62 % of our sample, a proportion that increases to 91 % when haplogroup-associated repeat-sequence motifs are also considered. A population-level comparison of the Saudi Arabian X-STRs with a global sample showed our dataset to be part of a large cluster of populations of West Eurasian and Middle Eastern origin.
Collapse
Affiliation(s)
- Yahya M Khubrani
- Department of Genetics & Genome Biology, University of Leicester, University Road, Leicester, UK; Forensic Genetics Laboratory, General Administration of Criminal Evidence, Public Security, Ministry of Interior, Saudi Arabia
| | - Mark A Jobling
- Department of Genetics & Genome Biology, University of Leicester, University Road, Leicester, UK.
| | - Jon H Wetton
- Department of Genetics & Genome Biology, University of Leicester, University Road, Leicester, UK.
| |
Collapse
|
17
|
Simayijiang H, Morling N, Børsting C. Sequencing of human identification markers in an Uyghur population using the MiSeq FGxTM Forensic Genomics System. Forensic Sci Res 2020; 7:154-162. [PMID: 35784409 PMCID: PMC9246034 DOI: 10.1080/20961790.2020.1779967] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/03/2022] Open
Abstract
Massively parallel sequencing (MPS) offers a useful alternative to capillary electrophoresis (CE) based analysis of human identification markers in forensic genetics. By sequencing short tandem repeats (STRs) instead of determining the fragment lengths by CE, the sequence variation within the repeat region and the flanking regions may be identified. In this study, we typed 264 Uyghur individuals using the MiSeq FGx™ Forensic Genomics System and Primer Mix A of the ForenSeq™ DNA Signature Prep Kit that amplifies 27 autosomal STRs, 25 Y-STRs, seven X-STRs, and 94 HID-SNPs. STRinNGS v.1.0 and GATK 3.6 were used to analyse the STR regions and HID-SNPs, respectively. Increased allelic diversity was observed for 33 STRs with the PCR-MPS assay. The largest increases were found in DYS389II and D12S391, where the numbers of sequenced alleles were 3–4 times larger than those of alleles determined by repeat length alone. A relatively large number of flanking region variants (28 SNPs and three InDels) were observed in the Uyghur population. Seventeen of the flanking region SNPs were rare, and 12 of these SNPs had no accession number in dbSNP. The combined mean match probability and typical paternity index based on 26 sequenced autosomal STRs were 3.85E−36 and 1.49E + 16, respectively. This was 10 000 times lower and 1 000 times higher, respectively, than the same parameters calculated from STR repeat lengths.Key Points Sequencing data on STRs and SNPs used for human identification are presented for the Uyghur population. STRinNGS v.1.0 was used to analyse the flanking regions of STRs. The concordance between PCR-CE and PCR-MPS results was 99.86%. Detection of sequence variation in STRs and their flanking regions increased the allelic diversity.
Collapse
Affiliation(s)
- Halimureti Simayijiang
- Faculty of Health and Medical Sciences, Section of Forensic Genetics, Department of Forensic Medicine, University of Copenhagen, Copenhagen, Denmark
- Faculty of Criminal Science and Technology, Xinjiang Police College, Xinjiang, China
| | - Niels Morling
- Faculty of Health and Medical Sciences, Section of Forensic Genetics, Department of Forensic Medicine, University of Copenhagen, Copenhagen, Denmark
| | - Claus Børsting
- Faculty of Health and Medical Sciences, Section of Forensic Genetics, Department of Forensic Medicine, University of Copenhagen, Copenhagen, Denmark
| |
Collapse
|
18
|
STRinNGS v2.0: Improved tool for analysis and reporting of STR sequencing data. Forensic Sci Int Genet 2020; 48:102331. [DOI: 10.1016/j.fsigen.2020.102331] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/30/2019] [Revised: 04/23/2020] [Accepted: 06/04/2020] [Indexed: 11/18/2022]
|
19
|
Zieger M, Utz S. The Y-chromosomal haplotype and haplogroup distribution of modern Switzerland still reflects the alpine divide as a geographical barrier for human migration. Forensic Sci Int Genet 2020; 48:102345. [PMID: 32622325 DOI: 10.1016/j.fsigen.2020.102345] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 06/16/2020] [Accepted: 06/17/2020] [Indexed: 12/09/2022]
Abstract
A sample of 606 Swiss individuals has been characterized for 27 Y-STR and 34 Y-SNPs, defining major European haplogroups. For the first time, a subsample from the southernmost part of Switzerland, the Italian speaking canton Ticino, has been included. The data reveals significant intra-national differences in the distribution of haplogroups R1b-U106, R1b-U152, I1 and J2a north and south of the alpine divide, with R1b-U152 being the most frequent haplogroup among all Swiss subpopulations, reaching 26 % in average and 53 % in the Ticino sample. In addition, a high percentage of haplogroup E1b1b-M35 in Eastern Switzerland corresponds well with data reported from Western Austria. In general, we detected a low level of differentiation between the subgroups north of the alpine divide. The dataset also revealed a variety of microvariants. Some of them were previously known to be associated with particular haplogroups. However, we discovered one microvariant in DYS533 that seems to be closely associated with haplogroup I2-P215 (xM223). This association had not yet been reported to date. The concordance study with two STR-kits suggests that the DYS533 microvariant is due to an InDel in the flanking regions of the marker. One individual carried a large deletion, frequently detected in people of East Asian ancestry, encompassing the amelogenin locus. To our knowledge, this is the first time that such a deletion has been observed within European haplogroup R1b-U152. This is the first comprehensive Y chromosomal dataset for Switzerland, demonstrating significant population substructure due to an intra-national geographical barrier.
Collapse
Affiliation(s)
- Martin Zieger
- Institute of Forensic Medicine, Forensic Molecular BiologyDpt., University of Bern, Sulgenauweg 40, 3007, Bern, Switzerland.
| | - Silvia Utz
- Institute of Forensic Medicine, Forensic Molecular BiologyDpt., University of Bern, Sulgenauweg 40, 3007, Bern, Switzerland.
| |
Collapse
|
20
|
Kasu M, Fraser M, D’Amato ME. UniQ-Typer ™ Y-10 genotyping in South African populations: novel alleles, sequence variation and allelic ladder updates. FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES 2019. [DOI: 10.1016/j.fsigss.2019.10.056] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/25/2022]
|
21
|
Gettings KB, Ballard D, Bodner M, Borsuk LA, King JL, Parson W, Phillips C. Report from the STRAND Working Group on the 2019 STR sequence nomenclature meeting. Forensic Sci Int Genet 2019; 43:102165. [PMID: 31586814 PMCID: PMC8796868 DOI: 10.1016/j.fsigen.2019.102165] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Revised: 09/19/2019] [Accepted: 09/20/2019] [Indexed: 01/07/2023]
Abstract
This report summarizes topics discussed at the STR sequence nomenclature meeting hosted by the STRAND Working Group in April 2019. Invited attendees for this meeting included researchers known-to-us to be developing STR sequence-based nomenclature schemata, scientific representatives from vendors developing STR sequence bioinformatic methods, DNA intelligence database curators, and academic experts in STR genomics. The goal of this meeting was to provide a forum for individuals developing nomenclature schemata to present and discuss their ideas, encouraging mutual awareness, identification of differences in approaches, opposing aspects, and opportunities for parallelization while some approaches are still under development.
Collapse
Affiliation(s)
- Katherine Butler Gettings
- U.S. National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Drive, Gaithersburg, MD, 20899, USA.
| | - David Ballard
- King's Forensics, King's College London, Franklin-Wilkins Building, 150 Stamford Street, London, UK
| | - Martin Bodner
- Institute of Legal Medicine, Medical University of Innsbruck, Austria
| | - Lisa A Borsuk
- U.S. National Institute of Standards and Technology, Biomolecular Measurement Division, 100 Bureau Drive, Gaithersburg, MD, 20899, USA
| | - Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | - Walther Parson
- Institute of Legal Medicine, Medical University of Innsbruck, Austria; Forensic Science Program, The Pennsylvania State University, USA
| | - Christopher Phillips
- Forensic Genetics Unit, Institute of Forensic Sciences, University of Santiago de Compostela, Spain
| |
Collapse
|
22
|
A nomenclature for sequence-based forensic DNA analysis. Forensic Sci Int Genet 2019; 42:14-20. [DOI: 10.1016/j.fsigen.2019.06.001] [Citation(s) in RCA: 17] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2019] [Revised: 05/11/2019] [Accepted: 06/05/2019] [Indexed: 12/19/2022]
|
23
|
Novel Y-chromosome short tandem repeat sequence variation for loci DYS710, DYS518, DYS385, DYS644, DYS612, DYS626, DYS504, DYS481, DYS447 and DYS449. Int J Legal Med 2019; 133:1681-1689. [PMID: 30982129 DOI: 10.1007/s00414-019-02056-7] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/10/2018] [Accepted: 03/29/2019] [Indexed: 10/27/2022]
Abstract
In forensic casework, Y-chromosome short tandem repeats (Y-STRs) are essential for differentiating between unrelated males and resolving the male component of admixed biological evidence. While the majority of Y-STRs are adequate for discriminating between different paternal lineages, rapidly mutating Y-STRs are necessary for improving discrimination between males within populations of low Y-chromosome diversity and between paternal relatives. Alternatively, sequencing of Y-STRs may also improve the discrimination between isometric Y-STR alleles by identifying variation in the repeat unit pattern arrangements and by identifying SNPs in the flanking region or within the STR repeat unit itself. In this report, a total of 153 DNA sequences are presented across the Y-STR loci DYS710, DYS518, DYS385, DYS644, DYS612, DYS626, DYS504, DYS481, DYS447 and DYS449. A total of 94 Y-STR sequences provided herein are reported for the first time, of which 37 sequences represent alleles showing size homoplasy, 34 sequences of known alleles for which sequence data has been unavailable and a total of 23 novel allele sequences across loci DYS644, DS447, DYS710 and DYS504. This study further encountered a rare sequence variant in the 5' flanking region of DYS385 and a total of two SNPs in the repeat structure at DYS481 and DYS449.
Collapse
|
24
|
Huszar TI, Wetton JH, Jobling MA. Mitigating the effects of reference sequence bias in single-multiplex massively parallel sequencing of the mitochondrial DNA control region. Forensic Sci Int Genet 2019; 40:9-17. [PMID: 30682697 PMCID: PMC6461131 DOI: 10.1016/j.fsigen.2019.01.008] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Revised: 01/14/2019] [Accepted: 01/15/2019] [Indexed: 01/12/2023]
Abstract
mtDNA control region of 101 diverse samples amplified in a single reaction as 10 overlapping amplicons and sequenced via MPS. Primers create reference bias, compromising ability to call variants or heteroplasmy in primer-binding regions. Bioinformatic selection of overarching reads bypasses effects of proprietary primers and mitigates bias. Data processing permits accurate calling of variants, and heteroplasmies down to 5% level.
Sequence analysis of the mitochondrial DNA (mtDNA) control region can provide forensically useful information, particularly in challenging samples where autosomal DNA profiling fails. Sub-division of the 1122-bp region into shorter PCR fragments improves data recovery, and such fragments can be analysed together via massively parallel sequencing (MPS). Here, we generate mtDNA data using the prototype PowerSeq™ Auto/Mito/Y System (Promega) MPS assay, in which a single PCR reaction amplifies ten overlapping amplicons of the control region, in a set of 101 highly diverse samples representing most major clades of the mtDNA phylogeny. The overlapping multiplex design leads to non-uniform coverage in the regions of overlap, where it is further increased by short amplicons generated alongside the intended products. Primer sequences in targeted amplification libraries are a potential source of reference sequence bias and thus should be removed, but the proprietary nature of the primers in commercial kits necessitates an alternative approach that minimises data loss: here, we introduce the bioinformatic selection of sequencing reads spanning putative primer sites (Overarching Read Enrichment Option, OREO). While OREO performs well in mitigating the effects of primer sequences at the ends of sequence reads, we still find evidence of the internalisation of primer-derived sequences by overlap extension, which may compromise the ability to call variants or to measure heteroplasmy in primer-binding regions. The commercially available PowerSeq™ CRM Nested System design prevents primer internalisation, as shown in a reanalysis of a subset of 57 samples that contain possible heteroplasmies. In combination with OREO, the CRM Nested kit mitigates reference sequence bias, allowing heteroplasmic variants to be estimated down to a 5% threshold. Provided appropriate steps are taken in data processing, single-reaction multiplex assays represent robust tools to analyse mtDNA control region variation. The OREO approach will allow users to bypass the effects of unknown primer sequences in any single-reaction tiled multiplex and eliminate primer-derived bias in overlapping amplicon sequencing studies, in both forensic and non-forensic settings.
Collapse
Affiliation(s)
- Tunde I Huszar
- Department of Genetics & Genome Biology, University of Leicester, University Road, Leicester LE1 7RH UK
| | - Jon H Wetton
- Department of Genetics & Genome Biology, University of Leicester, University Road, Leicester LE1 7RH UK.
| | - Mark A Jobling
- Department of Genetics & Genome Biology, University of Leicester, University Road, Leicester LE1 7RH UK.
| |
Collapse
|
25
|
McCord BR, Gauthier Q, Cho S, Roig MN, Gibson-Daw GC, Young B, Taglia F, Zapico SC, Mariot RF, Lee SB, Duncan G. Forensic DNA Analysis. Anal Chem 2019; 91:673-688. [PMID: 30485738 DOI: 10.1021/acs.analchem.8b05318] [Citation(s) in RCA: 26] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Bruce R McCord
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| | - Quentin Gauthier
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| | - Sohee Cho
- Department of Forensic Medicine , Seoul National University , Seoul , 08826 , South Korea
| | - Meghan N Roig
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| | - Georgiana C Gibson-Daw
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| | - Brian Young
- Niche Vision, Inc. , Akron , Ohio 44311 , United States
| | - Fabiana Taglia
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| | - Sara C Zapico
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| | - Roberta Fogliatto Mariot
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| | - Steven B Lee
- Forensic Science Program, Justice Studies Department , San Jose State University , San Jose , California 95192 , United States
| | - George Duncan
- Department of Chemistry , Florida International University , Miami , Florida 33199 , United States
| |
Collapse
|