1
|
van der Gaag KJ, Weiler N, de Jong EAC, Hoogenboom J, van Oers P, de Leeuw RH, Graaf ESM, Kraaijenbrink T, Theelen J, Sijen T. Validation of the IDseek® OmniSTR™ Global Autosomal STR Profiling kit, reverse complement PCR as an improved tool/method for routine massively parallel sequencing of short tandem repeats. Forensic Sci Int Genet 2024; 74:103174. [PMID: 39549676 DOI: 10.1016/j.fsigen.2024.103174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2024] [Revised: 10/21/2024] [Accepted: 11/12/2024] [Indexed: 11/18/2024]
Abstract
Massively Parallel Sequencing (MPS) has gained interest in the forensic community over the past decade. Most of the published MPS methods focus on specialty applications intended for use in a limited number of samples with protocols that are relatively laborious. Recent developments using Reverse-Complement PCR enable an efficient MPS protocol suited for routine analysis of high numbers of samples. This method is implemented in the IDseek® OmniSTR™ Global Autosomal STR Profiling kit (Nimagen) for sequencing 28 of the most commonly used forensic autosomal STRs, one Y-chromosomal STR and Amelogenin. This study describes the validation of this kit and focuses on sensitivity, inhibitor tolerance, sequence variation detection and performance with mixtures up to 5 contributors. Results are compared to a Capillary Electrophoresis method (the PowerPlex® Fusion 6 C system, Promega) and the first commercial forensic MPS kit (ForenSeq™ DNA Signature prep, Qiagen) and for a concordance study with data from the Powerseq® MPS kit as well. Analysis settings in FDSTools are deduced and discussed, and an almost completely automated analysis is achieved. Using FDSTools noise correction, contributions in a mixture down to a level of 1.5 % of the major allele of a marker can be detected.
Collapse
Affiliation(s)
- Kristiaan J van der Gaag
- Division of Biological Traces, Netherlands Forensic Institute, Laan van Ypenburg 6, The Hague 2497 GB, the Netherlands.
| | - Natalie Weiler
- Division of Biological Traces, Netherlands Forensic Institute, Laan van Ypenburg 6, The Hague 2497 GB, the Netherlands
| | - Erik A C de Jong
- NimaGen B.V., Hogelandseweg 88, Nijmegen 6545 AB, the Netherlands
| | - Jerry Hoogenboom
- Division of Biological Traces, Netherlands Forensic Institute, Laan van Ypenburg 6, The Hague 2497 GB, the Netherlands
| | - Pieter van Oers
- NimaGen B.V., Hogelandseweg 88, Nijmegen 6545 AB, the Netherlands
| | - Rick H de Leeuw
- Forensic Laboratory for DNA Research, Department of Human Genetics, Leiden University Medical Center, the Netherlands
| | - Elisabeth S M Graaf
- Division of Biological Traces, Netherlands Forensic Institute, Laan van Ypenburg 6, The Hague 2497 GB, the Netherlands
| | - Thirsa Kraaijenbrink
- Forensic Laboratory for DNA Research, Department of Human Genetics, Leiden University Medical Center, the Netherlands
| | - Joop Theelen
- NimaGen B.V., Hogelandseweg 88, Nijmegen 6545 AB, the Netherlands
| | - Titia Sijen
- Division of Biological Traces, Netherlands Forensic Institute, Laan van Ypenburg 6, The Hague 2497 GB, the Netherlands; University of Amsterdam, Swammerdam Institute for Life Sciences, Amsterdam, the Netherlands
| |
Collapse
|
2
|
Agudo MM, Fantinato C, Roseth A, Aanes H, Gill P, Fonneløp AE, Bleka Ø. A comparison of likelihood ratios calculated from surface DNA mixtures using MPS and CE Technologies. Forensic Sci Int Genet 2024; 73:103111. [PMID: 39128429 DOI: 10.1016/j.fsigen.2024.103111] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2024] [Revised: 06/14/2024] [Accepted: 07/30/2024] [Indexed: 08/13/2024]
Abstract
This study evaluates the performance of analysing surface DNA samples using massively parallel sequencing (MPS) compared to traditional capillary electrophoresis (CE). A total of 30 samples were collected from various surfaces in an office environment and were analysed with CE and MPS. These were compared against 60 reference samples (office inhabitants). To identify contributors, likelihood ratios (LRs) were calculated for MPS and CE data using the probabilistic genotyping software MPSproto and EuroForMix respectively. Although a higher number of sequences/peaks were observed per DNA profile in MPS compared to CE, LR values were found to be lower for MPS data formats. This might be the result of the increased complexity of MPS data, along with a possible elevation of unknown alleles and/or artefacts. The study highlights avenues for improving MPS data quality and analysis to facilitate more robust interpretation of challenging casework-like samples.
Collapse
Affiliation(s)
- Maria Martin Agudo
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Chiara Fantinato
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Arne Roseth
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway
| | - Håvard Aanes
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway
| | - Peter Gill
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | | | - Øyvind Bleka
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
3
|
Rye MS, Hymus CM. Discordance of Penta D x.4 microvariant alleles results between three capillary electrophoresis and one massively parallel sequencing short tandem repeat kits. Forensic Sci Int Genet 2024; 73:103112. [PMID: 39096603 DOI: 10.1016/j.fsigen.2024.103112] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 06/12/2024] [Accepted: 07/30/2024] [Indexed: 08/05/2024]
Abstract
Forensic Biology is contingent upon matching DNA profiles between a crime sample and a reference sample. There are several capillary electrophoresis kits available to generate a short tandem repeat (STR) profile from DNA samples, while newer methods using massively parallel sequencing are slowly being implemented in forensic laboratories worldwide. During evaluation of a newer capillary electrophoresis kit, Applied Biosystems™ VeriFiler™ Plus, a discordance was observed in the Penta D locus. The previous kit, Promega PowerPlex 21® System produced a 13.4,14 genotype, whilst VeriFiler™ Plus produced a 14,14 genotype. An expanded investigation into Penta D microvariant alleles revealed that multiple discordances were observed for DNA profiles containing larger x.4 variants. There was full concordance between PowerPlex® 21 and QIAGEN Investigator® 26plex, however discordances were observed between VeriFiler™ Plus and the other three kits tested, including the massively parallel sequencing kit, Verogen ForenSeq® MainstAY. Notably, four of these discordances resulted in null alleles with the VeriFiler™ Plus kit. A review of the Penta D DNA sequences in MainstAY revealed fully concordant microvariant alleles involved deletions within the repeat region, whilst variability in the discordances observed were dependent on the location of the variation outside the repeat region and the analysis method used. Variations observed within the 5' flanking region produced the same allele designation across all capillary electrophoresis kits. However, deletions within the 3' region either produced a null allele for VeriFiler™ Plus where the deletion is thought to overlap the primer binding site, or microvariant alleles for the PowerPlex® 21 and Investigator 26plex kits, which produced longer Penta D amplicons. The discovery of these variations in the Penta D flanking sequences is informative as it increases the awareness of Penta D discordances between different kit chemistries in nominated reference DNA profile comparisons and DNA database searching and matching alike, and provides support for this phenomenon when providing evidence as to the admissibility of such results in trial proceedings.
Collapse
Affiliation(s)
- Marie S Rye
- Forensic Biology Laboratory, PathWest Laboratory Medicine WA, Locked Bag 2009, Nedlands, Western Australia 6009, Australia.
| | - Colby M Hymus
- Forensic Biology Laboratory, PathWest Laboratory Medicine WA, Locked Bag 2009, Nedlands, Western Australia 6009, Australia
| |
Collapse
|
4
|
Kocsis B, Mátrai N, Bárány G, Tömöry G, Heinrich A, Egyed B. Internal validation of the Precision ID GlobalFiler NGS STR panel v2 kit with locus-specific analytical threshold, and with special regard to mixtures and low template DNA detection. Forensic Sci Int Genet 2024; 74:103159. [PMID: 39490047 DOI: 10.1016/j.fsigen.2024.103159] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2024] [Revised: 10/06/2024] [Accepted: 10/06/2024] [Indexed: 11/05/2024]
Abstract
We performed an internal laboratory validation of the Precision ID GlobalFiler NGS STR panel v2 kit to assist the introduction of the technology into the routine forensic casework practice. The study was designed and evaluated based not only on the key validation standards like sensitivity, stability, reproducibility, repeatability, mixture, and concordance, but we also tested the effect of reduced input DNA, we measured and applied locus-specific analytical threshold values, tested two different PCR cycle conditions, sequence artifacts and stutters were also analysed. During the study we also tested the new method on real casework samples. The sensitivity study confirmed that adding 500 pg template DNA for library preparation can be optimal at base PCR cycle number (that was 24), because the measured average heterozygote balance was not lower than 0.82, and each allele was detected above the analytical threshold. However, contrary to previous communications, increasing the PCR cycle numbers up to 28 has not resulted the significant elevation of the heterozygote imbalance. According to our results, raised PCR cycle condition (i.e. 28) is appropriate at or below 150 pg total input DNA. For most loci, the calculated AT was lower than the manufacturer's recommended. Applying the newly established ATs with raised PCR cycle conditions the allele detection sensitivity and reliability increased. We observed allele dropouts only at the 15 pg template DNA experiments with 5 % frequency, that is better to previously published studies. This result indicates that this low amount of DNA (i.e. 15 pg) could be a minimum limit of template input for a potentially successful analysis. In the mixture study the minor contributor could be detected up to 1:19 mixture ratio. We detected minor alleles in all measurements and concentrations above the threshold if the template DNA were fixed and only SNP differences were observed between the same alleles of the contributors. To test concordance between the new method and traditional STR genotyping we analysed 58 Hungarian individual samples in parallel. Nearby the detected 248 different length-based alleles on the 31 loci in the sample pool we revealed additional 75 sequence variant alleles, that represent an approximately 23 % increase in the total number of observed alleles. The casework study confirmed that the Precision ID GlobalFiler NGS STR panel v2 kit is effective even in genotyping degraded samples with extremely low levels of DNA, if we apply elevated cycle number for library preparation and use locus-specific analytical thresholds.
Collapse
Affiliation(s)
- Balázs Kocsis
- Doctoral School of Biology, ELTE Eötvös Loránd University, Budapest, Hungary; Department of Genetics, Hungarian Institute for Forensic Sciences, Budapest, Hungary
| | - Norbert Mátrai
- Department of Genetics, Hungarian Institute for Forensic Sciences, Budapest, Hungary
| | - Gusztáv Bárány
- Department of Genetics, Hungarian Institute for Forensic Sciences, Budapest, Hungary
| | - Gyöngyvér Tömöry
- Department of Genetics, Hungarian Institute for Forensic Sciences, Budapest, Hungary
| | - Attila Heinrich
- Department of Genetics, Hungarian Institute for Forensic Sciences, Budapest, Hungary
| | - Balázs Egyed
- Department of Genetics, ELTE Eötvös Loránd University, Budapest, Hungary.
| |
Collapse
|
5
|
Agudo MM, Aanes H, Albert M, Janssen K, Gill P, Bleka Ø. An overview of autosomal STRs and identity SNPs in a Norwegian population using massively parallel sequencing. Forensic Sci Int Genet 2024; 71:103057. [PMID: 38733649 DOI: 10.1016/j.fsigen.2024.103057] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 02/27/2024] [Accepted: 04/28/2024] [Indexed: 05/13/2024]
Abstract
In recent years, probabilistic genotyping software has been adapted for the analysis of massively parallel sequencing (MPS) forensic data. Likelihood ratios (LR) are based on allele frequencies selected from populations of interest. This study provides an outline of sequence-based (SB) allele frequencies for autosomal short tandem repeats (aSTRs) and identity single nucleotide polymorphisms (iSNPs) in 371 individuals from Southern Norway. 27 aSTRs and 94 iSNPs were previously analysed with the ForenSeq™ DNA Signature Prep Kit (Verogen). The number of alleles with frequencies less than 0.05 for sequenced-based alleles was 4.6 times higher than for length-based alleles. Consistent with previous studies, it was observed that sequence-based data (both with and without flanks) exhibited higher allele diversity compared to length-based (LB) data; random match probabilities were lower for SB alleles confirming their advantage to discriminate between individuals. Two alleles in markers D22S1045 and Penta D were observed with SNPs in the 3´ flanking region, which have not been reported before. Also, a novel SNP with a minor allele frequency (MAF) of 0.001, was found in marker TH01. The impact of the sample size on minor allele frequency (MAF) values was studied in 88 iSNPs from Southern Norway (n = 371). The findings were then compared to a larger Norwegian population dataset (n = 15,769). The results showed that the smaller Southern Norway dataset provided similar results, and it was a representative sample. Population structure was analyzed for regions within Southern Norway; FST estimates for aSTR and iSNPs did not indicate any genetic structure. Finally, we investigated the genetic differences between Southern Norway and two other populations: Northern Norway and Denmark. Allele frequencies between these populations were compared, and we found no significant frequency differences (p-values > 0.0001). We also calculated the pairwise FST values per marker and comparisons between Southern and Northern Norway showed small differences. In contrast, the comparisons between Southern Norway and Denmark showed higher FST values for some markers, possibly driven by distinct alleles that were present in only one of the populations. In summary, we propose that allele frequencies from each population considered in this study could be used interchangeably to calculate genotype probabilities.
Collapse
Affiliation(s)
- Maria Martin Agudo
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Håvard Aanes
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway
| | - Michel Albert
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway
| | - Kirstin Janssen
- Centre for Forensic Genetics, UiT The Arctic University of Norway, Norway
| | - Peter Gill
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway; Department of Forensic Medicine, Institute of Clinical Medicine, University of Oslo, Oslo, Norway
| | - Øyvind Bleka
- Department of Forensic Sciences, Oslo University Hospital, Oslo, Norway.
| |
Collapse
|
6
|
Dierig L, Kunz SN, Wiegand P. Comparison of massively parallel sequencing to capillary electrophoresis for short tandem repeat genotyping of trace DNA. Electrophoresis 2024; 45:451-462. [PMID: 38085164 DOI: 10.1002/elps.202300145] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2023] [Revised: 09/08/2023] [Accepted: 10/23/2023] [Indexed: 03/20/2024]
Abstract
In forensic genetics, massively parallel sequencing (MPS) offers several advantages over the current golden standard, capillary electrophoresis (CE): additional sequence information, shorter amplicon lengths, and the simultaneous analysis of many markers. These benefits result in a reduced number of reactions necessary while improving the amount of data obtained, thereby conserving valuable sample extracts. This proves particularly advantageous for the analysis of trace DNA. This study assessed the suitability of MPS for short tandem repeat (STR) typing of low template samples compared with results obtained through CE. The MPS genotypes showed higher concordance to reference genotypes, with donor alleles being more frequently assigned to be the major contributor, meeting the requirements for database entry. However, the MPS workflow is more time-consuming and associated with higher costs.
Collapse
Affiliation(s)
- Lisa Dierig
- Institute of Legal Medicine, Ulm University, Ulm, Germany
| | | | - Peter Wiegand
- Institute of Legal Medicine, Ulm University, Ulm, Germany
| |
Collapse
|
7
|
Soldati G, Turrina S, Treccani M, Saccardo C, Ausania F, De Leo D. Concordance study on Y-STRs typing between SeqStudio™ genetic analyzer for HID and MiSeq™ FGx forensic genomics system. Mol Biol Rep 2023; 50:9779-9789. [PMID: 37812349 PMCID: PMC10676315 DOI: 10.1007/s11033-023-08808-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/25/2023] [Accepted: 09/08/2023] [Indexed: 10/10/2023]
Abstract
BACKGROUND Massively Parallel Sequencing (MPS) allowed an increased number of information to be retrieved from short tandem repeat (STR) analysis, expanding them not only to the size, as already performed in Capillary Electrophoresis (CE), but also to the sequence. MPS requires constant development and validation of the analytical parameters to ensure that the genotyping results of STRs correspond to those obtained by CE. Given the increased frequency of usage of Y-STRs as supplementary markers to the autosomal STRs analysis, it is urgent to validate the concordance of the typing results between CE and MPS analyses. METHODS AND RESULTS DNA extracted from 125 saliva samples of unrelated males was genotyped using Yfiler™ Plus PCR Amplification Kit and ForenSeq™ DNA Signature Prep Kit, which were analyzed by SeqStudio™ Genetic Analyzer for HID and MiSeq™ FGx Forensic Genomics System, respectively. For each shared Y-STR, allele designation, number of length- and sequence-based alleles per locus, stutter percentage, and the intra-locus balance of multicopy Y-STRs were screened. CONCLUSIONS Although the number of forensic genetics laboratories that are applying the MPS technique in routine analysis is small and does not allow a global assessment of MPS limitations, this comparative study highlights the ability of MPS to produce reliable profiles despite the generation of large amounts of raw data.
Collapse
Affiliation(s)
- Giulia Soldati
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy.
| | - Stefania Turrina
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| | - Mirko Treccani
- GM Lab, Department of Neurosciences, Biomedicine and Movement Sciences, University of Verona, Verona, Italy
| | - Chiara Saccardo
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| | - Francesco Ausania
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| | - Domenico De Leo
- Department of Diagnostics and Public Health, Section of Forensic Medicine, Forensic Genetics Lab, University of Verona, Verona, Italy
| |
Collapse
|
8
|
Elwick K, Rydzak P, Robertson JM. Evaluation of Library Preparation Workflows and Applications to Different Sample Types Using the PowerSeq ® 46GY System with Massively Parallel Sequencing. Genes (Basel) 2023; 14:genes14050977. [PMID: 37239337 DOI: 10.3390/genes14050977] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/23/2023] [Revised: 04/21/2023] [Accepted: 04/24/2023] [Indexed: 05/28/2023] Open
Abstract
This project evaluated the prototype PowerSeq® 46GY System using donor DNA and casework-type samples. The goal of this study was to determine whether modifications to the manufacturer's protocol could increase read coverage and improve sample results. Buccal and casework-type libraries were prepared using the TruSeq® DNA PCR-Free HT kit or the KAPA HyperPrep kit. Both kits were evaluated unmodified, and by substituting AMPure® XP beads for the beads of the most optimal kit. Two qPCR kits, the PowerSeq® Quant MS System and KAPA Library Quantification Kit, were also evaluated along with a KAPA size-adjustment workbook, which was compared as a third quantification method. Libraries were sequenced using the MiSeq® FGx and data were analyzed with STRait Razor. Results suggested that all three quantification methods overestimated library concentration, but the PowerSeq kit was most accurate. Samples prepared with the TruSeq library kit provided the highest coverage and the fewest instances of dropout and below-threshold alleles compared with the KAPA kit. Additionally, all bone and hair samples demonstrated full profile completeness, with bone samples yielding a higher average coverage than hair samples. Overall, our study demonstrated that the 46GY manufacturer's protocol produced the best quality results compared to alternative library preparation options.
Collapse
Affiliation(s)
- Kyleen Elwick
- Visiting Scientist Program, Research & Support Unit, Laboratory Division, Federal Bureau of Investigation, 2501 Investigation Parkway, Quantico, VA 22135, USA
| | - Patrick Rydzak
- Visiting Scientist Program, Research & Support Unit, Laboratory Division, Federal Bureau of Investigation, 2501 Investigation Parkway, Quantico, VA 22135, USA
| | - James M Robertson
- Research & Support Unit, Laboratory Division, Federal Bureau of Investigation, Quantico, VA 22135, USA
| |
Collapse
|
9
|
Butler JM. Recent advances in forensic biology and forensic DNA typing: INTERPOL review 2019-2022. Forensic Sci Int Synerg 2022; 6:100311. [PMID: 36618991 PMCID: PMC9813539 DOI: 10.1016/j.fsisyn.2022.100311] [Citation(s) in RCA: 29] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
Abstract
This review paper covers the forensic-relevant literature in biological sciences from 2019 to 2022 as a part of the 20th INTERPOL International Forensic Science Managers Symposium. Topics reviewed include rapid DNA testing, using law enforcement DNA databases plus investigative genetic genealogy DNA databases along with privacy/ethical issues, forensic biology and body fluid identification, DNA extraction and typing methods, mixture interpretation involving probabilistic genotyping software (PGS), DNA transfer and activity-level evaluations, next-generation sequencing (NGS), DNA phenotyping, lineage markers (Y-chromosome, mitochondrial DNA, X-chromosome), new markers and approaches (microhaplotypes, proteomics, and microbial DNA), kinship analysis and human identification with disaster victim identification (DVI), and non-human DNA testing including wildlife forensics. Available books and review articles are summarized as well as 70 guidance documents to assist in quality control that were published in the past three years by various groups within the United States and around the world.
Collapse
Affiliation(s)
- John M. Butler
- National Institute of Standards and Technology, Special Programs Office, 100 Bureau Drive, Mail Stop 4701, Gaithersburg, MD, USA
| |
Collapse
|
10
|
Wang X, Budowle B, Ge J. USAT: a bioinformatic toolkit to facilitate interpretation and comparative visualization of tandem repeat sequences. BMC Bioinformatics 2022; 23:497. [PMID: 36402991 PMCID: PMC9675219 DOI: 10.1186/s12859-022-05021-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 10/29/2022] [Indexed: 11/21/2022] Open
Abstract
Background Tandem repeats (TR), highly variable genomic variants, are widely used in individual identification, disease diagnostics, and evolutionary studies. The recent advances in sequencing technologies and bioinformatic tools facilitate calling TR haplotypes genome widely. Both length-based and sequence-based TR alleles are used in different applications. However, sequence-based TR alleles could provide the highest precision in characterizing TR haplotypes. The need to identify the differences at the single nucleotide level between or among TR haplotypes with an easy-use bioinformatic tool is essential. Results In this study, we developed a Universal STR Allele Toolkit (USAT) for TR haplotype analysis, which takes TR haplotype output from existing tools to perform allele size conversion, sequence comparison of haplotypes, figure plotting, comparison for allele distribution, and interactive visualization. An exemplary application of USAT for analysis of the CODIS core STR loci for DNA forensics with benchmarking human individuals demonstrated the capabilities of USAT. USAT has user-friendly graphic interfaces and runs fast in major computing operating systems with parallel computing enabled. Conclusion USAT is a user-friendly bioinformatics software for interpretation, visualization, and comparisons of TRs. Supplementary Information The online version contains supplementary material available at 10.1186/s12859-022-05021-1.
Collapse
Affiliation(s)
- Xuewen Wang
- grid.266869.50000 0001 1008 957XCenter for Human Identification, Health Science Center, University of North Texas, Fort Worth, TX USA
| | - Bruce Budowle
- grid.266869.50000 0001 1008 957XCenter for Human Identification, Health Science Center, University of North Texas, Fort Worth, TX USA ,grid.266871.c0000 0000 9765 6057Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX USA
| | - Jianye Ge
- grid.266869.50000 0001 1008 957XCenter for Human Identification, Health Science Center, University of North Texas, Fort Worth, TX USA ,grid.266871.c0000 0000 9765 6057Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, Fort Worth, TX USA
| |
Collapse
|
11
|
An ambiguous sequence-based allele of SE33. FORENSIC SCIENCE INTERNATIONAL GENETICS SUPPLEMENT SERIES 2022. [DOI: 10.1016/j.fsigss.2022.10.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
|
12
|
Hall CL, Kesharwani RK, Phillips NR, Planz JV, Sedlazeck FJ, Zascavage RR. Accurate profiling of forensic autosomal STRs using the Oxford Nanopore Technologies MinION device. Forensic Sci Int Genet 2021; 56:102629. [PMID: 34837788 DOI: 10.1016/j.fsigen.2021.102629] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2021] [Revised: 09/28/2021] [Accepted: 11/01/2021] [Indexed: 01/23/2023]
Abstract
The high variability characteristic of short tandem repeat (STR) markers is harnessed for human identification in forensic genetic analyses. Despite the power and reliability of current typing techniques, sequence-level information both within and around STRs are masked in the length-based profiles generated. Forensic STR typing using next generation sequencing (NGS) has therefore gained attention as an alternative to traditional capillary electrophoresis (CE) approaches. In this proof-of-principle study, we evaluate the forensic applicability of the newest and smallest NGS platform available - the Oxford Nanopore Technologies (ONT) MinION device. Although nanopore sequencing on the handheld MinION offers numerous advantages, including low startup cost and on-site sample processing, the relatively high error rate and lack of forensic-specific analysis software has prevented accurate profiling across STR panels in previous studies. Here we present STRspy, a streamlined method capable of producing length- and sequence-based STR allele designations from noisy, error-prone third generation sequencing reads. To assess the capabilities of STRspy, seven reference samples (female: n = 2; male: n = 5) were amplified at 15 and 30 PCR cycles using the Promega PowerSeq 46GY System and sequenced on the ONT MinION device in triplicate. Basecalled reads were then processed with STRspy using a custom database containing alleles reported in the STRSeq BioProject NIST 1036 dataset. Resultant STR allele designations and flanking region single nucleotide polymorphism (SNP) calls were compared to the manufacturer-validated genotypes for each sample. STRspy generated robust and reliable genotypes across all autosomal STR loci amplified with 30 PCR cycles, achieving 100% concordance based on both length and sequence. Furthermore, we were able to identify flanking region SNPs in the 15-cycle dataset with > 90% accuracy. These results demonstrate that when analyzed with STRspy ONT reads can reveal additional variation in and around STR loci depending on read coverage. As the first and only third generation sequencing platform-specific method to successfully profile the entire panel of autosomal STRs amplified by a commercially available multiplex, STRspy significantly increases the feasibility of nanopore sequencing in forensic applications.
Collapse
Affiliation(s)
- Courtney L Hall
- Department of Microbiology, Immunology & Genetics, University of North Texas Health Science Center, 3400 Camp Bowie Blvd, Fort Worth, TX 76107, USA.
| | - Rupesh K Kesharwani
- Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston TX 77030, USA
| | - Nicole R Phillips
- Department of Microbiology, Immunology & Genetics, University of North Texas Health Science Center, 3400 Camp Bowie Blvd, Fort Worth, TX 76107, USA
| | - John V Planz
- Department of Microbiology, Immunology & Genetics, University of North Texas Health Science Center, 3400 Camp Bowie Blvd, Fort Worth, TX 76107, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, One Baylor Plaza, Houston TX 77030, USA
| | - Roxanne R Zascavage
- Department of Microbiology, Immunology & Genetics, University of North Texas Health Science Center, 3400 Camp Bowie Blvd, Fort Worth, TX 76107, USA; Department of Criminology and Criminal Justice, University of Texas at Arlington, 701 S Nedderman Dr, Arlington, TX 76109, USA
| |
Collapse
|