1
|
Nakanishi H, Yoneyama K, Hara M, Takada A, Sakai K, Saito K. Estimating individual mtDNA haplotypes in mixed DNA samples by combining MinION and MiSeq. Int J Legal Med 2022; 136:423-432. [PMID: 35001166 DOI: 10.1007/s00414-021-02763-0] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2021] [Accepted: 12/03/2021] [Indexed: 12/30/2022]
Abstract
We tried to estimate individual mtDNA haplotypes in mixed DNA samples by combining MinION and MiSeq. The BAM files produced by MiSeq were viewed using Integrative Genomics Viewer (IGV) to verify mixed bases. By sorting the reads according to base type for each mixed base, partial haplotypes were determined. Then, the BAM files produced by MinKNOW were viewed using IGV. To determine haplotypes with IGV, only mixed bases determined by MiSeq were used as target bases. By sorting the reads according to base type for each target base, each contributor's haplotype was estimated. In mixed samples from two contributors, even a haplotype with a minor contribution of 5% could be distinguished from the haplotype of the major contributor. In mixed samples of three contributors (mixture ratios of 1:1:1 and 4:2:1), each haplotype could also be distinguished. Sequences of C-stretches were determined very inaccurately in the MinION analysis. Although the analysis method was simple, each haplotype was correctly detected in all mixed samples with two or three contributors in various mixture ratios by combining MinION and MiSeq. This should be useful for identifying contributors to mixed samples.
Collapse
Affiliation(s)
- Hiroaki Nakanishi
- Department of Forensic Medicine, Juntendo University School of Medicine, 2-1-1, Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan.
| | - Katsumi Yoneyama
- Department of Forensic Medicine, Saitama Medical University, 38 Morohongo, Moroyama, Saitama, 350-0495, Japan
| | - Masaaki Hara
- Department of Forensic Medicine, Saitama Medical University, 38 Morohongo, Moroyama, Saitama, 350-0495, Japan
| | - Aya Takada
- Department of Forensic Medicine, Saitama Medical University, 38 Morohongo, Moroyama, Saitama, 350-0495, Japan
| | - Kentaro Sakai
- Department of Forensic Medicine, Juntendo University School of Medicine, 2-1-1, Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
- Tokyo Medical Examiner's Office, Tokyo Metropolitan Government, 4-21-18, Otsuka, Bunkyo-Ku, Tokyo, 112-0012, Japan
| | - Kazuyuki Saito
- Department of Forensic Medicine, Juntendo University School of Medicine, 2-1-1, Hongo, Bunkyo-Ku, Tokyo, 113-8421, Japan
| |
Collapse
|
2
|
An Introductory Overview of Open-Source and Commercial Software Options for the Analysis of Forensic Sequencing Data. Genes (Basel) 2021; 12:genes12111739. [PMID: 34828345 PMCID: PMC8618049 DOI: 10.3390/genes12111739] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/15/2021] [Revised: 10/27/2021] [Accepted: 10/27/2021] [Indexed: 12/30/2022] Open
Abstract
The top challenges of adopting new methods to forensic DNA analysis in routine laboratories are often the capital investment and the expertise required to implement and validate such methods locally. In the case of next-generation sequencing, in the last decade, several specifically forensic commercial options became available, offering reliable and validated solutions. Despite this, the readily available expertise to analyze, interpret and understand such data is still perceived to be lagging behind. This review gives an introductory overview for the forensic scientists who are at the beginning of their journey with implementing next-generation sequencing locally and because most in the field do not have a bioinformatics background may find it difficult to navigate the new terms and analysis options available. The currently available open-source and commercial software for forensic sequencing data analysis are summarized here to provide an accessible starting point for those fairly new to the forensic application of massively parallel sequencing.
Collapse
|
3
|
DNA-nanopore technology: a human perspective. Emerg Top Life Sci 2021; 5:455-463. [PMID: 34282838 DOI: 10.1042/etls20200282] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/05/2021] [Revised: 06/09/2021] [Accepted: 06/11/2021] [Indexed: 11/17/2022]
Abstract
The purpose of this article is to give a brief overview of the current state of nanopore sequencing in relation to forensic science with a brief outline of where it stands in relation to current methods, its potential uses in forensic science and factors which may influence acceptance of this technology by forensic practitioners, the judiciary and law enforcement. Perhaps most importantly consideration is also given to concerns which may influence the acceptance of the technology by the general public.
Collapse
|
4
|
King JL, Woerner AE, Mandape SN, Kapema KB, Moura-Neto RS, Silva R, Budowle B. STRait Razor Online: An enhanced user interface to facilitate interpretation of MPS data. Forensic Sci Int Genet 2021; 52:102463. [PMID: 33493821 DOI: 10.1016/j.fsigen.2021.102463] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2020] [Revised: 11/06/2020] [Accepted: 12/29/2020] [Indexed: 12/17/2022]
Abstract
Since 2013, STRait Razor has enabled analysis of massively parallel sequencing (MPS) data from various marker systems such as short tandem repeats, single nucleotide polymorphisms, insertion/deletions, and mitochondrial DNA. In this paper, STRait Razor Online (SRO), available at https://www.unthsc.edu/straitrazor, is introduced as an interactive, Shiny-based user interface for primary analysis of MPS data and secondary analysis of STRait Razor haplotype pileups. This software can be accessed from any common browser via desktop, tablet, or smartphone device. SRO is available also as a standalone application and open-source R script available at https://github.com/ExpectationsManaged/STRaitRazorOnline. The local application is capable of batch processing of both fastq files and primary analysis output. Processed batches generate individual report folders and summary reports at the locus- and haplotype-level in a matter of minutes. For example, the processing of data from ∼700 samples generated with the ForenSeq Signature Preparation Kit from allsequences.txt to a final table can be performed in ∼40 min whereas the Excel-based workbooks can take 35-60 h to compile a subset of the tables generated by SRO. To facilitate analysis of single-source, reference samples, a preliminary triaging system was implemented that calls potential alleles and flags loci suspected of severe heterozygote imbalance. When compared to published, manually curated data sets, 98.72 % of software-assigned allele calls without manual interpretation were consistent with curated data sets, 0.99 % loci were presented to the user for interpretation due to heterozygote imbalance, and the remaining 0.29 % of loci were inconsistent due to the analytical thresholds used across the studies.
Collapse
Affiliation(s)
- Jonathan L King
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA.
| | - August E Woerner
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | - Sammed N Mandape
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | - Kapema Bupe Kapema
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| | | | - Rosane Silva
- Instituto de Biofisica Carlos Chagas Filho, Universidade Federal do Rio de Janeiro, Rio de Janeiro, RJ, Brazil
| | - Bruce Budowle
- Center for Human Identification, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA; Department of Microbiology, Immunology, and Genetics, University of North Texas Health Science Center, 3500 Camp Bowie Blvd., Fort Worth, TX, 76107, USA
| |
Collapse
|
5
|
Silva DSBS, Scheible MK, Bailey SF, Williams CL, Allwood JS, Just RS, Schuetter J, Skomrock N, Minard-Smith A, Barker-Scoggins N, Eichman C, Meiklejohn K, Faith SA. Sequence-based autosomal STR characterization in four US populations using PowerSeq™ Auto/Y system. Forensic Sci Int Genet 2020; 48:102311. [PMID: 32531758 DOI: 10.1016/j.fsigen.2020.102311] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2019] [Revised: 04/21/2020] [Accepted: 05/14/2020] [Indexed: 11/30/2022]
Abstract
The forensic science community is poised to utilize modern advances in massively parallel sequencing (MPS) technologies to better characterize biological samples with higher resolution. A critical component towards the advancement of forensic DNA analysis with these technologies is a comprehensive understanding of the diversity and population distribution of sequence-based short tandem repeat (STR) alleles. Here we analyzed 786 samples of individuals from different population groups, including four of the mostly commonly encountered in forensic casework in the USA. DNA samples were amplified with the PowerSeq™ Auto/Y System Prototype Kit (Promega Corp.), and sequencing was performed on an Illumina® MiSeq instrument. Sequence data were analyzed using a bioinformatics processing tool, Altius. For additional data analysis and profile comparison, capillary electrophoresis (CE) size-based STR genotypes were generated for a subset of individuals, and where possible, also with a second commercially available MPS STR assay. Autosomal STR loci were analyzed and frequencies were calculated based on sequence composition. Also, population genetics studies were performed, with Hardy-Weinberg equilibrium, polymorphic information content (PIC), and observed and expected heterozygosity all assessed. Overall, sequence-based allelic variants of the repeat region were observed in 20 out of 22 different STR loci commonly used in forensic DNA genotyping, with the highest number of sequence variation observed at locus D12S391. The highest increase in allelic diversity and in PIC through sequence-based genotyping was observed at loci D3S1358 and D8S1179. Such detailed sequence analysis, as the one performed in the present study, is important to help understand the diversity of sequence-based STR alleles across different populations and to demonstrate how such allelic variation can improve statistics used for forensic casework.
Collapse
Affiliation(s)
- Deborah S B S Silva
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA; NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA.
| | - Melissa K Scheible
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA; NC State University, Population Health and Pathobiology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Sarah F Bailey
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Christopher L Williams
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Julia S Allwood
- NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Rebecca S Just
- DNA Support Unit, Federal Bureau of Investigation Laboratory, 2501 Investigation Parkway, Quantico, VA, 22135, USA
| | - Jared Schuetter
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA
| | - Nicholas Skomrock
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA
| | | | - Nicole Barker-Scoggins
- NC State University, College of Veterinary Medicine, Office of Information Technology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Christopher Eichman
- NC State University, College of Veterinary Medicine, Office of Information Technology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Kelly Meiklejohn
- NC State University, Population Health and Pathobiology, 1060 William Moore Dr., Raleigh, NC, 27607, USA
| | - Seth A Faith
- Battelle Memorial Institute, 505 King Ave., Columbus, OH, 43201, USA; NC State University, Molecular Biomedical Sciences, 1060 William Moore Dr., Raleigh, NC, 27607, USA; Penn State University Forensic Science Program, 329 Whitmore Laboratory, University Park, PA, 16802, USA
| |
Collapse
|
6
|
S B S Silva D, Sawitzki FR, Scheible MKR, Bailey SF, S Alho C, Faith SA. Paternity testing using massively parallel sequencing and the PowerSeq™ AUTO/Y system for short tandem repeat sequencing. Electrophoresis 2018; 39:2669-2673. [PMID: 30132945 DOI: 10.1002/elps.201800072] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/10/2018] [Revised: 06/28/2018] [Accepted: 08/15/2018] [Indexed: 11/09/2022]
Abstract
Massively parallel sequencing (MPS) is gaining attention as a new technology for routine forensic casework, including paternity testing. Recently released MPS multiplex panels provide many more loci compared to CE methods, plus provide sequence-based alleles that together improve the statistical power of the genetic testing. Here, an MPS system (PowerSeq™ AUTO/Y) was applied for STR sequencing in the study of first-degree STR sequence allele inheritance from families in Southern Brazil. In 29 trios (mother-child-father) analyzed, the paternity index values generally increased when data from sequence-based analysis were used in comparison to length-based data. Further, allele inconsistencies (e.g., single repeat mutation events) between child and parents could be resolved with MPS by assessing the core repeat and flanking region sequences. Lastly, the sequence information allowed for identification of isoalleles (alleles of the same size, but different sequence) to determine specific paternal and maternal inheritances. The results from this study showed advantages of implementing sequence-based analysis, MPS, in paternity testing with improved statistical calculations and a greater resolution for the trios/families tested.
Collapse
Affiliation(s)
- Deborah S B S Silva
- Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC, USA
| | - Fernanda R Sawitzki
- Laboratory of Human and Molecular Genetics, Pontificia Universidade Catolica do Rio Grande do Sul, Porto Alegre-RS, Brazil
| | - Melissa K R Scheible
- Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC, USA
| | - Sarah F Bailey
- Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC, USA
| | - Clarice S Alho
- Laboratory of Human and Molecular Genetics, Pontificia Universidade Catolica do Rio Grande do Sul, Porto Alegre-RS, Brazil
| | - Seth A Faith
- Molecular Biomedical Sciences, North Carolina State University, Raleigh, NC, USA
| |
Collapse
|
7
|
Genetic analysis of Southern Brazil subjects using the PowerSeq™ AUTO/Y system for short tandem repeat sequencing. Forensic Sci Int Genet 2017; 33:129-135. [PMID: 29275088 DOI: 10.1016/j.fsigen.2017.12.008] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/07/2017] [Revised: 12/12/2017] [Accepted: 12/15/2017] [Indexed: 11/21/2022]
Abstract
With the advent of Next-Generation Sequencing technology, sequencing of short tandem repeats (STRs) allows for a more detailed analysis when compared to size-based fragment methods (capillary electrophoresis-CE). The implementation of high-throughput sequencing can help uncover deeper genetic diversities of different populations. Subjects from the South region of Brazil present a particular and more homogeneous ancestry background when compared to other regions of the country. Both autosomal and Y- STRs have been analyzed in these individuals; however, all analyses published to date encompass data from CE-based fragment analysis. In this study, a genetic analysis of 59 individuals from Southern Brazil was performed on STR sequences. Forensically relevant STRs were PCR-enriched using a prototype of the PowerSeq™ AUTO/Y system (Promega Corp.). Next-generation sequencing was performed on an Illumina MiSeq instrument. The raw data (FASTQ files) were processed using a custom designed sequence processing tool, Altius. Isoalleles, which are sequence-based allelic variants that do not differ in length, were observed in nine autosomal and in six Y- STRs from the core global forensic marker set. The number of distinctive alleles based on sequence was higher when compared to those based on length, 37.3% higher in autosomal STRs and 13.8% higher in Y-STRs. The most polymorphic autosomal locus was D12S391, which presented 38 different sequence-based alleles. Among the loci in the Y chromosome, DYS389II presented the highest number of isoalleles. In comparison to CE analysis, Observed and Expected Heterozygosity, Polymorphic Information Content (PIC) and Genetic Diversity also presented higher values when the alleles were analyzed based on their sequence. For autosomal loci, Polymorphic Information Content (PIC) was 2.6% higher for sequence-based data. Diversity was 9.3% and 6.5% higher for autosomal and Y markers, respectively. In the analysis of the repeat structures for the STR loci, a new allele variant was found for allele 18 in the vWA locus. The STR flanking regions were also further investigated and sixteen variations were observed at nine autosomal STR loci and one Y-STR locus. The results obtained in this study demonstrate the importance of genetic analysis based on sequencing and highlight the diversity of the South Brazilian population when characterized by STR sequencing.
Collapse
|