1
|
Liu Q, Chen Y, Qi H. Advances in Genotyping Detection of Fragmented Nucleic Acids. BIOSENSORS 2024; 14:465. [PMID: 39451678 PMCID: PMC11506436 DOI: 10.3390/bios14100465] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/31/2024] [Revised: 09/26/2024] [Accepted: 09/27/2024] [Indexed: 10/26/2024]
Abstract
Single nucleotide variant (SNV) detection is pivotal in various fields, including disease diagnosis, viral screening, genetically modified organism (GMO) identification, and genotyping. However, detecting SNVs presents significant challenges due to the fragmentation of nucleic acids caused by cellular apoptosis, molecular shearing, and physical degradation processes such as heating. Fragmented nucleic acids often exhibit variable lengths and inconsistent breakpoints, complicating the accurate detection of SNVs. This article delves into the underlying causes of nucleic acid fragmentation and synthesizes the strengths and limitations of next-generation sequencing technology, high-resolution melting curves, molecular probes, and CRISPR-based approaches for SNV detection in fragmented nucleic acids. By providing a detailed comparative analysis, it seeks to offer valuable insights for researchers working to overcome the challenges of SNV detection in fragmented samples, ultimately advancing the accurate and efficient detection of single nucleotide variants across diverse applications.
Collapse
Affiliation(s)
- Qian Liu
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; (Q.L.); (Y.C.)
- Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Yun Chen
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; (Q.L.); (Y.C.)
- Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| | - Hao Qi
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300350, China; (Q.L.); (Y.C.)
- Key Laboratory of Systems Bioengineering (Ministry of Education), School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China
| |
Collapse
|
2
|
Kang S, Liu Q, Zhang J, Zhang Y, Qi H. 2,6-diaminopurine (Z)-containing toehold probes improve genotyping sensitivity. Biotechnol Bioeng 2024; 121:1384-1393. [PMID: 38151965 DOI: 10.1002/bit.28642] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2023] [Revised: 12/15/2023] [Accepted: 12/15/2023] [Indexed: 12/29/2023]
Abstract
2,6-diaminopurine (Z), a naturally occurring noncanonical nucleotide base found in bacteriophages, enhances DNA hybridization by forming three hydrogen bonds with thymine (T). These distinct biochemical characteristics make it particularly valuable in applications that rely on the thermodynamics of DNA hybridization. However, the practical use of Z-containing oligos is limited by their high production cost and the challenges associated with their synthesis. Here, we developed an efficient and cost-effective approach to synthesize Z-containing oligos of high quality based on an isothermal strand displacement reaction. These newly synthesized Z-oligos are then employed as toehold-blockers in an isothermal genotyping assay designed to detect rare single nucleotide variations (SNV). When compared with their counterparts containing the standard adenine (A) base, the Z-containing blockers significantly enhance the accuracy of identifying SNV. Overall, our innovative methodology in the synthesis of Z-containing oligos, which can also be used to incorporate other unconventional and unnatural bases into oligonucleotides, is anticipated to be adopted for diverse applications, including genotyping, biosensing, and gene therapy.
Collapse
Affiliation(s)
- Shaohua Kang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
| | - Qian Liu
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
| | - Jie Zhang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
| | - Yan Zhang
- New Cornerstone Science Laboratory, School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China
- Tianjin Key Laboratory for Modern Drug Delivery & High-Efficiency, Collaborative Innovation Center of Chemical Science and Engineering, School of Pharmaceutical Science and Technology, Tianjin University, Tianjin, China
| | - Hao Qi
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, China
| |
Collapse
|
3
|
Gimpel AL, Stark WJ, Heckel R, Grass RN. A digital twin for DNA data storage based on comprehensive quantification of errors and biases. Nat Commun 2023; 14:6026. [PMID: 37758710 PMCID: PMC10533828 DOI: 10.1038/s41467-023-41729-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2023] [Accepted: 09/18/2023] [Indexed: 09/29/2023] Open
Abstract
Archiving data in synthetic DNA offers unprecedented storage density and longevity. Handling and storage introduce errors and biases into DNA-based storage systems, necessitating the use of Error Correction Coding (ECC) which comes at the cost of added redundancy. However, insufficient data on these errors and biases, as well as a lack of modeling tools, limit data-driven ECC development and experimental design. In this study, we present a comprehensive characterisation of the error sources and biases present in the most common DNA data storage workflows, including commercial DNA synthesis, PCR, decay by accelerated aging, and sequencing-by-synthesis. Using the data from 40 sequencing experiments, we build a digital twin of the DNA data storage process, capable of simulating state-of-the-art workflows and reproducing their experimental results. We showcase the digital twin's ability to replace experiments and rationalize the design of redundancy in two case studies, highlighting opportunities for tangible cost savings and data-driven ECC development.
Collapse
Affiliation(s)
- Andreas L Gimpel
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 1-5, 8093, Zürich, Switzerland
| | - Wendelin J Stark
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 1-5, 8093, Zürich, Switzerland
| | - Reinhard Heckel
- Department of Computer Engineering, Technical University of Munich, Arcistrasse 21, 80333, Munich, Germany
| | - Robert N Grass
- Department of Chemistry and Applied Biosciences, ETH Zürich, Vladimir-Prelog-Weg 1-5, 8093, Zürich, Switzerland.
| |
Collapse
|
4
|
Liu Q, Wei Y, Wang Z, Song DP, Cui J, Qi H. Sustainable DNA Data Storage on Cellulose Paper. SMALL METHODS 2023; 7:e2201610. [PMID: 37263984 DOI: 10.1002/smtd.202201610] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2022] [Revised: 04/04/2023] [Indexed: 06/03/2023]
Abstract
DNA is a promising material for high density and long-term archival data storage. In addition to algorithms for encoding digital information into DNA sequences, the DNA writing (chemical synthesis) and reading (DNA sequencing), the preservation of DNA mixtures with high sequence diversity is another critical issue for sustainable, long-term, and large-scale DNA data storage. Here, this work demonstrates a method for low-cost, convenient and sustainable DNA data storage on cellulose paper. A DNA pool comprising thousands of sequences, in which archival data are encoded, is conveniently stored on a cellulose paper with a calculated density as high as 15 TB per mm3 through electrostatic adsorption. This work demonstrates that these digitally encoded DNA pools can be stable for years on the cellulose paper after drying even when directly exposed to air. Furthermore, the reversible electrostatic adsorption enables repeated loading/retrieval of DNA on/off cellulose paper. Therefore, this sustainable DNA preservation on cellulose paper through the convenient electrostatic adsorption exhibits a great advantage in terms of storage capacity and cost that is crucial for practical systems to achieve large-scale and long-time data storage.
Collapse
Affiliation(s)
- Qian Liu
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, 300350, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, 300350, China
| | - Yanan Wei
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, 300350, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, 300350, China
| | - Zhaoguan Wang
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, 300350, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, 300350, China
| | - Dong-Po Song
- Tianjin Key Laboratory of Composite and Functional Materials, School of Materials Science and Engineering, Tianjin University, Tianjin, 300350, China
| | - Jingsong Cui
- School of Cyber Science and Engineering, Wuhan University, Wuhan, 430072, China
| | - Hao Qi
- School of Chemical Engineering and Technology, Tianjin University, Tianjin, 300350, China
- Key Laboratory of Systems Bioengineering (Ministry of Education), Tianjin University, Tianjin, 300350, China
- Zhejiang Shaoxing Research Institute of Tianjin University, Zhejiang, 312369, China
| |
Collapse
|
5
|
Adaptive Savitzky–Golay Filters for Analysis of Copy Number Variation Peaks from Whole-Exome Sequencing Data. INFORMATION 2023. [DOI: 10.3390/info14020128] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/18/2023] Open
Abstract
Copy number variation (CNV) is a form of structural variation in the human genome that provides medical insight into complex human diseases; while whole-genome sequencing is becoming more affordable, whole-exome sequencing (WES) remains an important tool in clinical diagnostics. Because of its discontinuous nature and unique characteristics of sparse target-enrichment-based WES data, the analysis and detection of CNV peaks remain difficult tasks. The Savitzky–Golay (SG) smoothing is well known as a fast and efficient smoothing method. However, no study has documented the use of this technique for CNV peak detection. It is well known that the effectiveness of the classical SG filter depends on the proper selection of the window length and polynomial degree, which should correspond with the scale of the peak because, in the case of peaks with a high rate of change, the effectiveness of the filter could be restricted. Based on the Savitzky–Golay algorithm, this paper introduces a novel adaptive method to smooth irregular peak distributions. The proposed method ensures high-precision noise reduction by dynamically modifying the results of the prior smoothing to automatically adjust parameters. Our method offers an additional feature extraction technique based on density and Euclidean distance. In comparison to classical Savitzky–Golay filtering and other peer filtering methods, the performance evaluation demonstrates that adaptive Savitzky–Golay filtering performs better. According to experimental results, our method effectively detects CNV peaks across all genomic segments for both short and long tags, with minimal peak height fidelity values (i.e., low estimation bias). As a result, we clearly demonstrate how well the adaptive Savitzky–Golay filtering method works and how its use in the detection of CNV peaks can complement the existing techniques used in CNV peak analysis.
Collapse
|