101
|
Choi Y, Bae HJ, Lee AC, Choi H, Lee D, Ryu T, Hyun J, Kim S, Kim H, Song SH, Kim K, Park W, Kwon S. DNA Micro-Disks for the Management of DNA-Based Data Storage with Index and Write-Once-Read-Many (WORM) Memory Features. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2020; 32:e2001249. [PMID: 32725925 DOI: 10.1002/adma.202001249] [Citation(s) in RCA: 37] [Impact Index Per Article: 7.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/22/2020] [Revised: 06/03/2020] [Indexed: 05/25/2023]
Abstract
DNA-based data storage has attracted attention because of its higher physical density of the data and longer retention time than those of conventional digital data storage. However, previous DNA-based data storage lacked index features and the data quality of storage after a single access was not preserved, obstructing its industrial use. Here, DNA micro-disks, QR-coded micro-sized disks that harbor data-encoded DNA molecules for the efficient management of DNA-based data storage, are proposed. The two major features that previous DNA-based data-storage studies could not achieve are demonstrated. One feature is accessing data items efficiently by indexing the data-encoded DNA library. Another is achieving write-once-read-many (WORM) memory through the immobilization of DNA molecules on the disk and their enrichment through in situ DNA production. Through these features, the reliability of DNA-based data storage is increased by allowing selective and multiple accession of data-encoded DNA with lower data loss than previous DNA-based data storage methods.
Collapse
Affiliation(s)
- Yeongjae Choi
- Nano Systems Institute, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Hyung Jong Bae
- Department of Electrical and Computer Engineering, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Amos C Lee
- Interdisciplinary Program for Bioengineering, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Hansol Choi
- Department of Electrical and Computer Engineering, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Daewon Lee
- Nano Systems Institute, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
- BK21+ Creative Research Engineer Development for IT, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Taehoon Ryu
- Celemics Inc., 131, Gasandigital 1-ro, Geumcheon-gu, Seoul, 08506, Republic of Korea
| | - Jinwoo Hyun
- Department of Electrical and Computer Engineering, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| | - Seojoo Kim
- Department of Electronic Engineering, Kyung Hee University, Deogyeong-daero, Giheung-gu, Yongin-si, Gyeonggi-do, 17104, Republic of Korea
| | - Hyeli Kim
- Department of Electronic Engineering, Kyung Hee University, Deogyeong-daero, Giheung-gu, Yongin-si, Gyeonggi-do, 17104, Republic of Korea
| | - Suk-Heung Song
- Department of Electronic Engineering, Kyung Hee University, Deogyeong-daero, Giheung-gu, Yongin-si, Gyeonggi-do, 17104, Republic of Korea
| | - Kibeom Kim
- Department of Electronic Engineering, Kyung Hee University, Deogyeong-daero, Giheung-gu, Yongin-si, Gyeonggi-do, 17104, Republic of Korea
| | - Wook Park
- Department of Electronic Engineering, Kyung Hee University, Deogyeong-daero, Giheung-gu, Yongin-si, Gyeonggi-do, 17104, Republic of Korea
- Institute for Wearable Convergence Electronics, Kyung Hee University, Deogyeong-daero, Giheung-gu, Yongin-si, Gyeonggi-do, 17104, Republic of Korea
| | - Sunghoon Kwon
- Nano Systems Institute, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
- Department of Electrical and Computer Engineering, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
- Interdisciplinary Program for Bioengineering, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
- Institute of Entrepreneurial Bio Convergence, Seoul National University, 1, Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
- Seoul National University Hospital Biomedical Research Institute, Seoul National University Hospital, 101, Daehak-ro, Jongno-gu, Seoul, 03080, Republic of Korea
- Inter-University Semiconductor Research Center (ISRC), Seoul National University, 1 Gwanak-ro, Gwanak-gu, Seoul, 08826, Republic of Korea
| |
Collapse
|
102
|
Abstract
DNA polymerases play a central role in biology by transferring genetic information from one generation to the next during cell division. Harnessing the power of these enzymes in the laboratory has fueled an increase in biomedical applications that involve the synthesis, amplification, and sequencing of DNA. However, the high substrate specificity exhibited by most naturally occurring DNA polymerases often precludes their use in practical applications that require modified substrates. Moving beyond natural genetic polymers requires sophisticated enzyme-engineering technologies that can be used to direct the evolution of engineered polymerases that function with tailor-made activities. Such efforts are expected to uniquely drive emerging applications in synthetic biology by enabling the synthesis, replication, and evolution of synthetic genetic polymers with new physicochemical properties.
Collapse
|
103
|
Synthesis of DNA Origami Scaffolds: Current and Emerging Strategies. Molecules 2020; 25:molecules25153386. [PMID: 32722650 PMCID: PMC7435391 DOI: 10.3390/molecules25153386] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2020] [Revised: 07/23/2020] [Accepted: 07/24/2020] [Indexed: 12/20/2022] Open
Abstract
DNA origami nanocarriers have emerged as a promising tool for many biomedical applications, such as biosensing, targeted drug delivery, and cancer immunotherapy. These highly programmable nanoarchitectures are assembled into any shape or size with nanoscale precision by folding a single-stranded DNA scaffold with short complementary oligonucleotides. The standard scaffold strand used to fold DNA origami nanocarriers is usually the M13mp18 bacteriophage’s circular single-stranded DNA genome with limited design flexibility in terms of the sequence and size of the final objects. However, with the recent progress in automated DNA origami design—allowing for increasing structural complexity—and the growing number of applications, the need for scalable methods to produce custom scaffolds has become crucial to overcome the limitations of traditional methods for scaffold production. Improved scaffold synthesis strategies will help to broaden the use of DNA origami for more biomedical applications. To this end, several techniques have been developed in recent years for the scalable synthesis of single stranded DNA scaffolds with custom lengths and sequences. This review focuses on these methods and the progress that has been made to address the challenges confronting custom scaffold production for large-scale DNA origami assembly.
Collapse
|
104
|
Chua JPS, Go MK, Osothprarop T, Mcdonald S, Karabadzhak AG, Yew WS, Peisajovich S, Nirantar S. Evolving a Thermostable Terminal Deoxynucleotidyl Transferase. ACS Synth Biol 2020; 9:1725-1735. [PMID: 32497424 DOI: 10.1021/acssynbio.0c00078] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
Abstract
Terminal deoxynucleotidyl transferase (TdT) catalyzes template free incorporation of arbitrary nucleotides onto single-stranded DNA. Due to this unique feature, TdT is widely used in biotechnology and clinical applications. One particularly tantalizing use is the synthesis of long de novo DNA molecules by TdT-mediated iterative incorporation of a 3' reversibly blocked nucleotide, followed by deblocking. However, wild-type (WT) TdT is not optimized for the incorporation of 3' modified nucleotides, and TdT engineering is hampered by the fact that TdT is marginally stable and only present in mesophilic organisms. We sought to first evolve a thermostable TdT variant to serve as backbone for subsequent evolution to enable efficient incorporation of 3'-modified nucleotides. A thermostable variant would be a good starting point for such an effort, as evolution to incorporate bulky modified nucleotides generally results in lowered stability. In addition, a thermostable TdT would also be useful when blunt dsDNA is a substrate as higher temperature could be used to melt dsDNA. Here, we developed an assay to identify thermostable TdT variants. After screening about 10 000 TdT mutants, we identified a variant, named TdT3-2, that is 10 °C more thermostable than WT TdT, while preserving the catalytic properties of the WT enzyme.
Collapse
Affiliation(s)
- Jasmine Puay Suan Chua
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117597
- NUS Synthetic Biology for Clinical and Technological Innovation, 14 Medical Drive, Singapore 117599
- Illumina Singapore Pte, Ltd., 29 Woodlands Industrial Park E1, North Tech Building, Singapore 757716
| | - Maybelle Kho Go
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117597
- NUS Synthetic Biology for Clinical and Technological Innovation, 14 Medical Drive, Singapore 117599
| | - Trina Osothprarop
- Illumina Inc., 5200 Illumina Way, San Diego, California 92122, United States
| | - Seth Mcdonald
- Illumina Inc., 5200 Illumina Way, San Diego, California 92122, United States
| | | | - Wen Shan Yew
- Department of Biochemistry, Yong Loo Lin School of Medicine, National University of Singapore, 8 Medical Drive, Singapore 117597
- NUS Synthetic Biology for Clinical and Technological Innovation, 14 Medical Drive, Singapore 117599
| | - Sergio Peisajovich
- Illumina Inc., 5200 Illumina Way, San Diego, California 92122, United States
| | - Saurabh Nirantar
- Illumina Singapore Pte, Ltd., 29 Woodlands Industrial Park E1, North Tech Building, Singapore 757716
| |
Collapse
|
105
|
Lin KN, Volkel K, Tuck JM, Keung AJ. Dynamic and scalable DNA-based information storage. Nat Commun 2020; 11:2981. [PMID: 32532979 PMCID: PMC7293219 DOI: 10.1038/s41467-020-16797-2] [Citation(s) in RCA: 48] [Impact Index Per Article: 9.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2019] [Accepted: 05/20/2020] [Indexed: 11/11/2022] Open
Abstract
The physical architectures of information storage systems often dictate how information is encoded, databases are organized, and files are accessed. Here we show that a simple architecture comprised of a T7 promoter and a single-stranded overhang domain (ss-dsDNA), can unlock dynamic DNA-based information storage with powerful capabilities and advantages. The overhang provides a physical address for accessing specific DNA strands as well as implementing a range of in-storage file operations. It increases theoretical storage densities and capacities by expanding the encodable sequence space and simplifies the computational burden in designing sets of orthogonal file addresses. Meanwhile, the T7 promoter enables repeatable information access by transcribing information from DNA without destroying it. Furthermore, saturation mutagenesis around the T7 promoter and systematic analyses of environmental conditions reveal design criteria that can be used to optimize information access. This simple but powerful ss-dsDNA architecture lays the foundation for information storage with versatile capabilities. The physical architectures of information storage dictate how data is encoded, organised and accessed. Here the authors use DNA with a single-strand overhang as a physical address to access specific data and do in-storage file operations in a scalable and reusuable manner.
Collapse
Affiliation(s)
- Kevin N Lin
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Campus Box 7905, Raleigh, NC, 27695-7905, USA
| | - Kevin Volkel
- Department of Electrical and Computer Engineering, North Carolina State University, Campus Box 7911, Raleigh, NC, 27695-7911, USA
| | - James M Tuck
- Department of Electrical and Computer Engineering, North Carolina State University, Campus Box 7911, Raleigh, NC, 27695-7911, USA.
| | - Albert J Keung
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Campus Box 7905, Raleigh, NC, 27695-7905, USA.
| |
Collapse
|
106
|
Arcadia CE, Kennedy E, Geiser J, Dombroski A, Oakley K, Chen SL, Sprague L, Ozmen M, Sello J, Weber PM, Reda S, Rose C, Kim E, Rubenstein BM, Rosenstein JK. Multicomponent molecular memory. Nat Commun 2020; 11:691. [PMID: 32019933 PMCID: PMC7000828 DOI: 10.1038/s41467-020-14455-1] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2019] [Accepted: 01/08/2020] [Indexed: 11/25/2022] Open
Abstract
Multicomponent reactions enable the synthesis of large molecular libraries from relatively few inputs. This scalability has led to the broad adoption of these reactions by the pharmaceutical industry. Here, we employ the four-component Ugi reaction to demonstrate that multicomponent reactions can provide a basis for large-scale molecular data storage. Using this combinatorial chemistry we encode more than 1.8 million bits of art historical images, including a Cubist drawing by Picasso. Digital data is written using robotically synthesized libraries of Ugi products, and the files are read back using mass spectrometry. We combine sparse mixture mapping with supervised learning to achieve bit error rates as low as 0.11% for single reads, without library purification. In addition to improved scaling of non-biological molecular data storage, these demonstrations offer an information-centric perspective on the high-throughput synthesis and screening of small-molecule libraries. Small non-polymeric molecules have tremendous structural diversity that can be used to represent information. Here the authors encode data in synthesized libraries of Ugi products.
Collapse
|
107
|
Hao M, Qiao J, Qi H. Current and Emerging Methods for the Synthesis of Single-Stranded DNA. Genes (Basel) 2020; 11:E116. [PMID: 31973021 PMCID: PMC7073533 DOI: 10.3390/genes11020116] [Citation(s) in RCA: 28] [Impact Index Per Article: 5.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 01/16/2020] [Accepted: 01/18/2020] [Indexed: 12/21/2022] Open
Abstract
Methods for synthesizing arbitrary single-strand DNA (ssDNA) fragments are rapidly becoming fundamental tools for gene editing, DNA origami, DNA storage, and other applications. To meet the rising application requirements, numerous methods have been developed to produce ssDNA. Some approaches allow the synthesis of freely chosen user-defined ssDNA sequences to overcome the restrictions and limitations of different length, purity, and yield. In this perspective, we provide an overview of the representative ssDNA production strategies and their most significant challenges to enable the readers to make informed choices of synthesis methods and enhance the availability of increasingly inexpensive synthetic ssDNA. We also aim to stimulate a broader interest in the continued development of efficient ssDNA synthesis techniques and improve their applications in future research.
Collapse
Affiliation(s)
- Min Hao
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China; (M.H.); (J.Q.)
- Key Laboratory of Systems Bioengineering of Ministry of Education, Tianjin University, Tianjin 300072, China
- SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering, Tianjin University, Tianjin 300072, China
| | - Jianjun Qiao
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China; (M.H.); (J.Q.)
- Key Laboratory of Systems Bioengineering of Ministry of Education, Tianjin University, Tianjin 300072, China
- SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering, Tianjin University, Tianjin 300072, China
| | - Hao Qi
- School of Chemical Engineering and Technology, Tianjin University, Tianjin 300072, China; (M.H.); (J.Q.)
- Key Laboratory of Systems Bioengineering of Ministry of Education, Tianjin University, Tianjin 300072, China
- SynBio Research Platform, Collaborative Innovation Center of Chemical Science and Engineering, Tianjin University, Tianjin 300072, China
| |
Collapse
|
108
|
Dong Y, Sun F, Ping Z, Ouyang Q, Qian L. DNA storage: research landscape and future prospects. Natl Sci Rev 2020; 7:1092-1107. [PMID: 34692128 PMCID: PMC8288837 DOI: 10.1093/nsr/nwaa007] [Citation(s) in RCA: 90] [Impact Index Per Article: 18.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Revised: 01/06/2020] [Accepted: 01/18/2020] [Indexed: 02/06/2023] Open
Abstract
Abstract
The global demand for data storage is currently outpacing the world's storage capabilities. DNA, the carrier of natural genetic information, offers a stable, resource- and energy-efficient and sustainable data storage solution. In this review, we summarize the fundamental theory, research history, and technical challenges of DNA storage. From a quantitative perspective, we evaluate the prospect of DNA, and organic polymers in general, as a novel class of data storage medium.
Collapse
Affiliation(s)
- Yiming Dong
- Center for Quantitative Biology and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Fajia Sun
- Center for Quantitative Biology and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| | - Zhi Ping
- Academician Workstation of BGI Synthetic Genomics, BGI-Shenzhen, Shenzhen 518083, China
| | - Qi Ouyang
- Center for Quantitative Biology and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
- The State Key Laboratory for Artificial Microstructures and Mesoscopic Physics, School of Physics, Peking University, Beijing 100871, China
| | - Long Qian
- Center for Quantitative Biology and Peking-Tsinghua Center for Life Sciences, Peking University, Beijing 100871, China
| |
Collapse
|
109
|
Enhancing Terminal Deoxynucleotidyl Transferase Activity on Substrates with 3' Terminal Structures for Enzymatic De Novo DNA Synthesis. Genes (Basel) 2020; 11:genes11010102. [PMID: 31963235 PMCID: PMC7016565 DOI: 10.3390/genes11010102] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/08/2019] [Revised: 12/31/2019] [Accepted: 01/07/2020] [Indexed: 01/13/2023] Open
Abstract
Enzymatic oligonucleotide synthesis methods based on the template-independent polymerase terminal deoxynucleotidyl transferase (TdT) promise to enable the de novo synthesis of long oligonucleotides under mild, aqueous conditions. Intermediates with a 3′ terminal structure (hairpins) will inevitably arise during synthesis, but TdT has poor activity on these structured substrates, limiting its usefulness for oligonucleotide synthesis. Here, we described two parallel efforts to improve the activity of TdT on hairpins: (1) optimization of the concentrations of the divalent cation cofactors and (2) engineering TdT for enhanced thermostability, enabling reactions at elevated temperatures. By combining both of these improvements, we obtained a ~10-fold increase in the elongation rate of a guanine-cytosine hairpin.
Collapse
|
110
|
Higashikuni Y, Lu TK. Advancing CRISPR-Based Programmable Platforms beyond Genome Editing in Mammalian Cells. ACS Synth Biol 2019; 8:2607-2619. [PMID: 31751114 DOI: 10.1021/acssynbio.9b00297] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
Abstract
Human diseases are caused by dysregulation of cellular biological programs that are encoded in DNA. Unveiling the endogenous programs and encoding new programs into the genome are key to creating novel diagnostic and therapeutic strategies. CRISPR/Cas9, originally identified in bacteria, has revolutionized genome editing in mammalian cells. Recent advances in CRISPR technologies have provided new programmable platforms for modifying cell function and behavior. CRISPR-based transcriptional regulators and modified gRNAs have enabled multiplexed regulation and visualization of genome dynamics with spatiotemporal precision. Using these toolkits, genome-scale screening platforms can identify key genetic elements or combinations thereof that modulate phenotypes in mammalian cells. In addition, imaging platforms for multiplexed genomic labeling have been created to study the conformation and dynamics of chromatin in living cells, which are essential for genome function. Furthermore, CRISPR-based computation and memory platforms have been built in living mammalian cells by using DNA as a data processing and storage medium to regulate and monitor cellular behaviors. The conditional regulation of CRISPR-based parts has enabled the design of complex multilayered biological programs. CRISPR-based memory platforms can continuously record biological events as mutations in defined DNA loci. By making use of base editors, CRISPR-based computation and memory platforms have been interconnected to perform logic operations based on past events. These technologies open up new avenues for understanding biological phenomena and designing mammalian cells as living machines for biomedical applications.
Collapse
|
111
|
Abstract
Because of its longevity and enormous information density, DNA is considered a promising data storage medium. In this work, we provide instructions for archiving digital information in the form of DNA and for subsequently retrieving it from the DNA. In principle, information can be represented in DNA by simply mapping the digital information to DNA and synthesizing it. However, imperfections in synthesis, sequencing, storage and handling of the DNA induce errors within the molecules, making error-free information storage challenging. The procedure discussed here enables error-free storage by protecting the information using error-correcting codes. Specifically, in this protocol, we provide the technical details and precise instructions for translating digital information to DNA sequences, physically handling the biomolecules, storing them and subsequently re-obtaining the information by sequencing the DNA. Along with the protocol, we provide computer code that automatically encodes digital information to DNA sequences and decodes the information back from DNA to a digital file. The required software is provided on a Github repository. The protocol relies on commercial DNA synthesis and DNA sequencing via Illumina dye sequencing, and requires 1-2 h of preparation time, 1/2 d for sequencing preparation and 2-4 h for data analysis. This protocol focuses on storage scales of ~100 kB to 15 MB, offering an ideal starting point for small experiments. It can be augmented to enable higher data volumes and random access to the data and also allows for future sequencing and synthesis technologies, by changing the parameters of the encoder/decoder to account for the corresponding error rates.
Collapse
|
112
|
Ostrov N, Beal J, Ellis T, Gordon DB, Karas BJ, Lee HH, Lenaghan SC, Schloss JA, Stracquadanio G, Trefzer A, Bader JS, Church GM, Coelho CM, Efcavitch JW, Güell M, Mitchell LA, Nielsen AAK, Peck B, Smith AC, Stewart CN, Tekotte H. Technological challenges and milestones for writing genomes. Science 2019; 366:310-312. [DOI: 10.1126/science.aay0339] [Citation(s) in RCA: 34] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023]
Affiliation(s)
- Nili Ostrov
- Author affiliations are listed in the supplementary materials
| | - Jacob Beal
- Author affiliations are listed in the supplementary materials
| | - Tom Ellis
- Author affiliations are listed in the supplementary materials
| | | | | | - Henry H. Lee
- Author affiliations are listed in the supplementary materials
| | | | | | | | - Axel Trefzer
- Author affiliations are listed in the supplementary materials
| | - Joel S. Bader
- Author affiliations are listed in the supplementary materials
| | | | | | | | - Marc Güell
- Author affiliations are listed in the supplementary materials
| | | | | | - Bill Peck
- Author affiliations are listed in the supplementary materials
| | | | - C. Neal Stewart
- Author affiliations are listed in the supplementary materials
| | - Hille Tekotte
- Author affiliations are listed in the supplementary materials
| |
Collapse
|
113
|
Data storage in DNA with fewer synthesis cycles using composite DNA letters. Nat Biotechnol 2019; 37:1229-1236. [PMID: 31501560 DOI: 10.1038/s41587-019-0240-x] [Citation(s) in RCA: 79] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2018] [Accepted: 07/25/2019] [Indexed: 12/24/2022]
Abstract
The density and long-term stability of DNA make it an appealing storage medium, particularly for long-term data archiving. Existing DNA storage technologies involve the synthesis and sequencing of multiple nominally identical molecules in parallel, resulting in information redundancy. We report the development of encoding and decoding methods that exploit this redundancy using composite DNA letters. A composite DNA letter is a representation of a position in a sequence that consists of a mixture of all four DNA nucleotides in a predetermined ratio. Our methods encode data using fewer synthesis cycles. We encode 6.4 MB into composite DNA, with distinguishable composition medians, using 20% fewer synthesis cycles per unit of data, as compared to previous reports. We also simulate encoding with larger composite alphabets, with distinguishable composition deciles, to show that 75% fewer synthesis cycles are potentially sufficient. We describe applicable error-correcting codes and inference methods, and investigate error patterns in the context of composite DNA letters.
Collapse
|
114
|
Deshpande S, Yang Y, Chilkoti A, Zauscher S. Enzymatic synthesis and modification of high molecular weight DNA using terminal deoxynucleotidyl transferase. Methods Enzymol 2019; 627:163-188. [PMID: 31630739 PMCID: PMC7241426 DOI: 10.1016/bs.mie.2019.07.044] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/12/2023]
Abstract
The recognition that nucleic acids can be used as polymeric materials led to the blossoming of the field of DNA nanotechnology, with a broad range of applications in biotechnology, biosensors, diagnostics, and drug delivery. These applications require efficient methods to synthesize and chemically modify high molecular weight DNA. Here, we discuss terminal deoxynucleotidyl transferase (TdT)-catalyzed enzymatic polymerization (TcEP) as an alternative to conventional enzymatic and solid-phase DNA synthesis. We describe biochemical requirements for TcEP and provide step-by-step protocols to carry out TcEP in solution and from surfaces.
Collapse
Affiliation(s)
- Sonal Deshpande
- Department of Biomedical Engineering, Duke University, Durham, NC, United States
| | - Yunqi Yang
- Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC, United States
| | - Ashutosh Chilkoti
- Department of Biomedical Engineering, Duke University, Durham, NC, United States; Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC, United States.
| | - Stefan Zauscher
- Department of Biomedical Engineering, Duke University, Durham, NC, United States; Department of Mechanical Engineering and Materials Science, Duke University, Durham, NC, United States.
| |
Collapse
|