1
|
Shen P, Zheng Y, Zhang C, Li S, Chen Y, Chen Y, Liu Y, Cai Z. DNA storage: The future direction for medical cold data storage. Synth Syst Biotechnol 2025; 10:677-695. [PMID: 40235856 PMCID: PMC11999466 DOI: 10.1016/j.synbio.2025.03.006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2024] [Revised: 03/11/2025] [Accepted: 03/12/2025] [Indexed: 04/17/2025] Open
Abstract
DNA storage, characterized by its durability, data density, and cost-effectiveness, is a promising solution for managing the increasing data volumes in healthcare. This review explores state-of-the-art DNA storage technologies, and provides insights into designing a DNA storage system tailored for medical cold data. We anticipate that a practical approach for medical cold data storage will involve establishing regional, in vitro DNA storage centers that can serve multiple hospitals. The immediacy of DNA storage for medical data hinges on the development of novel, high-density, specialized coding methods. Established commercial techniques, such as DNA chemical synthesis and next-generation sequencing (NGS), along with mixed drying with alkaline salts and refined Polymerase Chain Reaction (PCR), potentially represent the optimal options for data writing, reading, storage, and accessing, respectively. Data security could be promised by the integration of traditional digital encryption and DNA steganography. Although breakthrough developments like artificial nucleotides and DNA nanostructures show potential, they remain in the laboratory research phase. In conclusion, DNA storage is a viable preservation strategy for medical cold data in the near future.
Collapse
Affiliation(s)
- Peilin Shen
- Department of Urology, The First Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong Province, PR China
- Shantou University Medical College, Shantou, Guangdong Province, PR China
| | - Yukui Zheng
- The First Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong Province, PR China
- Shantou University Medical College, Shantou, Guangdong Province, PR China
| | - CongYu Zhang
- Shantou University Medical College, Shantou, Guangdong Province, PR China
| | - Shuo Li
- School of Artificial Intelligence, University of Chinese Academy of Sciences, Beijing, PR China
- BGI-Shenzhen, Shenzhen, Guangdong Province, PR China
- BGI Hospital Groups, Ltd., Shenzhen, Guangdong Province, PR China
| | - Yongru Chen
- Department of Emergency Intensive Care Unit, The First Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong Province, PR China
| | - Yongsong Chen
- Department of Endocrinology, The First Affiliated Hospital of Shantou University Medical College, Shantou, Guangdong Province, PR China
| | - Yuchen Liu
- Shenzhen Institute of Translational Medicine, Shenzhen Second People's Hospital, The First Affiliated Hospital of Shenzhen University, Health Science Center, Shenzhen University, Shenzhen, Guangdong Province, PR China
- Key Laboratory of Medical Reprogramming Technology, Shenzhen Second People's Hospital, The First Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong Province, PR China
- Shenzhen Institute of Synthetic Biology, Shenzhen Institutes of Advanced Technology, Chinese Academy of Sciences, Guangdong Province, PR China
| | - Zhiming Cai
- Shantou University Medical College, Shantou, Guangdong Province, PR China
- Key Laboratory of Medical Reprogramming Technology, Shenzhen Second People's Hospital, The First Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong Province, PR China
- Guangdong Key Laboratory of Systems Biology and Synthetic Biology for Urogenital Tumors, Shenzhen, Guangdong Province, PR China
- State Engineering Laboratory of Medical Key Technologies Application of Synthetic Biology, Shenzhen Second People's Hospital, The First Affiliated Hospital of Shenzhen University, Shenzhen, Guangdong Province, PR China
- Carson International Cancer Center of Shenzhen University, Shenzhen, Guangdong Province, PR China
| |
Collapse
|
2
|
Luo H, Huang W, He Z, Fang Y, Tian Y, Xiong Z. Engineered Living Memory Microspheroid-Based Archival File System for Random Accessible In Vivo DNA Storage. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2025; 37:e2415358. [PMID: 39981833 DOI: 10.1002/adma.202415358] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/09/2024] [Revised: 02/12/2025] [Indexed: 02/22/2025]
Abstract
Given its exceptional durability and high information density, deoxyribonucleic acid (DNA) has the potential to meet the escalating global demand for data storage if it can be stored efficiently and accessed randomly in exabyte-to-yottabyte-scale databases. Here, this work introduces the Engineered Living Memory Microspheroid (ELMM) as a novel material for DNA data storage, retrieval, and management. This work engineers a plasmid library and devises a random access strategy pairing plasmid function with DNA data in a key-value format. Each DNA segment is integrated with its corresponding plasmid, introduced into bacteria, and encapsulated within matrix material via droplet microfluidics within 5 min. ELMMs can be stored at room temperature following lyophilization and, upon rehydration, each type of ELMM exhibits specific functions expressed by the plasmids, allowing for physical differentiation based on these characteristics. This work demonstrates fluorescent expression as the plasmid function and employs fluorescence-based sorting access image files in a prototype database. By utilizing N optical channels, to retrieve 2N file types, each with a minimum of 10 copies. ELMM offers a digital-to-biological information solution, ensuring the preservation, access, replication, and management of files within large-scale DNA databases.
Collapse
Affiliation(s)
- Hao Luo
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, China
- Biomanufacturing and Engineering Living Systems Innovation International Talents Base (111 Base), Beijing, 100084, China
| | - Wen Huang
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, China
- Biomanufacturing and Engineering Living Systems Innovation International Talents Base (111 Base), Beijing, 100084, China
| | - ZhongHui He
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, China
- Biomanufacturing and Engineering Living Systems Innovation International Talents Base (111 Base), Beijing, 100084, China
| | - Yongcong Fang
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, China
- Biomanufacturing and Engineering Living Systems Innovation International Talents Base (111 Base), Beijing, 100084, China
| | - Yueming Tian
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, China
- Biomanufacturing and Engineering Living Systems Innovation International Talents Base (111 Base), Beijing, 100084, China
| | - Zhuo Xiong
- Biomanufacturing Center, Department of Mechanical Engineering, Tsinghua University, Beijing, 100084, China
- Biomanufacturing and Rapid Forming Technology Key Laboratory of Beijing, Beijing, 100084, China
- Biomanufacturing and Engineering Living Systems Innovation International Talents Base (111 Base), Beijing, 100084, China
| |
Collapse
|
3
|
Ge Q, Qin R, Liu S, Guo Q, Han C, Chen W. Pragmatic soft-decision data readout of encoded large DNA. Brief Bioinform 2025; 26:bbaf102. [PMID: 40091194 PMCID: PMC11911122 DOI: 10.1093/bib/bbaf102] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/17/2024] [Revised: 02/06/2025] [Accepted: 02/24/2025] [Indexed: 03/19/2025] Open
Abstract
The encoded large DNA can be cloned and stored in vivo, capable of write-once and stable replication for multiple retrievals, offering potential in economic data archiving. Nanopore sequencing is advantageous in data access of large DNA due to its rapidity and long-read sequencing capability. However, the data readout is commonly limited by insertion and deletion (indel) errors and sequence assembly complexity. Here, a pragmatic soft-decision data readout is presented, achieving assembly-free sequence reconstruction, indel error correction, and ultra-low coverage data readout. Specifically, the watermark is cleverly embedded within large DNA fragments, allowing for the direct localization of raw reads via watermark alignment to avoid complex read assembly. A soft-decision forward-backward algorithm is proposed, which can identify indel errors and provide probability information to the error correction code, enabling error-free data recovery. Additionally, a minimum state transition is maintained, and a read segmentation is incorporated to achieve fast information reading. The readout assays for two circular plasmids (~51 kb) with different coding rates were demonstrated and achieved error-free recovery directly from noisy reads (error rate ~1%) at coverage of 1-4×. Simulations conducted on large-scale datasets across various error rates further confirm the scalability of the method and its robust performance under extreme conditions. This readout method enables nearly single-molecule recovery of large DNA, particularly suitable for rapid readout of DNA storage.
Collapse
Affiliation(s)
- Qi Ge
- School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
| | - Rui Qin
- School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
| | - Shuang Liu
- School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
| | - Quan Guo
- School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
| | - Changcai Han
- School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
- Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
| | - Weigang Chen
- School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
- Tianjin Key Laboratory of Imaging and Sensing Microelectronic Technology, School of Microelectronics, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
- Frontier Science Center for Synthetic Biology, Ministry of Education, Tianjin University, No. 92 Weijin Road, Nankai District, Tianjin 300072, China
| |
Collapse
|
4
|
Liu X, Yu E, Zhao Q, Han H, Li Q. Enzymes as green and sustainable tools for DNA data storage. Chem Commun (Camb) 2025; 61:2891-2905. [PMID: 39834292 DOI: 10.1039/d4cc06351a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2025]
Abstract
DNA is considered as an ideal supramolecular material for information storage with high storage density and long-term stability. Enzymes, as green and sustainable tools, offer several unique advantages for DNA-based information storage. These advantages include low cost and reduced generation of hazardous wastes during DNA synthesis, as well as the improvements in data reading speed and data recovery accuracy. Moreover, enzymes could achieve scalable data steganography. In this review, we introduced the exciting application strategies of enzymatic tools in each step of DNA information storage (writing, storing, retrieval and reading). We further address the challenges and opportunities associated with enzymatic tools for DNA information storage, aiming at developing new techniques to overcome these obstacles.
Collapse
Affiliation(s)
- Xutong Liu
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130012, China.
| | - Enyang Yu
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130012, China.
| | - Qixuan Zhao
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130012, China.
| | - Haobo Han
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130012, China.
| | - Quanshun Li
- Key Laboratory for Molecular Enzymology and Engineering of Ministry of Education, School of Life Sciences, Jilin University, Changchun 130012, China.
| |
Collapse
|
5
|
Bar-Lev D, Sabary O, Yaakobi E. The zettabyte era is in our DNA. NATURE COMPUTATIONAL SCIENCE 2024; 4:813-817. [PMID: 39516373 DOI: 10.1038/s43588-024-00717-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/06/2023] [Accepted: 10/03/2024] [Indexed: 11/16/2024]
Abstract
This Perspective surveys the critical computational challenges associated with in vitro DNA-based data storage. As digital data expand exponentially, traditional storage media are becoming less viable, making DNA a promising solution due to its density and durability. However, numerous obstacles remain, including error correction, data retrieval from large volumes of noisy reads, and scalability. The Perspective also highlights challenges for DNA-based data centers, such as fault tolerance, random access, and data removal, which must be addressed to make DNA-based storage practical.
Collapse
Affiliation(s)
- Daniella Bar-Lev
- The Henry and Marilyn Taub Faculty of Computer Science, Technion, Israel Institute of Technology, Haifa, Israel.
| | - Omer Sabary
- The Henry and Marilyn Taub Faculty of Computer Science, Technion, Israel Institute of Technology, Haifa, Israel.
| | - Eitan Yaakobi
- The Henry and Marilyn Taub Faculty of Computer Science, Technion, Israel Institute of Technology, Haifa, Israel.
| |
Collapse
|
6
|
Zhao X, Li J, Fan Q, Dai J, Long Y, Liu R, Zhai J, Pan Q, Li Y. Composite Hedges Nanopores codec system for rapid and portable DNA data readout with high INDEL-Correction. Nat Commun 2024; 15:9395. [PMID: 39477940 PMCID: PMC11525716 DOI: 10.1038/s41467-024-53455-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2024] [Accepted: 10/11/2024] [Indexed: 11/02/2024] Open
Abstract
Reading digital information from highly dense but lightweight DNA medium nowadays relies on time-consuming next-generation sequencing. Nanopore sequencing holds the promise to overcome the efficiency problem, but high indel error rates lead to the requirement of large amount of high quality data for accurate readout. Here we introduce Composite Hedges Nanopores, capable of handling indel rates up to 15.9% and substitution rates up to 7.8%. The overall information density can be doubled from 0.59 to 1.17 by utilizing a degenerated eight-letter alphabet. We demonstrate that sequencing times of 20 and 120 minutes are sufficient for processing representative text and image files, respectively. Moreover, to achieve complete data recovery, it is estimated that text and image data require 4× and 8× physical redundancy of composite strands, respectively. Our codec system excels on both molecular design and equalized dictionary usage, laying a solid foundation approaching to real-time DNA data retrieval and encoding.
Collapse
Affiliation(s)
- Xuyang Zhao
- School of Microelectronics, MOE Engineering Research Center of Integrated Circuits for Next Generation Communications, Southern University of Science and Technology, Shenzhen, China
| | - Junyao Li
- School of Microelectronics, MOE Engineering Research Center of Integrated Circuits for Next Generation Communications, Southern University of Science and Technology, Shenzhen, China
| | - Qingyuan Fan
- School of Microelectronics, MOE Engineering Research Center of Integrated Circuits for Next Generation Communications, Southern University of Science and Technology, Shenzhen, China
| | - Jing Dai
- School of Microelectronics, MOE Engineering Research Center of Integrated Circuits for Next Generation Communications, Southern University of Science and Technology, Shenzhen, China
| | - Yanping Long
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Ronghui Liu
- School of Microelectronics, MOE Engineering Research Center of Integrated Circuits for Next Generation Communications, Southern University of Science and Technology, Shenzhen, China
| | - Jixian Zhai
- Department of Biology, School of Life Sciences, Southern University of Science and Technology, Shenzhen, China
| | - Qing Pan
- College of Information Engineering, Zhejiang University of Technology, Hangzhou, China.
| | - Yi Li
- School of Microelectronics, MOE Engineering Research Center of Integrated Circuits for Next Generation Communications, Southern University of Science and Technology, Shenzhen, China.
| |
Collapse
|
7
|
Zhong W, Geng C, Fu Z, Mao C, Zheng Y, Wang S, Liu K, Yang Y, Lu C, Jiang X. Flow Cytometry Sorting for Random Access in DNA Data Storage: Encapsulation for Enhanced Stability and Sequence Integrity of DNA. Anal Chem 2024; 96:16099-16108. [PMID: 39319639 DOI: 10.1021/acs.analchem.4c04637] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 09/26/2024]
Abstract
As digital data undergo explosive growth, deoxyribonucleic acid (DNA) has emerged as a promising storage medium due to its high density, longevity, and ease of replication, offering vast potential in data storage solutions. This study focuses on the protection and retrieval of data during the DNA storage process, developing a technique that employs flow cytometry sorting (FCS) to segregate multicolored fluorescent DNA microparticles encoded with data and facilitating efficient random access. Moreover, the encapsulated fluorescent DNA microparticles, formed through layer-by-layer self-assembly, preserve structural and sequence integrity even under harsh conditions while also supporting a high-density DNA payload. Experimental results have shown that the encoded data can still be successfully recovered from encapsulated DNA microparticles following de-encapsulation. We also successfully demonstrated the automated encapsulation process of fluorescent DNA microparticles using a microfluidic chip. This research provides an innovative approach to the long-term stability and random readability of DNA data storage.
Collapse
Affiliation(s)
- Wukun Zhong
- MOE Key Laboratory for Analytical Science of Food Safety and Biology, College of Chemistry, Fuzhou University, Fuzhou 350108, China
| | - Chunyang Geng
- Shenzhen Key Laboratory of Smart Healthcare Engineering, Guangdong Provincial Key Laboratory of Advanced Biomaterials, Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen, Guangdong 518055, China
| | - Zhangcheng Fu
- MOE Key Laboratory for Analytical Science of Food Safety and Biology, College of Chemistry, Fuzhou University, Fuzhou 350108, China
| | - Cuiping Mao
- Shenzhen Key Laboratory of Smart Healthcare Engineering, Guangdong Provincial Key Laboratory of Advanced Biomaterials, Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen, Guangdong 518055, China
| | - Yanlin Zheng
- MOE Key Laboratory for Analytical Science of Food Safety and Biology, College of Chemistry, Fuzhou University, Fuzhou 350108, China
| | - Saijie Wang
- Shenzhen Key Laboratory of Smart Healthcare Engineering, Guangdong Provincial Key Laboratory of Advanced Biomaterials, Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen, Guangdong 518055, China
| | - Kai Liu
- Engineering Research Center of Advanced Rare Earth Materials (Ministry of Education), Department of Chemistry, Tsinghua University, Beijing 100084, China
| | - Yang Yang
- Institute of Molecular Medicine and Shanghai Key Laboratory for Nucleic Acid Chemistry and Nanomedicine, State Key Laboratory of Oncogenes and Related Genes, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, Shanghai 200127, China
| | - Chunhua Lu
- MOE Key Laboratory for Analytical Science of Food Safety and Biology, College of Chemistry, Fuzhou University, Fuzhou 350108, China
| | - Xingyu Jiang
- Shenzhen Key Laboratory of Smart Healthcare Engineering, Guangdong Provincial Key Laboratory of Advanced Biomaterials, Department of Biomedical Engineering, Southern University of Science and Technology, Shenzhen, Guangdong 518055, China
| |
Collapse
|
8
|
Yu M, Tang X, Li Z, Wang W, Wang S, Li M, Yu Q, Xie S, Zuo X, Chen C. High-throughput DNA synthesis for data storage. Chem Soc Rev 2024; 53:4463-4489. [PMID: 38498347 DOI: 10.1039/d3cs00469d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/20/2024]
Abstract
With the explosion of digital world, the dramatically increasing data volume is expected to reach 175 ZB (1 ZB = 1012 GB) in 2025. Storing such huge global data would consume tons of resources. Fortunately, it has been found that the deoxyribonucleic acid (DNA) molecule is the most compact and durable information storage medium in the world so far. Its high coding density and long-term preservation properties make itself one of the best data storage carriers for the future. High-throughput DNA synthesis is a key technology for "DNA data storage", which encodes binary data stream (0/1) into quaternary long DNA sequences consisting of four bases (A/G/C/T). In this review, the workflow of DNA data storage and the basic methods of artificial DNA synthesis technology are outlined first. Then, the technical characteristics of different synthesis methods and the state-of-the-art of representative commercial companies, with a primary focus on silicon chip microarray-based synthesis and novel enzymatic DNA synthesis are presented. Finally, the recent status of DNA storage and new opportunities for future development in the field of high-throughput, large-scale DNA synthesis technology are summarized.
Collapse
Affiliation(s)
- Meng Yu
- Institute of Medical Chips, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, 200025, Shanghai, China.
- School of Microelectronics, Shanghai University, 201800, Shanghai, China
- Shanghai Industrial μTechnology Research Institute, 201800, Shanghai, China
| | - Xiaohui Tang
- Institute of Medical Chips, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, 200025, Shanghai, China.
- Shanghai Industrial μTechnology Research Institute, 201800, Shanghai, China
| | - Zhenhua Li
- Institute of Medical Chips, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, 200025, Shanghai, China.
- Shanghai Industrial μTechnology Research Institute, 201800, Shanghai, China
| | - Weidong Wang
- Shanghai Industrial μTechnology Research Institute, 201800, Shanghai, China
| | - Shaopeng Wang
- Institute of Molecular Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, 200127, Shanghai, China.
| | - Min Li
- Institute of Molecular Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, 200127, Shanghai, China.
| | - Qiuliyang Yu
- Shenzhen Key Laboratory for the Intelligent Microbial Manufacturing of Medicines, Shenzhen Institute of Advanced Technology, Chinese Academy of Sciences, 518055, Shenzhen, China
| | - Sijia Xie
- Institute of Medical Chips, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, 200025, Shanghai, China.
- School of Microelectronics, Shanghai University, 201800, Shanghai, China
- Shanghai Industrial μTechnology Research Institute, 201800, Shanghai, China
| | - Xiaolei Zuo
- Institute of Molecular Medicine, Renji Hospital, School of Medicine, Shanghai Jiao Tong University, 200127, Shanghai, China.
| | - Chang Chen
- Institute of Medical Chips, Ruijin Hospital, Shanghai Jiao Tong University School of Medicine, 200025, Shanghai, China.
- School of Microelectronics, Shanghai University, 201800, Shanghai, China
- Shanghai Industrial μTechnology Research Institute, 201800, Shanghai, China
- State Key Laboratory of Transducer Technology, Shanghai Institute of Microsystem and Information Technology, Chinese Academy of Sciences, 200050, Shanghai, China
| |
Collapse
|
9
|
Hou Z, Qiang W, Wang X, Chen X, Hu X, Han X, Shen W, Zhang B, Xing P, Shi W, Dai J, Huang X, Zhao G. "Cell Disk" DNA Storage System Capable of Random Reading and Rewriting. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2024; 11:e2305921. [PMID: 38332565 PMCID: PMC11022697 DOI: 10.1002/advs.202305921] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/21/2023] [Revised: 11/23/2023] [Indexed: 02/10/2024]
Abstract
DNA has emerged as an appealing material for information storage due to its great storage density and durability. Random reading and rewriting are essential tasks for practical large-scale data storage. However, they are currently difficult to implement simultaneously in a single DNA-based storage system, strongly limiting their practicability. Here, a "Cell Disk" storage system is presented, achieving high-density in vivo DNA data storage that enables both random reading and rewriting. In this system, each yeast cell is used as a chamber to store information, similar to a "disk block" but with the ability to self-replicate. Specifically, each genome of yeast cell has a customized CRISPR/Cas9-based "lock-and-key" module inserted, which allows selective retrieval, erasure, or rewriting of the targeted cell "block" from a pool of cells ("disk"). Additionally, a codec algorithm with lossless compression ability is developed to improve the information density of each cell "block". As a proof of concept, target-specific reading and rewriting of the compressed data from a mimic cell "disk" comprising up to 105 "blocks" are demonstrated and achieve high specificity and reliability. The "Cell Disk" system described here concurrently supports random reading and rewriting, and it should have great scalability for practical data storage use.
Collapse
Affiliation(s)
- Zhaohua Hou
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Wei Qiang
- Shenzhen Key Laboratory of Synthetic GenomicsGuangdong Provincial Key Laboratory of Synthetic GenomicsShenzhen Institute of Synthetic BiologyShenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdong518055P. R. China
| | - Xiangxiang Wang
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Xiaoxu Chen
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Xin Hu
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Xuye Han
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Wenlu Shen
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Bing Zhang
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Peng Xing
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Wenping Shi
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| | - Junbiao Dai
- Shenzhen Key Laboratory of Synthetic GenomicsGuangdong Provincial Key Laboratory of Synthetic GenomicsShenzhen Institute of Synthetic BiologyShenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdong518055P. R. China
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at ShenzhenChinese Academy of Agricultural SciencesShenzhenP. R. China
| | - Xiaoluo Huang
- Shenzhen Key Laboratory of Synthetic GenomicsGuangdong Provincial Key Laboratory of Synthetic GenomicsShenzhen Institute of Synthetic BiologyShenzhen Institutes of Advanced TechnologyChinese Academy of SciencesShenzhenGuangdong518055P. R. China
| | - Guanghou Zhao
- School of Ecology and EnvironmentNorthwestern Polytechnical University1 Dongxiang Road, Chang'an DistrictXi'anShaanxi710129P. R. China
| |
Collapse
|
10
|
Yeom H, Kim N, Lee AC, Kim J, Kim H, Choi H, Song SW, Kwon S, Choi Y. Highly Accurate Sequence- and Position-Independent Error Profiling of DNA Synthesis and Sequencing. ACS Synth Biol 2023; 12:3567-3577. [PMID: 37961855 PMCID: PMC10729760 DOI: 10.1021/acssynbio.3c00308] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 11/01/2023] [Accepted: 11/01/2023] [Indexed: 11/15/2023]
Abstract
A comprehensive error analysis of DNA-stored data during processing, such as DNA synthesis and sequencing, is crucial for reliable DNA data storage. Both synthesis and sequencing errors depend on the sequence and the transition of bases of nucleotides; ignoring either one of the error sources leads to technical challenges in minimizing the error rate. Here, we present a methodology and toolkit that utilizes an oligonucleotide library generated from a 10-base-shifted sequence array, which is individually labeled with unique molecular identifiers, to delineate and profile DNA synthesis and sequencing errors simultaneously. This methodology enables position- and sequence-independent error profiling of both DNA synthesis and sequencing. Using this toolkit, we report base transitional errors in both synthesis and sequencing in general DNA data storage as well as degenerate-base-augmented DNA data storage. The methodology and data presented will contribute to the development of DNA sequence designs with minimal error.
Collapse
Affiliation(s)
- Huiran Yeom
- Division
of Data Science, College of Information and Communication Technology, The University of Suwon, Hwaseong 18323, Republic of Korea
| | - Namphil Kim
- Department
of Electrical and Computer Engineering, Seoul National University, Seoul 08826, South Korea
| | | | - Jinhyun Kim
- Department
of Electrical and Computer Engineering, Seoul National University, Seoul 08826, South Korea
| | - Hamin Kim
- Department
of Interdisciplinary Program for Bioengineering, Seoul National University, Seoul 08826, South Korea
| | - Hansol Choi
- Bio-MAX
Institute, Seoul National University, Seoul 08826, Republic of Korea
| | - Seo Woo Song
- Basic Science
and Engineering Initiative, Children’s Heart Center, Stanford University, Stanford, California 94304, United States
| | - Sunghoon Kwon
- Department
of Electrical and Computer Engineering, Seoul National University, Seoul 08826, South Korea
- Department
of Interdisciplinary Program for Bioengineering, Seoul National University, Seoul 08826, South Korea
- Bio-MAX
Institute, Seoul National University, Seoul 08826, Republic of Korea
| | - Yeongjae Choi
- School
of Materials Science and Engineering, Gwangju
Institute of Science and Technology (GIST), Gwangju 61105, Republic of Korea
| |
Collapse
|