1. Roberts NG, Gilmore MJ, Struck TH, Kocot KM. Multiple Displacement Amplification Facilitates SMRT Sequencing of Microscopic Animals and the Genome of the Gastrotrich Lepidodermella squamata (Dujardin 1841). Genome Biol Evol 2024; 16:evae254. [PMID: 39590608 DOI: 10.1093/gbe/evae254] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/31/2024] [Revised: 11/11/2024] [Accepted: 11/14/2024] [Indexed: 11/28/2024] Open
Abstract
Obtaining adequate DNA for long-read genome sequencing remains a roadblock to producing contiguous genomes from small-bodied organisms, hindering understanding of phylogenetic relationships and genome evolution. Multiple displacement amplification leverages Phi29 DNA polymerase to produce micrograms of DNA from picograms of input. However, multiple displacement amplification's inherent biases in amplification related to guanine and cytosine (GC) content, repeat content and chimera production are a problem for long-read genome assembly, which has been little investigated. We explored the utility of multiple displacement amplification for generating template DNA for High Fidelity (HiFi) sequencing directly from living cells of Caenorhabditis elegans (Nematoda) and Lepidodermella squamata (Gastrotricha) containing one order of magnitude less DNA than required for the PacBio Ultra-Low DNA Input Workflow. High Fidelity sequencing of libraries prepared from multiple displacement amplification products resulted in highly contiguous and complete genomes for both C. elegans (102 Mbp assembly; 336 contigs; N50 = 868 kbp; L50 = 39; BUSCO_nematoda_nucleotide: S:96.1%, D:2.8%) and L. squamata (122 Mbp assembly; 157 contigs; N50 = 3.9 Mbp; L50 = 13; BUSCO_metazoa_nucleotide: S:80.8%, D:2.8%). Coverage uniformity for reads from multiple displacement amplification DNA (Gini Index: 0.14, normalized mean across all 100 kbp blocks: 0.49) and reads from pooled nematode DNA (Gini Index: 0.16, normalized mean across all 100 kbp blocks: 0.49) proved similar. Using this approach, we sequenced the genome of the microscopic invertebrate L. squamata (Gastrotricha), the first of its phylum. Using the newly sequenced genome, we infer Gastrotricha's long-debated phylogenetic position as the sister taxon of Platyhelminthes and conduct a comparative analysis of the Hox cluster.
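The contiguity and coverage-uniformity figures quoted in this abstract (N50, L50, Gini index) can be computed from simple inputs. The sketch below is a minimal illustration, assuming a plain list of contig lengths and a list of per-block mean coverages; the function names `n50_l50` and `gini` are hypothetical, and this is not the authors' pipeline.

```python
def n50_l50(contig_lengths):
    """Return (N50, L50) for a list of contig lengths.

    N50 is the length of the contig at which the running total, taken from
    the longest contig downward, first reaches half of the assembly size.
    L50 is the number of contigs needed to reach that point.
    """
    ordered = sorted(contig_lengths, reverse=True)
    if not ordered:
        raise ValueError("contig_lengths must not be empty")
    half = sum(ordered) / 2
    running = 0
    for count, length in enumerate(ordered, start=1):
        running += length
        if running >= half:
            return length, count


def gini(values):
    """Gini index of non-negative values (0 = perfectly uniform).

    Uses the sorted-rank form: G = 2*sum(i*x_i) / (n*sum(x)) - (n+1)/n,
    with x sorted ascending and i running from 1 to n.
    """
    xs = sorted(values)
    n = len(xs)
    total = sum(xs)
    if n == 0 or total == 0:
        return 0.0
    weighted = sum(rank * x for rank, x in enumerate(xs, start=1))
    return 2 * weighted / (n * total) - (n + 1) / n


if __name__ == "__main__":
    # Toy inputs for illustration only, not data from the paper.
    print(n50_l50([2, 3, 4, 5, 6]))
    print(gini([10, 12, 9, 11]))
```

Applied to per-block coverage (for example, mean depth in each 100 kbp block), a lower Gini index indicates more even coverage, which is how the abstract compares amplified and pooled DNA.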
Affiliation(s)
- Nickellaus G Roberts
  - Department of Biological Sciences, The University of Alabama, Tuscaloosa, Alabama, USA
- Michael J Gilmore
  - Department of Biological Sciences, The University of Alabama, Tuscaloosa, Alabama, USA
- Kevin M Kocot
  - Department of Biological Sciences, The University of Alabama, Tuscaloosa, Alabama, USA
  - Alabama Museum of Natural History, The University of Alabama, Tuscaloosa, Alabama, USA
2. Lu N, Qiao Y, An P, Luo J, Bi C, Li M, Lu Z, Tu J. Exploration of whole genome amplification generated chimeric sequences in long-read sequencing data. Brief Bioinform 2023; 24:bbad275. [PMID: 37529913 DOI: 10.1093/bib/bbad275] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/20/2023] [Revised: 06/21/2023] [Accepted: 07/10/2023] [Indexed: 08/03/2023] Open
Abstract
MOTIVATION Multiple displacement amplification (MDA) has become the most commonly used method of whole genome amplification, generating a vast amount of DNA with higher molecular weight and greater genome coverage. When coupled with long-read sequencing, it is possible to sequence amplicons over 20 kb in length. However, the formation of chimeric sequences (chimeras, expressed as structural errors in sequencing data) in MDA seriously interferes with bioinformatics analysis, but its influence on long-read sequencing data is unknown. RESULTS We sequenced the phi29 DNA polymerase-mediated MDA amplicons on the PacBio platform and analyzed chimeras within the generated data. The 3rd-ChimeraMiner has been constructed as a pipeline for recognizing and restoring chimeras into the original structures in long-read sequencing data, improving the efficiency of using TGS data. Five long-read datasets and one high-fidelity long-read dataset with various amplification folds were analyzed. The results reveal that mis-priming events in amplification occur more frequently than widely perceived, and the proportion gradually accumulates from 42% to over 78% as the amplification continues. In total, 99.92% of recognized chimeric sequences were demonstrated to be artifacts, whose structures were wrongly formed in MDA instead of existing in original genomes. By restoring chimeras to their original structures, the vast majority of supplementary alignments that introduce false-positive structural variants are recycled, removing 97% of inversions on average and contributing to the analysis of structural variation in MDA-amplified samples. The impact of chimeras in long-read sequencing data analysis should be emphasized, and the 3rd-ChimeraMiner can help to quantify and reduce the influence of chimeras. AVAILABILITY AND IMPLEMENTATION The 3rd-ChimeraMiner is available on GitHub, https://github.com/dulunar/3rdChimeraMiner.
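The abstract ties chimeras to supplementary alignments. As a rough first look at a dataset, one can count how many mapped reads carry at least one supplementary alignment record. The sketch below is a minimal illustration under stated assumptions: it reads plain SAM text lines, relies only on the standard SAM FLAG bits (0x4 unmapped, 0x100 secondary, 0x800 supplementary), and treats the result as a candidate split-read rate. The function name `candidate_chimera_rate` is hypothetical, and this is not the 3rd-ChimeraMiner algorithm, which additionally recognizes and restores chimera structures.

```python
UNMAPPED = 0x4          # SAM FLAG bit: read is unmapped
SECONDARY = 0x100       # SAM FLAG bit: secondary alignment
SUPPLEMENTARY = 0x800   # SAM FLAG bit: supplementary (split) alignment


def candidate_chimera_rate(sam_lines):
    """Fraction of mapped reads with at least one supplementary alignment.

    `sam_lines` is any iterable of SAM text lines (header lines allowed).
    Reads are identified by QNAME, which assumes unpaired long reads.
    """
    mapped_reads = set()
    split_reads = set()
    for line in sam_lines:
        if line.startswith("@"):
            continue  # skip header lines
        fields = line.rstrip("\n").split("\t")
        if len(fields) < 11:
            continue  # not a complete alignment record
        name, flag = fields[0], int(fields[1])
        if flag & UNMAPPED or flag & SECONDARY:
            continue
        mapped_reads.add(name)
        if flag & SUPPLEMENTARY:
            split_reads.add(name)
    if not mapped_reads:
        return 0.0
    return len(split_reads) / len(mapped_reads)


if __name__ == "__main__":
    # Toy records for illustration only.
    demo = [
        "@HD\tVN:1.6",
        "r1\t0\tchr1\t100\t60\t4M\t*\t0\t0\tACGT\t*",
        "r1\t2064\tchr1\t900\t60\t4M\t*\t0\t0\tACGT\t*",
        "r2\t16\tchr2\t500\t60\t4M\t*\t0\t0\tACGT\t*",
        "r3\t4\t*\t0\t0\t*\t*\t0\t0\tACGT\t*",
    ]
    print(candidate_chimera_rate(demo))
```

A split read is only a candidate: true structural variants also produce supplementary alignments, which is why the abstract's pipeline goes on to test whether each chimera is an amplification artifact.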
Affiliation(s)
- Na Lu
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
- Yi Qiao
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
- Pengfei An
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
  - Monash University-Southeast University Joint Research Institute, Suzhou 215123, China
- Jiajian Luo
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
- Changwei Bi
  - College of Information Science and Technology, Nanjing Forestry University, Nanjing 210037, China
- Musheng Li
  - Department of Physiology and Cell Biology, University of Nevada, Reno School of Medicine, Reno, NV 89511, USA
- Zuhong Lu
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
- Jing Tu
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
3. Lu N, Qiao Y, Lu Z, Tu J. Chimera: The spoiler in multiple displacement amplification. Comput Struct Biotechnol J 2023; 21:1688-1696. [PMID: 36879882 PMCID: PMC9984789 DOI: 10.1016/j.csbj.2023.02.034] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 08/25/2022] [Revised: 02/18/2023] [Accepted: 02/18/2023] [Indexed: 02/24/2023] Open
Abstract
Multiple displacement amplification (MDA), based on isothermal random priming and high fidelity phi29 DNA polymerase-mediated processive extension, has revolutionized the field of whole genome amplification by enabling the amplification of minute amounts of DNA, such as from a single cell, generating vast amounts of DNA with high genome coverage. Despite its advantages, MDA has its own challenges, one of the greatest being the formation of chimeric sequences (chimeras), which are present in all MDA products and seriously disturb downstream analysis. In this review, we provide a comprehensive overview of current research on MDA chimeras. We first review the mechanisms of chimera formation and chimera detection methods. We then systematically summarize the characteristics of chimeras, including overlap, chimeric distance, chimeric density, and chimeric rate, as found in independently published sequencing data. Finally, we review the methods used to process chimeric sequences and their impacts on the improvement of data utilization efficiency. The information presented in this review will be useful for those interested in understanding the challenges with MDA and in improving its performance.
Affiliation(s)
- Na Lu
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
- Yi Qiao
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
- Zuhong Lu
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
- Jing Tu
  - State Key Laboratory of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China
4. Hu Y, Jiang Z, Chen K, Zhou Z, Zhou X, Wang Y, Yang J, Zhang B, Wen L, Tang F. scNanoATAC-seq: a long-read single-cell ATAC sequencing method to detect chromatin accessibility and genetic variants simultaneously within an individual cell. Cell Res 2023; 33:83-86. [PMID: 36220860 PMCID: PMC9810643 DOI: 10.1038/s41422-022-00730-x] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Received: 03/25/2022] [Accepted: 09/12/2022] [Indexed: 01/07/2023] Open
Affiliation(s)
- Yuqiong Hu
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
- Zhenhuan Jiang
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
  - PKU-Tsinghua-NIBS Graduate Program, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Kexuan Chen
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
- Zhangxian Zhou
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
  - Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
- Xin Zhou
  - Department of General Surgery, Third Hospital, Peking University, Beijing, China
- Yan Wang
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
- Jingwei Yang
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
- Bo Zhang
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
- Lu Wen
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
- Fuchou Tang
  - Biomedical Pioneering Innovation Center, School of Life Sciences, Peking University, Beijing, China
  - Beijing Advanced Innovation Center for Genomics (ICG), Ministry of Education Key Laboratory of Cell Proliferation and Differentiation, Beijing, China
  - PKU-Tsinghua-NIBS Graduate Program, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
  - Peking-Tsinghua Center for Life Sciences, Academy for Advanced Interdisciplinary Studies, Peking University, Beijing, China
5. Wang X, Liu Y, Liu H, Pan W, Ren J, Zheng X, Tan Y, Chen Z, Deng Y, He N, Chen H, Li S. Recent advances and application of whole genome amplification in molecular diagnosis and medicine. MedComm (Beijing) 2022; 3:e116. [PMID: 35281794 PMCID: PMC8906466 DOI: 10.1002/mco2.116] [Citation(s) in RCA: 15] [Impact Index Per Article: 5.0] [Received: 09/17/2021] [Revised: 01/11/2022] [Accepted: 01/12/2022] [Indexed: 11/30/2022] Open
Abstract
Whole genome amplification (WGA) is a technology for non-selective amplification of the whole genome sequence, first appearing in 1992. Its primary purpose is to amplify and reflect the whole genome of trace tissues and single cells without sequence bias and to provide sufficient DNA template for subsequent multigene and multilocus analysis, along with comprehensive genome research. WGA provides a method to obtain a large amount of genetic information from a small amount of DNA and provides a valuable tool for preserving limited samples in molecular biology. WGA technology is especially suitable for forensic identification and genetic disease research, along with new technologies such as next-generation sequencing (NGS). In addition, WGA is also widely used in single-cell sequencing. Because a single cell contains so little DNA, it often cannot supply the amount of sample needed for sequencing, so WGA is generally used to amplify trace samples. This paper reviews WGA methods based on different principles, summarizes both amplification principles and amplification quality, and discusses the application prospects and challenges of WGA technology in molecular diagnosis and medicine.
Affiliation(s)
- Xiaoyu Wang
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Yapeng Liu
  - School of Early-Childhood Education, Nanjing Xiaozhuang University, Nanjing, China
- Hongna Liu
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Wenjing Pan
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Jie Ren
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Xiangming Zheng
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Yimin Tan
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Zhu Chen
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Yan Deng
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Nongyue He
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
  - State Key Laboratory of Bioelectronics, Southeast University, Nanjing, China
- Hui Chen
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
- Song Li
  - Hunan Key Laboratory of Biomedical Nanomaterials and Devices, Hunan University of Technology, Zhuzhou, China
6. dCITI-Seq: droplet combinational indexed transposon insertion sequencing. Anal Bioanal Chem 2022; 414:2661-2670. [DOI: 10.1007/s00216-022-03902-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/16/2021] [Revised: 01/08/2022] [Accepted: 01/13/2022] [Indexed: 11/25/2022]
7. The mechanism and improvements to the isothermal amplification of nucleic acids, at a glance. Anal Biochem 2021; 631:114260. [PMID: 34023274 DOI: 10.1016/j.ab.2021.114260] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Received: 02/23/2021] [Revised: 05/15/2021] [Accepted: 05/18/2021] [Indexed: 01/08/2023]
Abstract
A comparative review of the most common isothermal methods is provided. In the last two decades, the challenge of using isothermal amplification systems as an alternative to the most extensive and long-standing nucleic acid amplification method, the polymerase chain reaction, has arisen. The main advantage of isothermal amplification is that it requires no expensive laboratory equipment for thermal cycling. Considerable efforts have been made to improve the current techniques of nucleic acid amplification and to develop new approaches based on the main drawbacks of each method. The most important and challenging goal was to achieve a low-cost, straightforward system that is rapid, specific, accurate, and sensitive.
8. Qiao Y, Liu W, Lu N, Xu Z, Tu J, Lu Z. Rapid droplet multiple displacement amplification based on the droplet regeneration strategy. Anal Chim Acta 2021; 1141:173-179. [DOI: 10.1016/j.aca.2020.10.031] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Received: 08/11/2020] [Revised: 10/14/2020] [Accepted: 10/16/2020] [Indexed: 10/23/2022]
9. Long N, Qiao Y, Xu Z, Tu J, Lu Z. Recent advances and application in whole-genome multiple displacement amplification. Quantitative Biology 2020. [DOI: 10.1007/s40484-020-0217-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Indexed: 11/28/2022]
10. Tu J, Chen L, Gao S, Zhang J, Bi C, Tao Y, Lu N, Lu Z. Obtaining Genome Sequences of Mutualistic Bacteria in Single Microcystis Colonies. Int J Mol Sci 2019; 20:ijms20205047. [PMID: 31614621 PMCID: PMC6829522 DOI: 10.3390/ijms20205047] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Received: 09/22/2019] [Revised: 10/09/2019] [Accepted: 10/10/2019] [Indexed: 01/01/2023] Open
Abstract
Cells of Microcystis are associated with heterotrophic bacteria and organized in colonies in the natural environment, which are basic elements in the mass occurrence of cyanobacterial species. Analyzing these colonies using metagenomics helps to understand species composition and relationships. Meanwhile, the difference in population abundance among Microcystis colonies could be used to recover genome bins from metagenome assemblies. Herein, we designed a pipeline to obtain high-quality genomes of mutualistic bacteria from single natural Microcystis colonies. Single colonies were lysed and then amplified using multiple displacement amplification to overcome the DNA quantity limit. A two-step assembly was performed after sequencing, and scaffolds were grouped into putative bins based on their differential coverage among species. We analyzed six natural colonies of three prevailing Microcystis species from Lake Taihu. Clustering results showed that colonies of the same species were similar in microbial community composition. Eight putative population genome bins with wide bacterial diversity and different GC content were identified based on coverage difference among colonies. At the phylum level, Proteobacteria was the most abundant besides Cyanobacteria. Six of the population bins were further refined into nearly complete genomes (completeness > 90%).
Affiliation(s)
- Jing Tu
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
- Liang Chen
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
- Shen Gao
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
- Junyi Zhang
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
  - Wuxi Environmental Monitoring Center, Wuxi 210096, China.
- Changwei Bi
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
- Yuhan Tao
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
- Na Lu
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.
- Zuhong Lu
  - State Key Lab of Bioelectronics, School of Biological Science and Medical Engineering, Southeast University, Nanjing 210096, China.