1
|
Su J, Li S, Zheng Z, Lam TW, Luo R. ClusterV-Web: a user-friendly tool for profiling HIV quasispecies and generating drug resistance reports from nanopore long-read data. Bioinform Adv 2024; 4:vbae006. [PMID: 38282975 PMCID: PMC10812873 DOI: 10.1093/bioadv/vbae006] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/01/2023] [Revised: 12/11/2023] [Accepted: 01/12/2024] [Indexed: 01/30/2024]
Abstract
Summary Third-generation long-read sequencing is an increasingly utilized technique for profiling human immunodeficiency virus (HIV) quasispecies and detecting drug resistance mutations due to its ability to cover the entire viral genome in individual reads. Recently, the ClusterV tool has demonstrated accurate detection of HIV quasispecies from Nanopore long-read sequencing data. However, the need for scripting skills and a computational environment may act as a barrier for many potential users. To address this issue, we have introduced ClusterV-Web, a user-friendly web-based application that enables easy configuration and execution of ClusterV, both remotely and locally. Our tool provides interactive tables and data visualizations to aid in the interpretation of results. This development is expected to democratize access to long-read sequencing data analysis, enabling a wider range of researchers and clinicians to efficiently profile HIV quasispecies and detect drug resistance mutations. Availability and implementation ClusterV-Web is freely available and open source, with detailed documentation accessible at http://www.bio8.cs.hku.hk/ClusterVW/. The standalone Docker image and source code are also available at https://github.com/HKU-BAL/ClusterV-Web.
Collapse
Affiliation(s)
- Junhao Su
- Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| | - Shumin Li
- Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| | - Zhenxian Zheng
- Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| | - Tak-Wah Lam
- Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| | - Ruibang Luo
- Department of Computer Science, The University of Hong Kong, Hong Kong 999077, China
| |
Collapse
|
2
|
Sandler S, Horne RI, Rocchetti S, Novak R, Hsu NS, Castellana Cruz M, Faidon Brotzakis Z, Gregory RC, Chia S, Bernardes GJL, Keyser UF, Vendruscolo M. Multiplexed Digital Characterization of Misfolded Protein Oligomers via Solid-State Nanopores. J Am Chem Soc 2023; 145:25776-25788. [PMID: 37972287 PMCID: PMC10690769 DOI: 10.1021/jacs.3c09335] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/26/2023] [Revised: 10/28/2023] [Accepted: 10/31/2023] [Indexed: 11/19/2023]
Abstract
Misfolded protein oligomers are of central importance in both the diagnosis and treatment of Alzheimer's and Parkinson's diseases. However, accurate high-throughput methods to detect and quantify oligomer populations are still needed. We present here a single-molecule approach for the detection and quantification of oligomeric species. The approach is based on the use of solid-state nanopores and multiplexed DNA barcoding to identify and characterize oligomers from multiple samples. We study α-synuclein oligomers in the presence of several small-molecule inhibitors of α-synuclein aggregation as an illustration of the potential applicability of this method to the development of diagnostic and therapeutic methods for Parkinson's disease.
Collapse
Affiliation(s)
- Sarah
E. Sandler
- Cavendish
Laboratory, Maxwell Centre, Department of Physics, University of Cambridge, Cambridge CB3 0HE, U.K.
| | - Robert I. Horne
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
| | - Sara Rocchetti
- Cavendish
Laboratory, Maxwell Centre, Department of Physics, University of Cambridge, Cambridge CB3 0HE, U.K.
| | - Robert Novak
- Cavendish
Laboratory, Maxwell Centre, Department of Physics, University of Cambridge, Cambridge CB3 0HE, U.K.
| | - Nai-Shu Hsu
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
| | - Marta Castellana Cruz
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
| | - Z. Faidon Brotzakis
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
| | - Rebecca C. Gregory
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
| | - Sean Chia
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
- Bioprocessing
Technology Institute, Agency for Science, Technology and Research
(A*STAR), Singapore 138668
| | - Gonçalo J. L. Bernardes
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
| | - Ulrich F. Keyser
- Cavendish
Laboratory, Maxwell Centre, Department of Physics, University of Cambridge, Cambridge CB3 0HE, U.K.
| | - Michele Vendruscolo
- Centre
for Misfolding Diseases, Yusuf Hamied Department of Chemistry, University of Cambridge, Cambridge CB2 1EW, U.K.
| |
Collapse
|
3
|
Li Y, Sandler SE, Keyser UF, Zhu J. DNA Volume, Topology, and Flexibility Dictate Nanopore Current Signals. Nano Lett 2023; 23:7054-7061. [PMID: 37487050 PMCID: PMC10416563 DOI: 10.1021/acs.nanolett.3c01823] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 05/16/2023] [Revised: 07/13/2023] [Indexed: 07/26/2023]
Abstract
Nanopores have developed into powerful single-molecule sensors capable of identifying and characterizing small polymers, such as DNA, by electrophoretically driving them through a nanoscale pore and monitoring temporary blockades in the ionic pore current. However, the relationship between nanopore signals and the physical properties of DNA remains only partly understood. Herein, we introduce a programmable DNA carrier platform to capture carefully designed DNA nanostructures. Controlled translocation experiments through our glass nanopores allowed us to disentangle this relationship. We vary DNA topology by changing the length, strand duplications, sequence, unpaired nucleotides, and rigidity of the analyte DNA and find that the ionic current drop is mainly determined by the volume and flexibility of the DNA nanostructure in the nanopore. Finally, we use our understanding of the role of DNA topology to discriminate circular single-stranded DNA molecules from linear ones with the same number of nucleotides using the nanopore signal.
Collapse
Affiliation(s)
- Yunxuan Li
- Cavendish
Laboratory, University of Cambridge, JJ Thompson Avenue, Cambridge CB3 0HE, United Kingdom
| | - Sarah E. Sandler
- Cavendish
Laboratory, University of Cambridge, JJ Thompson Avenue, Cambridge CB3 0HE, United Kingdom
| | - Ulrich F. Keyser
- Cavendish
Laboratory, University of Cambridge, JJ Thompson Avenue, Cambridge CB3 0HE, United Kingdom
| | - Jinbo Zhu
- Cavendish
Laboratory, University of Cambridge, JJ Thompson Avenue, Cambridge CB3 0HE, United Kingdom
- School
of Biomedical Engineering, Faculty of Medicine, Dalian University of Technology, No. 2 Linggong Road, Dalian 116024, China
| |
Collapse
|
4
|
Yu H, Zheng Z, Su J, Lam TW, Luo R. Boosting variant-calling performance with multi-platform sequencing data using Clair3-MP. BMC Bioinformatics 2023; 24:308. [PMID: 37537536 PMCID: PMC10401749 DOI: 10.1186/s12859-023-05434-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/07/2023] [Accepted: 07/31/2023] [Indexed: 08/05/2023] Open
Abstract
BACKGROUND With the continuous advances in third-generation sequencing technology and the increasing affordability of next-generation sequencing technology, sequencing data from different sequencing technology platforms is becoming more common. While numerous benchmarking studies have been conducted to compare variant-calling performance across different platforms and approaches, little attention has been paid to the potential of leveraging the strengths of different platforms to optimize overall performance, especially integrating Oxford Nanopore and Illumina sequencing data. RESULTS We investigated the impact of multi-platform data on the performance of variant calling through carefully designed experiments with a deep learning-based variant caller named Clair3-MP (Multi-Platform). Through our research, we not only demonstrated the capability of ONT-Illumina data for improved variant calling, but also identified the optimal scenarios for utilizing ONT-Illumina data. In addition, we revealed that the improvement in variant calling using ONT-Illumina data comes from an improvement in difficult genomic regions, such as the large low-complexity regions and segmental and collapse duplication regions. Moreover, Clair3-MP can incorporate reference genome stratification information to achieve a small but measurable improvement in variant calling. Clair3-MP is accessible as an open-source project at: https://github.com/HKU-BAL/Clair3-MP . CONCLUSIONS These insights have important implications for researchers and practitioners alike, providing valuable guidance for improving the reliability and efficiency of genomic analysis in diverse applications.
Collapse
Affiliation(s)
- Huijing Yu
- Department of Computer Science, The University of Hong Kong, Pok Fu Lam, Hong Kong SAR, China
| | - Zhenxian Zheng
- Department of Computer Science, The University of Hong Kong, Pok Fu Lam, Hong Kong SAR, China
| | - Junhao Su
- Department of Computer Science, The University of Hong Kong, Pok Fu Lam, Hong Kong SAR, China.
| | - Tak-Wah Lam
- Department of Computer Science, The University of Hong Kong, Pok Fu Lam, Hong Kong SAR, China.
| | - Ruibang Luo
- Department of Computer Science, The University of Hong Kong, Pok Fu Lam, Hong Kong SAR, China.
| |
Collapse
|
5
|
Ugolini C, Mulroney L, Leger A, Castelli M, Criscuolo E, Williamson MK, Davidson AD, Almuqrin A, Giambruno R, Jain M, Frigè G, Olsen H, Tzertzinis G, Schildkraut I, Wulf MG, Corrêa IR, Ettwiller L, Clementi N, Clementi M, Mancini N, Birney E, Akeson M, Nicassio F, Matthews D, Leonardi T. Nanopore ReCappable sequencing maps SARS-CoV-2 5' capping sites and provides new insights into the structure of sgRNAs. Nucleic Acids Res 2022; 50:3475-3489. [PMID: 35244721 PMCID: PMC8989550 DOI: 10.1093/nar/gkac144] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/25/2021] [Revised: 02/05/2022] [Accepted: 02/16/2022] [Indexed: 01/09/2023] Open
Abstract
The SARS-CoV-2 virus has a complex transcriptome characterised by multiple, nested subgenomic RNAsused to express structural and accessory proteins. Long-read sequencing technologies such as nanopore direct RNA sequencing can recover full-length transcripts, greatly simplifying the assembly of structurally complex RNAs. However, these techniques do not detect the 5' cap, thus preventing reliable identification and quantification of full-length, coding transcript models. Here we used Nanopore ReCappable Sequencing (NRCeq), a new technique that can identify capped full-length RNAs, to assemble a complete annotation of SARS-CoV-2 sgRNAs and annotate the location of capping sites across the viral genome. We obtained robust estimates of sgRNA expression across cell lines and viral isolates and identified novel canonical and non-canonical sgRNAs, including one that uses a previously un-annotated leader-to-body junction site. The data generated in this work constitute a useful resource for the scientific community and provide important insights into the mechanisms that regulate the transcription of SARS-CoV-2 sgRNAs.
Collapse
Affiliation(s)
- Camilla Ugolini
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia, 20139 Milano, Italy
| | - Logan Mulroney
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia, 20139 Milano, Italy
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
- Biomolecular Engineering Department, UC Santa Cruz, CA 95064, USA
| | - Adrien Leger
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Matteo Castelli
- Laboratory of Microbiology and Virology, Vita-Salute San Raffaele University; via Olgettina 58, 20132 Milan, Italy
| | - Elena Criscuolo
- Laboratory of Microbiology and Virology, Vita-Salute San Raffaele University; via Olgettina 58, 20132 Milan, Italy
| | - Maia Kavanagh Williamson
- School of Cellular and Molecular Medicine, Faculty of Life Sciences, University Walk, University of Bristol, Bristol BS8 1TD, UK
| | - Andrew D Davidson
- School of Cellular and Molecular Medicine, Faculty of Life Sciences, University Walk, University of Bristol, Bristol BS8 1TD, UK
| | - Abdulaziz Almuqrin
- School of Cellular and Molecular Medicine, Faculty of Life Sciences, University Walk, University of Bristol, Bristol BS8 1TD, UK
- Department of Clinical Laboratory Sciences, King Saud University, Riyadh, Saudi Arabia
| | - Roberto Giambruno
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia, 20139 Milano, Italy
| | - Miten Jain
- Biomolecular Engineering Department, UC Santa Cruz, CA 95064, USA
| | - Gianmaria Frigè
- Department of Experimental Oncology, IEO European Institute of Oncology IRCCS, 20139 Milano, Italy
| | - Hugh Olsen
- Biomolecular Engineering Department, UC Santa Cruz, CA 95064, USA
| | | | | | | | | | | | - Nicola Clementi
- Laboratory of Microbiology and Virology, Vita-Salute San Raffaele University; via Olgettina 58, 20132 Milan, Italy
- Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Scientific Institute; via Olgettina 60, 20132 Milan, Italy
| | - Massimo Clementi
- Laboratory of Microbiology and Virology, Vita-Salute San Raffaele University; via Olgettina 58, 20132 Milan, Italy
- Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Scientific Institute; via Olgettina 60, 20132 Milan, Italy
| | - Nicasio Mancini
- Laboratory of Microbiology and Virology, Vita-Salute San Raffaele University; via Olgettina 58, 20132 Milan, Italy
- Laboratory of Medical Microbiology and Virology, IRCCS San Raffaele Scientific Institute; via Olgettina 60, 20132 Milan, Italy
| | - Ewan Birney
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, UK
| | - Mark Akeson
- Biomolecular Engineering Department, UC Santa Cruz, CA 95064, USA
| | - Francesco Nicassio
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia, 20139 Milano, Italy
| | - David A Matthews
- School of Cellular and Molecular Medicine, Faculty of Life Sciences, University Walk, University of Bristol, Bristol BS8 1TD, UK
| | - Tommaso Leonardi
- Center for Genomic Science of IIT@SEMM, Fondazione Istituto Italiano di Tecnologia, 20139 Milano, Italy
| |
Collapse
|
6
|
Canales NA, Pérez-Escobar OA, Powell RF, Töpel M, Kidner C, Nesbitt M, Maldonado C, Barnes CJ, Rønsted N, Przelomska NAS, Leitch IJ, Antonelli A. A highly contiguous, scaffold-level nuclear genome assembly for the fever tree ( Cinchona pubescens Vahl) as a novel resource for Rubiaceae research. GigaByte 2022; 2022:gigabyte71. [PMID: 36950143 PMCID: PMC10027117 DOI: 10.46471/gigabyte.71] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Accepted: 09/29/2022] [Indexed: 11/12/2022] Open
Abstract
The Andean fever tree (Cinchona L.; Rubiaceae) is a source of bioactive quinine alkaloids used to treat malaria. C. pubescens Vahl is a valuable cash crop within its native range in northwestern South America, however, genomic resources are lacking. Here we provide the first highly contiguous and annotated nuclear and plastid genome assemblies using Oxford Nanopore PromethION-derived long-read and Illumina short-read data. Our nuclear genome assembly comprises 603 scaffolds with a total length of 904 Mbp (∼82% of the full genome based on a genome size of 1.1 Gbp/1C). Using a combination of de novo and reference-based transcriptome assemblies we annotated 72,305 coding sequences comprising 83% of the BUSCO gene set and 4.6% fragmented sequences. Using additional plastid and nuclear datasets we place C. pubescens in the Gentianales order. This first genomic resource for C. pubescens opens new research avenues, including the analysis of alkaloid biosynthesis in the fever tree.
Collapse
Affiliation(s)
- Nataly Allasi Canales
- Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
- Royal Botanic Gardens, Kew, Richmond, UK
| | - Oscar A. Pérez-Escobar
- Royal Botanic Gardens, Kew, Richmond, UK
- Gothenburg Global Biodiversity Centre, University of Gothenburg, Gothenburg, Sweden
- Corresponding authors. E-mail: ;
| | | | - Mats Töpel
- University of Gothenburg, Department of Marine Sciences, Gothenburg, Sweden
| | | | | | - Carla Maldonado
- Herbario Nacional de Bolivia, Instituto de Ecología, Universidad Mayor de San Andrés, La Paz, Bolivia
| | | | - Nina Rønsted
- Natural History Museum of Denmark, University of Copenhagen, Copenhagen, Denmark
- National Tropical Botanical Garden, Kalaheo, Hawaii, USA
| | | | - Ilia J. Leitch
- Royal Botanic Gardens, Kew, Richmond, UK
- Corresponding authors. E-mail: ;
| | - Alexandre Antonelli
- Royal Botanic Gardens, Kew, Richmond, UK
- Gothenburg Global Biodiversity Centre, University of Gothenburg, Gothenburg, Sweden
- Department of Plant Sciences, University of Oxford, Oxford, UK
| |
Collapse
|
7
|
Abstract
We describe a method for direct tRNA sequencing using the Oxford Nanopore MinION. The principal technical advance is custom adapters that facilitate end-to-end sequencing of individual transfer RNA (tRNA) molecules at subnanometer precision. A second advance is a nanopore sequencing pipeline optimized for tRNA. We tested this method using purified E. coli tRNAfMet, tRNALys, and tRNAPhe samples. 76-92% of individual aligned tRNA sequence reads were full length. As a proof of concept, we showed that nanopore sequencing detected all 43 expected isoacceptors in total E. coli MRE600 tRNA as well as isodecoders that further define that tRNA population. Alignment-based comparisons between the three purified tRNAs and their synthetic controls revealed systematic nucleotide miscalls that were diagnostic of known modifications. Systematic miscalls were also observed proximal to known modifications in total E. coli tRNA alignments, including a highly conserved pseudouridine in the T loop. This work highlights the potential of nanopore direct tRNA sequencing as well as improvements needed to implement tRNA sequencing for human healthcare applications.
Collapse
|