1
|
Kumari P, Kaur M, Dindhoria K, Ashford B, Amarasinghe SL, Thind AS. Advances in long-read single-cell transcriptomics. Hum Genet 2024:10.1007/s00439-024-02678-x. [PMID: 38787419 DOI: 10.1007/s00439-024-02678-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2023] [Accepted: 05/07/2024] [Indexed: 05/25/2024]
Abstract
Long-read single-cell transcriptomics (scRNA-Seq) is revolutionizing the way we profile heterogeneity in disease. Traditional short-read scRNA-Seq methods are limited in their ability to provide complete transcript coverage, resolve isoforms, and identify novel transcripts. The scRNA-Seq protocols developed for long-read sequencing platforms overcome these limitations by enabling the characterization of full-length transcripts. Long-read scRNA-Seq techniques initially suffered from comparatively poor accuracy compared to short read scRNA-Seq. However, with improvements in accuracy, accessibility, and cost efficiency, long-reads are gaining popularity in the field of scRNA-Seq. This review details the advances in long-read scRNA-Seq, with an emphasis on library preparation protocols and downstream bioinformatics analysis tools.
Collapse
Affiliation(s)
- Pallawi Kumari
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Manmeet Kaur
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Kiran Dindhoria
- Institute of Microbial Technology, Council of Scientific and Industrial Research, Chandigarh, India
| | - Bruce Ashford
- Illawarra Shoalhaven Local Health District (ISLHD), NSW Health, Wollongong, NSW, Australia
| | - Shanika L Amarasinghe
- Monash Biomedical Discovery Institute, Monash University, Clayton, VIC, 3800, Australia
- Walter and Eliza Hall Institute of Medical Research, 1G, Royal Parade, Parkville, VIC, 3025, Australia
| | - Amarinder Singh Thind
- Illawarra Shoalhaven Local Health District (ISLHD), NSW Health, Wollongong, NSW, Australia.
- The School of Chemistry and Molecular Bioscience (SCMB), University of Wollongong, Loftus St, Wollongong, NSW, 2500, Australia.
| |
Collapse
|
2
|
Fares J, Wan Y, Mair R, Price SJ. Molecular diversity in isocitrate dehydrogenase-wild-type glioblastoma. Brain Commun 2024; 6:fcae108. [PMID: 38646145 PMCID: PMC11032202 DOI: 10.1093/braincomms/fcae108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/15/2023] [Revised: 01/15/2024] [Accepted: 03/26/2024] [Indexed: 04/23/2024] Open
Abstract
In the dynamic landscape of glioblastoma, the 2021 World Health Organization Classification of Central Nervous System tumours endeavoured to establish biological homogeneity, yet isocitrate dehydrogenase-wild-type (IDH-wt) glioblastoma persists as a tapestry of clinical and molecular diversity. Intertumoural heterogeneity in IDH-wt glioblastoma presents a formidable challenge in treatment strategies. Recent strides in genetics and molecular biology have enhanced diagnostic precision, revealing distinct subtypes and invasive patterns that influence survival in patients with IDH-wt glioblastoma. Genetic and molecular biomarkers, such as the overexpression of neurofibromin 1, phosphatase and tensin homolog and/or cyclin-dependent kinase inhibitor 2A, along with specific immune cell abundance and neurotransmitters, correlate with favourable outcomes. Conversely, increased expression of epidermal growth factor receptor tyrosine kinase, platelet-derived growth factor receptor alpha and/or vascular endothelial growth factor receptor, coupled with the prevalence of glioma stem cells, tumour-associated myeloid cells, regulatory T cells and exhausted effector cells, signifies an unfavourable prognosis. The methylation status of O6-methylguanine-DNA methyltransferase and the influence of microenvironmental factors and neurotransmitters further shape treatment responses. Understanding intertumoural heterogeneity is complemented by insights into intratumoural dynamics and cellular interactions within the tumour microenvironment. Glioma stem cells and immune cell composition significantly impact progression and outcomes, emphasizing the need for personalized therapies targeting pro-tumoural signalling pathways and resistance mechanisms. A successful glioblastoma management demands biomarker identification, combination therapies and a nuanced approach considering intratumoural variability. These advancements herald a transformative era in glioblastoma comprehension and treatment.
Collapse
Affiliation(s)
- Jawad Fares
- Academic Neurosurgery Division, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
- Cambridge Brain Tumour Imaging Laboratory, Division of Neurosurgery, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
- Department of Neurological Surgery, Feinberg School of Medicine, Northwestern University, Chicago, IL 60611, USA
| | - Yizhou Wan
- Academic Neurosurgery Division, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
- Cambridge Brain Tumour Imaging Laboratory, Division of Neurosurgery, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Richard Mair
- Academic Neurosurgery Division, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
| | - Stephen J Price
- Academic Neurosurgery Division, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
- Cambridge Brain Tumour Imaging Laboratory, Division of Neurosurgery, Department of Clinical Neurosciences, University of Cambridge, Cambridge, CB2 0QQ, UK
| |
Collapse
|
3
|
Gunter HM, Youlten SE, Reis ALM, McCubbin T, Madala BS, Wong T, Stevanovski I, Cipponi A, Deveson IW, Santini NS, Kummerfeld S, Croucher PI, Marcellin E, Mercer TR. A universal molecular control for DNA, mRNA and protein expression. Nat Commun 2024; 15:2480. [PMID: 38509097 PMCID: PMC10954659 DOI: 10.1038/s41467-024-46456-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2022] [Accepted: 02/28/2024] [Indexed: 03/22/2024] Open
Abstract
The expression of genes encompasses their transcription into mRNA followed by translation into protein. In recent years, next-generation sequencing and mass spectrometry methods have profiled DNA, RNA and protein abundance in cells. However, there are currently no reference standards that are compatible across these genomic, transcriptomic and proteomic methods, and provide an integrated measure of gene expression. Here, we use synthetic biology principles to engineer a multi-omics control, termed pREF, that can act as a universal molecular standard for next-generation sequencing and mass spectrometry methods. The pREF sequence encodes 21 synthetic genes that can be in vitro transcribed into spike-in mRNA controls, and in vitro translated to generate matched protein controls. The synthetic genes provide qualitative controls that can measure sensitivity and quantitative accuracy of DNA, RNA and peptide detection. We demonstrate the use of pREF in metagenome DNA sequencing and RNA sequencing experiments and evaluate the quantification of proteins using mass spectrometry. Unlike previous spike-in controls, pREF can be independently propagated and the synthetic mRNA and protein controls can be sustainably prepared by recipient laboratories using common molecular biology techniques. Together, this provides a universal synthetic standard able to integrate genomic, transcriptomic and proteomic methods.
Collapse
Affiliation(s)
- Helen M Gunter
- Australian Institute of Bioengineering and Nanotechnology, The University of Queensland, Brisbane, Queensland, Australia
- BASE mRNA Facility, The University of Queensland, Brisbane, Queensland, Australia
- ARC Centre of Excellence in Synthetic Biology, The University of Queensland, Brisbane, Queensland, Australia
| | - Scott E Youlten
- Department of Genetics, Yale University School of Medicine, New Haven, CT, 06510, USA
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney, New South Wales, Australia
| | - Andre L M Reis
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Sydney, New South Wales, Australia
- School of Electrical and Information Engineering, University of Sydney, Sydney, New South Wales, Australia
| | - Tim McCubbin
- Australian Institute of Bioengineering and Nanotechnology, The University of Queensland, Brisbane, Queensland, Australia
- ARC Centre of Excellence in Synthetic Biology, The University of Queensland, Brisbane, Queensland, Australia
| | - Bindu Swapna Madala
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Sydney, New South Wales, Australia
| | - Ted Wong
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
| | - Igor Stevanovski
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Sydney, New South Wales, Australia
| | - Arcadi Cipponi
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney, New South Wales, Australia
| | - Ira W Deveson
- Genomics and Inherited Disease Program, Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- Centre for Population Genomics, Garvan Institute of Medical Research and Murdoch Children's Research Institute, Sydney, New South Wales, Australia
- School of Electrical and Information Engineering, University of Sydney, Sydney, New South Wales, Australia
| | - Nadia S Santini
- Centro Nacional de Investigación Disciplinaria en Conservación y Mejoramiento de Ecosistemas Forestales, INIFAP, Ciudad de México, 04010, Mexico
| | - Sarah Kummerfeld
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney, New South Wales, Australia
| | - Peter I Croucher
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney, New South Wales, Australia
| | - Esteban Marcellin
- Australian Institute of Bioengineering and Nanotechnology, The University of Queensland, Brisbane, Queensland, Australia
- ARC Centre of Excellence in Synthetic Biology, The University of Queensland, Brisbane, Queensland, Australia
| | - Tim R Mercer
- Australian Institute of Bioengineering and Nanotechnology, The University of Queensland, Brisbane, Queensland, Australia.
- BASE mRNA Facility, The University of Queensland, Brisbane, Queensland, Australia.
- ARC Centre of Excellence in Synthetic Biology, The University of Queensland, Brisbane, Queensland, Australia.
- Garvan Institute of Medical Research, Sydney, New South Wales, Australia.
| |
Collapse
|
4
|
Holmes MJ, Mahjour B, Castro CP, Farnum GA, Diehl AG, Boyle AP. HaplotagLR: An efficient and configurable utility for haplotagging long reads. PLoS One 2024; 19:e0298688. [PMID: 38478504 PMCID: PMC10936807 DOI: 10.1371/journal.pone.0298688] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 01/30/2024] [Indexed: 03/17/2024] Open
Abstract
Understanding the functional effects of sequence variation is crucial in genomics. Individual human genomes contain millions of variants that contribute to phenotypic variability and disease risks at the population level. Because variants rarely act in isolation, we must consider potential interactions of neighboring variants to accurately predict functional effects. We can accomplish this using haplotagging, which matches sequencing reads to their parental haplotypes using alleles observed at known heterozygous variants. However, few published tools for haplotagging exist and these share several technical and usability-related shortcomings that limit applicability, in particular a lack of insight or control over error rates, and lack of key metrics on the underlying sources of haplotagging error. Here we present HaplotagLR: a user-friendly tool that haplotags long sequencing reads based on a multinomial model and existing phased variant lists. HaplotagLR is user-configurable and includes a basic error model to control the empirical FDR in its output. We show that HaplotagLR outperforms the leading haplotagging method in simulated datasets, especially at high levels of specificity, and displays 7% greater sensitivity in haplotagging real data. HaplotagLR advances both the immediate utility of haplotagging and paves the way for further improvements to this important method.
Collapse
Affiliation(s)
- Monica J. Holmes
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Babak Mahjour
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Christopher P. Castro
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Gregory A. Farnum
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Adam G. Diehl
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
| | - Alan P. Boyle
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, Michigan, United States of America
- Department of Human Genetics, University of Michigan, Ann Arbor, Michigan, United States of America
| |
Collapse
|
5
|
Groot Koerkamp R, Ivanov P. Exact global alignment using A* with chaining seed heuristic and match pruning. Bioinformatics 2024; 40:btae032. [PMID: 38265119 PMCID: PMC10932610 DOI: 10.1093/bioinformatics/btae032] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2023] [Revised: 11/14/2023] [Accepted: 01/20/2024] [Indexed: 01/25/2024] Open
Abstract
MOTIVATION Sequence alignment has been at the core of computational biology for half a century. Still, it is an open problem to design a practical algorithm for exact alignment of a pair of related sequences in linear-like time. RESULTS We solve exact global pairwise alignment with respect to edit distance by using the A* shortest path algorithm. In order to efficiently align long sequences with high divergence, we extend the recently proposed seed heuristic with match chaining, gap costs, and inexact matches. We additionally integrate the novel match pruning technique and diagonal transition to improve the A* search. We prove the correctness of our algorithm, implement it in the A*PA aligner, and justify our extensions intuitively and empirically. On random sequences of divergence d=4% and length n, the empirical runtime of A*PA scales near-linearly with length (best fit n1.06, n≤107 bp). A similar scaling remains up to d=12% (best fit n1.24, n≤107 bp). For n=107 bp and d=4%, A*PA reaches >500× speedup compared to the leading exact aligners Edlib and BiWFA. The performance of A*PA is highly influenced by long gaps. On long (n>500kb) ONT reads of a human sample it efficiently aligns sequences with d<10%, leading to 3× median speedup compared to Edlib and BiWFA. When the sequences come from different human samples, A*PA performs 1.7× faster than Edlib and BiWFA. AVAILABILITY AND IMPLEMENTATION github.com/RagnarGrootKoerkamp/astar-pairwise-aligner.
Collapse
Affiliation(s)
- Ragnar Groot Koerkamp
- Department of Computer Science, ETH Zurich, Rämistrasse 101, Zurich 8092, Switzerland
| | - Pesho Ivanov
- Department of Computer Science, ETH Zurich, Rämistrasse 101, Zurich 8092, Switzerland
| |
Collapse
|
6
|
Jiménez P, Muñoz M, Cruz-Saavedra L, Camargo A, Ramírez JD. Blastocystis genetic diversity in animal and human samples from different departments of Colombia using complete sequencing of the 18S rRNA gene (SSU rRNA) by Oxford Nanopore Technologies (ONT). Acta Trop 2024; 249:107090. [PMID: 38048971 DOI: 10.1016/j.actatropica.2023.107090] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 11/27/2023] [Accepted: 12/01/2023] [Indexed: 12/06/2023]
Abstract
Blastocystis is an intestinal microeukaryote that has raised attention due to its wide distribution in animals and humans. The risk of zoonotic circulation primarily arises from close contact with infected animals. Therefore, the following study aimed to evaluate the diversity and frequency of Blastocystis subtypes in Colombian human and animal samples using complete sequencing of the 18S rRNA gene. For this purpose, 341 human stool samples and 277 animal fecal samples (from cattle, sheep, goat, pigs, cats, and dogs), were collected from different Colombian regions and analyzed using PCR-based detection and full-length 18S SSU rRNA gene Next-Generation Sequencing (NGS). Among the 618 samples from both hosts, humans and animals, the results revealed widespread Blastocystis frequency, with 48.09% (n = 164) in humans and 31.4% (n = 87) detection in animals. Dogs, cats, sheep, pigs, and wild animals tested positive, aligning with global prevalence patterns. Also, 29 human samples and 23 animal samples were sequenced using ONT technology from which 11 long-read unique sequences were generated and cluster with their compared reference sequences. The subtype distribution varied within hosts, detecting ST1 and ST3 in both human and animal samples. Subtypes ST5, ST10, ST14, ST15, ST21, ST24, ST25 and ST26 were limited to animals hosts, some of which are considered to have zoonotic potential. On the other hand, ST2 was found exclusively in human samples from Bolivar region. Mixed infections occurred in both animal and humans, 60.86% and 27.58% respectively. Moreover, to our knowledge, this is the first study in Colombia identifying ST15 in pigs and ST25 in sheep. The subtypes (STs) identified in this study indicate that certain animals may serve as reservoirs with the potential for zoonotic transmission. The identification of zoonotic subtypes highlights the use of Next Generation Sequencing as the depth and resolution of the sequences increases providing insights into STs of medical and veterinarian significance. It also reveals the coexistence of diverse subtypes among hosts. Further research is essential for understanding transmission dynamics, health implications, and detection strategies for Blastocystis occurrence in animals and humans, mainly associated to the role of animals as reservoirs and their close interaction with humans.
Collapse
Affiliation(s)
- Paula Jiménez
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Marina Muñoz
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Lissa Cruz-Saavedra
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia
| | - Anny Camargo
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia; Health Sciences Faculty, Universidad de Boyacá, Tunja, Colombia
| | - Juan David Ramírez
- Centro de Investigaciones en Microbiología y Biotecnología-UR (CIMBIUR), Facultad de Ciencias Naturales, Universidad del Rosario, Bogotá, Colombia; Molecular Microbiology Laboratory, Department of Pathology, Molecular and Cell-based Medicine, Icahn School of Medicine at Mount Sinai, New York, NY, United States.
| |
Collapse
|
7
|
Chamchoy K, Sudsumrit S, Wongwigkan J, Petmitr S, Songdej D, Adams ER, Edwards T, Leartsakulpanich U, Boonyuen U. Molecular characterization of G6PD mutations identifies new mutations and a high frequency of intronic variants in Thai females. PLoS One 2023; 18:e0294200. [PMID: 37967096 PMCID: PMC10651042 DOI: 10.1371/journal.pone.0294200] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2023] [Accepted: 10/26/2023] [Indexed: 11/17/2023] Open
Abstract
Glucose-6-phosphate dehydrogenase (G6PD) deficiency is an X-linked enzymopathy caused by mutations in the G6PD gene. A medical concern associated with G6PD deficiency is acute hemolytic anemia induced by certain foods, drugs, and infections. Although phenotypic tests can correctly identify hemizygous males, as well as homozygous and compound heterozygous females, heterozygous females with a wide range of G6PD activity may be misclassified as normal. This study aimed to develop multiplex high-resolution melting (HRM) analyses to enable the accurate detection of G6PD mutations, especially among females with heterozygous deficiency. Multiplex HRM assays were developed to detect six G6PD variants, i.e., G6PD Gaohe (c.95A>G), G6PD Chinese-4 (c.392G>T), G6PD Mahidol (c.487G>A), G6PD Viangchan (c.871G>A), G6PD Chinese-5 (c.1024C>T), and G6PD Union (c.1360C>T) in two reactions. The assays were validated and then applied to genotype G6PD mutations in 248 Thai females. The sensitivity of the HRM assays developed was 100% [95% confidence interval (CI): 94.40%-100%] with a specificity of 100% (95% CI: 88.78%-100%) for detecting these six mutations. The prevalence of G6PD deficiency was estimated as 3.63% (9/248) for G6PD deficiency and 31.05% (77/248) for intermediate deficiency by phenotypic assay. The developed HRM assays identified three participants with normal enzyme activity as heterozygous for G6PD Viangchan. Interestingly, a deletion in intron 5 nucleotide position 637/638 (c.486-34delT) was also detected by the developed HRM assays. G6PD genotyping revealed a total of 12 G6PD genotypes, with a high prevalence of intronic variants. Our results suggested that HRM analysis-based genotyping is a simple and reliable approach for detecting G6PD mutations, and could be used to prevent the misdiagnosis of heterozygous females by phenotypic assay. This study also sheds light on the possibility of overlooking intronic variants, which could affect G6PD expression and contribute to enzyme deficiency.
Collapse
Affiliation(s)
- Kamonwan Chamchoy
- Princess Srisavangavadhana College of Medicine, Chulabhorn Royal Academy, Bangkok, Thailand
| | - Sirapapha Sudsumrit
- Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Jutamas Wongwigkan
- Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Songsak Petmitr
- Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| | - Duantida Songdej
- Department of Pediatrics, Faculty of Medicine Ramathibodi Hospital, Mahidol University, Bangkok, Thailand
| | - Emily R. Adams
- Centre for Drugs and Diagnostics Research, Liverpool School of Tropical Medicine, Liverpool, United Kingdom
| | - Thomas Edwards
- Centre for Drugs and Diagnostics Research, Liverpool School of Tropical Medicine, Liverpool, United Kingdom
| | - Ubolsree Leartsakulpanich
- National Center for Genetic Engineering and Biotechnology, National Science and Technology Development Agency, Pathum Thani, Thailand
| | - Usa Boonyuen
- Department of Molecular Tropical Medicine and Genetics, Faculty of Tropical Medicine, Mahidol University, Bangkok, Thailand
| |
Collapse
|
8
|
Pagnamenta AT, Camps C, Giacopuzzi E, Taylor JM, Hashim M, Calpena E, Kaisaki PJ, Hashimoto A, Yu J, Sanders E, Schwessinger R, Hughes JR, Lunter G, Dreau H, Ferla M, Lange L, Kesim Y, Ragoussis V, Vavoulis DV, Allroggen H, Ansorge O, Babbs C, Banka S, Baños-Piñero B, Beeson D, Ben-Ami T, Bennett DL, Bento C, Blair E, Brasch-Andersen C, Bull KR, Cario H, Cilliers D, Conti V, Davies EG, Dhalla F, Dacal BD, Dong Y, Dunford JE, Guerrini R, Harris AL, Hartley J, Hollander G, Javaid K, Kane M, Kelly D, Kelly D, Knight SJL, Kreins AY, Kvikstad EM, Langman CB, Lester T, Lines KE, Lord SR, Lu X, Mansour S, Manzur A, Maroofian R, Marsden B, Mason J, McGowan SJ, Mei D, Mlcochova H, Murakami Y, Németh AH, Okoli S, Ormondroyd E, Ousager LB, Palace J, Patel SY, Pentony MM, Pugh C, Rad A, Ramesh A, Riva SG, Roberts I, Roy N, Salminen O, Schilling KD, Scott C, Sen A, Smith C, Stevenson M, Thakker RV, Twigg SRF, Uhlig HH, van Wijk R, Vona B, Wall S, Wang J, Watkins H, Zak J, Schuh AH, Kini U, Wilkie AOM, Popitsch N, Taylor JC. Structural and non-coding variants increase the diagnostic yield of clinical whole genome sequencing for rare diseases. Genome Med 2023; 15:94. [PMID: 37946251 PMCID: PMC10636885 DOI: 10.1186/s13073-023-01240-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 09/27/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND Whole genome sequencing is increasingly being used for the diagnosis of patients with rare diseases. However, the diagnostic yields of many studies, particularly those conducted in a healthcare setting, are often disappointingly low, at 25-30%. This is in part because although entire genomes are sequenced, analysis is often confined to in silico gene panels or coding regions of the genome. METHODS We undertook WGS on a cohort of 122 unrelated rare disease patients and their relatives (300 genomes) who had been pre-screened by gene panels or arrays. Patients were recruited from a broad spectrum of clinical specialties. We applied a bioinformatics pipeline that would allow comprehensive analysis of all variant types. We combined established bioinformatics tools for phenotypic and genomic analysis with our novel algorithms (SVRare, ALTSPLICE and GREEN-DB) to detect and annotate structural, splice site and non-coding variants. RESULTS Our diagnostic yield was 43/122 cases (35%), although 47/122 cases (39%) were considered solved when considering novel candidate genes with supporting functional data into account. Structural, splice site and deep intronic variants contributed to 20/47 (43%) of our solved cases. Five genes that are novel, or were novel at the time of discovery, were identified, whilst a further three genes are putative novel disease genes with evidence of causality. We identified variants of uncertain significance in a further fourteen candidate genes. The phenotypic spectrum associated with RMND1 was expanded to include polymicrogyria. Two patients with secondary findings in FBN1 and KCNQ1 were confirmed to have previously unidentified Marfan and long QT syndromes, respectively, and were referred for further clinical interventions. Clinical diagnoses were changed in six patients and treatment adjustments made for eight individuals, which for five patients was considered life-saving. CONCLUSIONS Genome sequencing is increasingly being considered as a first-line genetic test in routine clinical settings and can make a substantial contribution to rapidly identifying a causal aetiology for many patients, shortening their diagnostic odyssey. We have demonstrated that structural, splice site and intronic variants make a significant contribution to diagnostic yield and that comprehensive analysis of the entire genome is essential to maximise the value of clinical genome sequencing.
Collapse
Affiliation(s)
- Alistair T Pagnamenta
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Carme Camps
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Edoardo Giacopuzzi
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Human Technopole, Viale Rita Levi Montalcini 1, 20157, Milan, Italy
| | - John M Taylor
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Mona Hashim
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Eduardo Calpena
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Pamela J Kaisaki
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Akiko Hashimoto
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Jing Yu
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Edward Sanders
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Ron Schwessinger
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Jim R Hughes
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- University Medical Center Groningen, Groningen University, PO Box 72, 9700 AB, Groningen, The Netherlands
| | - Helene Dreau
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Matteo Ferla
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Lukas Lange
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Yesim Kesim
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Vassilis Ragoussis
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Dimitrios V Vavoulis
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Holger Allroggen
- Neurosciences Department, UHCW NHS Trust, Clifford Bridge Road, Coventry, CV2 2DX, UK
| | - Olaf Ansorge
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Christian Babbs
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Siddharth Banka
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Oxford Road, Manchester, M13 9WL, UK
| | - Benito Baños-Piñero
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - David Beeson
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Tal Ben-Ami
- Pediatric Hematology-Oncology Unit, Kaplan Medical Center, Rehovot, Israel
| | - David L Bennett
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Celeste Bento
- Hematology Department, Hospitais da Universidade de Coimbra, Coimbra, Portugal
| | - Edward Blair
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Charlotte Brasch-Andersen
- Department of Clinical Genetics, Odense University Hospital and Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Katherine R Bull
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
| | - Holger Cario
- Department of Pediatrics and Adolescent Medicine, University Medical Center, Eythstrasse 24, 89075, Ulm, Germany
| | - Deirdre Cilliers
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Valerio Conti
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - E Graham Davies
- Department of Immunology, Great Ormond Street Hospital for Children NHS Trust and UCL Great Ormond Street Institute of Child Health, Zayed Centre for Research, 2Nd Floor, 20C Guilford Street, London, WC1N 1DZ, UK
| | - Fatima Dhalla
- Department of Paediatrics, Institute of Developmental and Regenerative Medicine, IMS-Tetsuya Nakamura Building, Old Road Campus, Roosevelt Drive, Oxford, OX3 7TY, UK
| | - Beatriz Diez Dacal
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Yin Dong
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - James E Dunford
- Oxford NIHR Musculoskeletal BRC and Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Old Road, Oxford, OX3 7HE, UK
| | - Renzo Guerrini
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - Adrian L Harris
- Department of Oncology, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
| | - Jane Hartley
- Liver Unit, Birmingham Women's & Children's Hospital and University of Birmingham, Steelhouse Lane, Birmingham, B4 6NH, UK
| | - Georg Hollander
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Kassim Javaid
- Oxford NIHR Musculoskeletal BRC and Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Old Road, Oxford, OX3 7HE, UK
| | - Maureen Kane
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Pharmacy Hall North, Room 731, 20 N. Pine Street, Baltimore, MD, 21201, USA
| | - Deirdre Kelly
- Liver Unit, Birmingham Women's & Children's Hospital and University of Birmingham, Steelhouse Lane, Birmingham, B4 6NH, UK
| | - Dominic Kelly
- Children's Hospital, OUH NHS Foundation Trust, NIHR Oxford BRC, Headley Way, Oxford, OX3 9DU, UK
| | - Samantha J L Knight
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Alexandra Y Kreins
- Department of Immunology, Great Ormond Street Hospital for Children NHS Trust and UCL Great Ormond Street Institute of Child Health, Zayed Centre for Research, 2Nd Floor, 20C Guilford Street, London, WC1N 1DZ, UK
| | - Erika M Kvikstad
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Craig B Langman
- Feinberg School of Medicine, Northwestern University, 211 E Chicago Avenue, Chicago, IL, MS37, USA
| | - Tracy Lester
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Kate E Lines
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Simon R Lord
- Early Phase Clinical Trials Unit, Department of Oncology, University of Oxford, Cancer and Haematology Centre, Level 2 Administration Area, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Xin Lu
- Nuffield Department of Clinical Medicine, Ludwig Institute for Cancer Research, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
| | - Sahar Mansour
- St George's University Hospitals NHS Foundation Trust, Blackshore Road, Tooting, London, SW17 0QT, UK
| | - Adnan Manzur
- MRC Centre for Neuromuscular Diseases, National Hospital for Neurology and Neurosurgery, Queen Square, London, WC1N 3BG, UK
| | - Reza Maroofian
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, WC1N 3BG, UK
| | - Brian Marsden
- Nuffield Department of Medicine, Kennedy Institute, University of Oxford, Oxford, OX3 7BN, UK
| | - Joanne Mason
- Yourgene Health Headquarters, Skelton House, Lloyd Street North, Manchester Science Park, Manchester, M15 6SH, UK
| | - Simon J McGowan
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Davide Mei
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - Hana Mlcochova
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Yoshiko Murakami
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Andrea H Németh
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Steven Okoli
- Imperial College NHS Trust, Department of Haematology, Hammersmith Hospital, Du Cane Road, London, W12 0HS, UK
| | - Elizabeth Ormondroyd
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Level 6 West Wing, Oxford, OX3 9DU, JR, UK
| | - Lilian Bomme Ousager
- Department of Clinical Genetics, Odense University Hospital and Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Jacqueline Palace
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Smita Y Patel
- Clinical Immunology, John Radcliffe Hospital, Level 4A, Oxford, OX3 9DU, UK
| | - Melissa M Pentony
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Chris Pugh
- Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
| | - Aboulfazl Rad
- Department of Otolaryngology-Head & Neck Surgery, Tübingen Hearing Research Centre, Eberhard Karls University, Elfriede-Aulhorn-Str. 5, 72076, Tübingen, Germany
| | - Archana Ramesh
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Simone G Riva
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Irene Roberts
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Noémi Roy
- Department of Haematology, Oxford University Hospitals NHS Foundation Trust, Level 4, Haematology, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Outi Salminen
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Kyleen D Schilling
- Ann & Robert H. Lurie Children's Hospital of Chicago, 225 E Chicago Avenue, Chicago, IL, 60611, USA
| | - Caroline Scott
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Arjune Sen
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Conrad Smith
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Mark Stevenson
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Rajesh V Thakker
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Stephen R F Twigg
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Holm H Uhlig
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
- Translational Gastroenterology Unit, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Richard van Wijk
- UMC Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
| | - Barbara Vona
- Department of Otolaryngology-Head & Neck Surgery, Tübingen Hearing Research Centre, Eberhard Karls University, Elfriede-Aulhorn-Str. 5, 72076, Tübingen, Germany
- Institute of Human Genetics, University Medical Center Göttingen, Heinrich-Düker-Weg 12, 37073, Göttingen, Germany
- Institute for Auditory Neuroscience and InnerEarLab, University Medical Center Göttingen, Robert-Koch-Str. 40, 37075, Göttingen, Germany
| | - Steven Wall
- Oxford Craniofacial Unit, John Radcliffe Hospital, Level LG1, West Wing, Oxford, OX3 9DU, UK
| | - Jing Wang
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Hugh Watkins
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Level 6 West Wing, Oxford, OX3 9DU, JR, UK
| | - Jaroslav Zak
- Nuffield Department of Clinical Medicine, Ludwig Institute for Cancer Research, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
- Department of Immunology and Microbiology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA, 92037, USA
| | - Anna H Schuh
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Usha Kini
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Andrew O M Wilkie
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Niko Popitsch
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Biochemistry and Cell Biology, Max Perutz Labs, University of Vienna, Vienna BioCenter(VBC), Dr.-Bohr-Gasse 9, 1030, Vienna, Austria
| | - Jenny C Taylor
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK.
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK.
| |
Collapse
|
9
|
Hansen MH, Cédile O, Kjeldsen MLG, Thomassen M, Preiss B, von Neuhoff N, Abildgaard N, Nyvold CG. Toward Cytogenomics: Technical Assessment of Long-Read Nanopore Whole-Genome Sequencing for Detecting Large Chromosomal Alterations in Mantle Cell Lymphoma. J Mol Diagn 2023; 25:796-805. [PMID: 37683892 DOI: 10.1016/j.jmoldx.2023.08.004] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2022] [Revised: 06/20/2023] [Accepted: 08/14/2023] [Indexed: 09/10/2023] Open
Abstract
The current advances and success of next-generation sequencing hold the potential for the transition of cancer cytogenetics toward comprehensive cytogenomics. However, the conventional use of short reads impedes the resolution of chromosomal aberrations. Thus, this study evaluated the detection and reproducibility of extensive copy number alterations and chromosomal translocations using long-read Oxford Nanopore Technologies whole-genome sequencing compared with short-read Illumina sequencing. Using the mantle cell lymphoma cell line Granta-519, almost 99% copy-number reproducibility at the 100-kilobase resolution between replicates was demonstrated, with 98% concordance to Illumina. Collectively, the performance of copy number calling from 1.5 million to 7.5 million long reads was comparable to 1 billion Illumina-based reads (50× coverage). Expectedly, the long-read resolution of canonical translocation t(11;14)(q13;q32) was superior, with a sequence similarity of 89% to the already published CCND1/IGH junction (9× coverage), spanning up to 69 kilobases. The cytogenetic profile of Granta-519 was in general agreement with the literature and karyotype, although several differences remained unresolved. In conclusion, contemporary long-read sequencing is primed for future cytogenomics or sequencing-guided cytogenetics. The combined strength of long- and short-read sequencing is apparent, where the high-precision junctional mapping complements and splits paired-end reads. The potential is emphasized by the flexible single-sample genomic data acquisition of Oxford Nanopore Technologies with the high resolution of allelic imbalances using Illumina short-read sequencing.
Collapse
Affiliation(s)
- Marcus H Hansen
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark.
| | - Oriane Cédile
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark; OPEN, Odense Patient Data Explorative Network, Odense University Hospital, Odense, Denmark
| | - Marie L G Kjeldsen
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark
| | - Mads Thomassen
- Department of Clinical Genetics, Odense University Hospital, Odense, Denmark
| | - Birgitte Preiss
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Pathology, Odense University Hospital, Odense, Denmark
| | - Nils von Neuhoff
- Department of Pediatric Hematology and Oncology, Essen University Hospital and University of Duisburg-Essen, Essen, Germany
| | - Niels Abildgaard
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark
| | - Charlotte G Nyvold
- Hematology-Pathology Research Laboratory, Research Unit of Hematology and Research Unit of Pathology, University of Southern Denmark and Odense University Hospital, Odense, Denmark; Department of Hematology, Odense University Hospital, Odense, Denmark; OPEN, Odense Patient Data Explorative Network, Odense University Hospital, Odense, Denmark
| |
Collapse
|
10
|
Liu D, Steinegger M. Block Aligner: an adaptive SIMD-accelerated aligner for sequences and position-specific scoring matrices. Bioinformatics 2023; 39:btad487. [PMID: 37535681 PMCID: PMC10457662 DOI: 10.1093/bioinformatics/btad487] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2022] [Revised: 06/10/2023] [Accepted: 08/01/2023] [Indexed: 08/05/2023] Open
Abstract
MOTIVATION Efficiently aligning sequences is a fundamental problem in bioinformatics. Many recent algorithms for computing alignments through Smith-Waterman-Gotoh dynamic programming (DP) exploit Single Instruction Multiple Data (SIMD) operations on modern CPUs for speed. However, these advances have largely ignored difficulties associated with efficiently handling complex scoring matrices or large gaps (insertions or deletions). RESULTS We propose a new SIMD-accelerated algorithm called Block Aligner for aligning nucleotide and protein sequences against other sequences or position-specific scoring matrices. We introduce a new paradigm that uses blocks in the DP matrix that greedily shift, grow, and shrink. This approach allows regions of the DP matrix to be adaptively computed. Our algorithm reaches over 5-10 times faster than some previous methods while incurring an error rate of less than 3% on protein and long read datasets, despite large gaps and low sequence identities. AVAILABILITY AND IMPLEMENTATION Our algorithm is implemented for global, local, and X-drop alignments. It is available as a Rust library (with C bindings) at https://github.com/Daniel-Liu-c0deb0t/block-aligner.
Collapse
Affiliation(s)
- Daniel Liu
- University of California Los Angeles, Los Angeles, CA, United States
| | - Martin Steinegger
- School of Biological Sciences, Artificial Intelligence Institute, Institute of Molecular Biology and Genetics, Seoul National University, Seoul, South Korea
| |
Collapse
|
11
|
de la Morena-Barrio B, Palomo Á, Padilla J, Martín-Fernández L, Rojo-Carrillo JJ, Cifuentes R, Bravo-Pérez C, Garrido-Rodríguez P, Miñano A, Rubio AM, Pagán J, Llamas M, Vicente V, Vidal F, Lozano ML, Corral J, de la Morena-Barrio ME. Impact of genetic structural variants in factor XI deficiency: identification, accurate characterization, and inferred mechanism by long-read sequencing. J Thromb Haemost 2023; 21:1779-1788. [PMID: 36940803 DOI: 10.1016/j.jtha.2023.03.009] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/23/2022] [Revised: 03/07/2023] [Accepted: 03/09/2023] [Indexed: 03/23/2023]
Abstract
BACKGROUND Congenital factor XI (FXI) deficiency is a probably underestimated coagulopathy that confers antithrombotic protection. Characterization of genetic defects in F11 is mainly focused on the identification of single-nucleotide variants and small insertion/deletions because they represent up to 99% of the alterations accounting for factor deficiency, with only 3 gross gene defects of structural variants (SVs) having been described. OBJECTIVES To identify and characterize the SVs affecting F11. METHODS The study was performed in 93 unrelated subjects with FXI deficiency recruited in Spanish hospitals over a period of 25 years (1997-2022). F11 was analyzed by next-generation sequencing, multiplex ligand probe amplification, and long-read sequencing. RESULTS Our study identified 30 different genetic variants. Interestingly, we found 3 SVs, all heterozygous: a complex duplication affecting exons 8 and 9, a tandem duplication of exon 14, and a large deletion affecting the whole gene. Nucleotide resolution obtained by long-read sequencing revealed Alu repetitive elements involved in all breakpoints. The large deletion was probably generated de novo in the paternal allele during gametogenesis, and despite affecting 30 additional genes, no syndromic features were described. CONCLUSION SVs may account for a high proportion of F11 genetic defects implicated in the molecular pathology of congenital FXI deficiency. These SVs, likely caused by a nonallelic homologous recombination involving repetitive elements, are heterogeneous in both type and length and may be de novo. These data support the inclusion of methods to detect SVs in this disorder, with long-read-based methods being the most appropriate because they detect all SVs and achieve adequate nucleotide resolution.
Collapse
Affiliation(s)
- Belén de la Morena-Barrio
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Ángeles Palomo
- Servicio de Hematología y Hemoterapia del centro Materno-Infantil del Hospital Regional Universitario Carlos de Haya, Málaga, Spain
| | - José Padilla
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Laura Martín-Fernández
- Laboratori de Coagulopaties Congènites, Banc de Sang i Teixits, Barcelona, Spain; Medicina Transfusional. Vall d'Hebron Institut de Recerca, Universitat Autònoma de Barcelona, Barcelona, Spain
| | - Juan José Rojo-Carrillo
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Rosa Cifuentes
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Carlos Bravo-Pérez
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Pedro Garrido-Rodríguez
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Antonia Miñano
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Ana María Rubio
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Javier Pagán
- Servicio de Medicina Interna, Hospital Universitario Morales Meseguer, Murcia, Spain
| | - María Llamas
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Vicente Vicente
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Francisco Vidal
- Laboratori de Coagulopaties Congènites, Banc de Sang i Teixits, Barcelona, Spain; Medicina Transfusional. Vall d'Hebron Institut de Recerca, Universitat Autònoma de Barcelona, Barcelona, Spain; Centro de Investigación Biomédica en Red de Enfermedades Cardiovasculares, Instituto de Salud Carlos III, Madrid, Spain
| | - María Luisa Lozano
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain
| | - Javier Corral
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain.
| | - María Eugenia de la Morena-Barrio
- Servicio de Hematología, Hospital Universitario Morales Meseguer, Centro Regional de Hemodonación, Universidad de Murcia, Instituto Murciano de Investigación Biosanitaria-Pascual Parrilla, Centro de Investigación Biomédica en Red de Enfermedades Raras-Instituto de Salud Carlos III, Murcia, Spain.
| |
Collapse
|
12
|
Affiliation(s)
- Bruna Gomes
- From the Departments of Medicine, Genetics, and Biomedical Data Science, Stanford University, Stanford, CA (B.G., E.A.A.); and the Department of Cardiology, Pneumology, and Angiology, Heidelberg University Hospital, Heidelberg, Germany (B.G.)
| | - Euan A Ashley
- From the Departments of Medicine, Genetics, and Biomedical Data Science, Stanford University, Stanford, CA (B.G., E.A.A.); and the Department of Cardiology, Pneumology, and Angiology, Heidelberg University Hospital, Heidelberg, Germany (B.G.)
| |
Collapse
|
13
|
Dong T, Wang M, Liu J, Ma P, Pang S, Liu W, Liu A. Diagnostics and analysis of SARS-CoV-2: current status, recent advances, challenges and perspectives. Chem Sci 2023; 14:6149-6206. [PMID: 37325147 PMCID: PMC10266450 DOI: 10.1039/d2sc06665c] [Citation(s) in RCA: 11] [Impact Index Per Article: 11.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/02/2022] [Accepted: 05/03/2023] [Indexed: 06/17/2023] Open
Abstract
The disastrous spread of severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has induced severe public healthcare issues and weakened the global economy significantly. Although SARS-CoV-2 infection is not as fatal as the initial outbreak, many infected victims suffer from long COVID. Therefore, rapid and large-scale testing is critical in managing patients and alleviating its transmission. Herein, we review the recent advances in techniques to detect SARS-CoV-2. The sensing principles are detailed together with their application domains and analytical performances. In addition, the advantages and limits of each method are discussed and analyzed. Besides molecular diagnostics and antigen and antibody tests, we also review neutralizing antibodies and emerging SARS-CoV-2 variants. Further, the characteristics of the mutational locations in the different variants with epidemiological features are summarized. Finally, the challenges and possible strategies are prospected to develop new assays to meet different diagnostic needs. Thus, this comprehensive and systematic review of SARS-CoV-2 detection technologies may provide insightful guidance and direction for developing tools for the diagnosis and analysis of SARS-CoV-2 to support public healthcare and effective long-term pandemic management and control.
Collapse
Affiliation(s)
- Tao Dong
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University 308 Ningxia Road Qingdao 266071 China
- School of Pharmacy, Medical College, Qingdao University 308 Ningxia Road Qingdao 266071 China
| | - Mingyang Wang
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University 308 Ningxia Road Qingdao 266071 China
| | - Junchong Liu
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University 308 Ningxia Road Qingdao 266071 China
| | - Pengxin Ma
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University 308 Ningxia Road Qingdao 266071 China
| | - Shuang Pang
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University 308 Ningxia Road Qingdao 266071 China
| | - Wanjian Liu
- Qingdao Hightop Biotech Co., Ltd 369 Hedong Road, Hi-tech Industrial Development Zone Qingdao 266112 China
| | - Aihua Liu
- Institute for Chemical Biology & Biosensing, College of Life Sciences, Qingdao University 308 Ningxia Road Qingdao 266071 China
| |
Collapse
|
14
|
Yanagi I, Akahori R, Takeda KI. Dwell Time Prolongation and Identification of Single Nucleotides Passing through a Solid-State Nanopore by Using Ammonium Sulfate Aqueous Solution. ACS OMEGA 2023; 8:21285-21292. [PMID: 37332803 PMCID: PMC10268630 DOI: 10.1021/acsomega.3c02703] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 04/20/2023] [Accepted: 05/22/2023] [Indexed: 06/20/2023]
Abstract
The ionic current blockades when poly(dT)60 or dNTPs passed through SiN nanopores in an aqueous solution containing (NH4)2SO4 were investigated. The dwell time of poly(dT)60 in the nanopores in an aqueous solution containing (NH4)2SO4 was significantly longer compared to that in an aqueous solution that did not contain (NH4)2SO4. This dwell time prolongation effect due to the aqueous solution containing (NH4)2SO4 was also confirmed when dCTP passed through the nanopores. In addition, when the nanopores were fabricated via dielectric breakdown in the aqueous solution containing (NH4)2SO4, the dwell time prolongation effect for dCTP still occurred even after the aqueous solution was displaced with the aqueous solution without (NH4)2SO4. Furthermore, we measured the ionic current blockades when the four types of dNTPs passed through the same nanopore, and the four types of dNTPs could be statistically identified according to their current blockade values.
Collapse
Affiliation(s)
- Itaru Yanagi
- Center
for Exploratory Research, Research & Development Group, Hitachi, Ltd., 1-280, Higashi-koigakubo, Kokubunji, Tokyo 185-8603, Japan
| | - Rena Akahori
- Center
for Technology Innovation - Healthcare, Research & Development
Group, Hitachi, Ltd., 1-280, Higashi-koigakubo, Kokubunji, Tokyo 185-8603, Japan
| | - Ken-ichi Takeda
- Center
for Technology Innovation - Healthcare, Research & Development
Group, Hitachi, Ltd., 1-280, Higashi-koigakubo, Kokubunji, Tokyo 185-8603, Japan
| |
Collapse
|
15
|
Weissensteiner MH, Cremona MA, Guiblet WM, Stoler N, Harris RS, Cechova M, Eckert KA, Chiaromonte F, Huang YF, Makova KD. Accurate sequencing of DNA motifs able to form alternative (non-B) structures. Genome Res 2023; 33:907-922. [PMID: 37433640 PMCID: PMC10519405 DOI: 10.1101/gr.277490.122] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 05/04/2023] [Indexed: 07/13/2023]
Abstract
Approximately 13% of the human genome at certain motifs have the potential to form noncanonical (non-B) DNA structures (e.g., G-quadruplexes, cruciforms, and Z-DNA), which regulate many cellular processes but also affect the activity of polymerases and helicases. Because sequencing technologies use these enzymes, they might possess increased errors at non-B structures. To evaluate this, we analyzed error rates, read depth, and base quality of Illumina, Pacific Biosciences (PacBio) HiFi, and Oxford Nanopore Technologies (ONT) sequencing at non-B motifs. All technologies showed altered sequencing success for most non-B motif types, although this could be owing to several factors, including structure formation, biased GC content, and the presence of homopolymers. Single-nucleotide mismatch errors had low biases in HiFi and ONT for all non-B motif types but were increased for G-quadruplexes and Z-DNA in all three technologies. Deletion errors were increased for all non-B types but Z-DNA in Illumina and HiFi, as well as only for G-quadruplexes in ONT. Insertion errors for non-B motifs were highly, moderately, and slightly elevated in Illumina, HiFi, and ONT, respectively. Additionally, we developed a probabilistic approach to determine the number of false positives at non-B motifs depending on sample size and variant frequency, and applied it to publicly available data sets (1000 Genomes, Simons Genome Diversity Project, and gnomAD). We conclude that elevated sequencing errors at non-B DNA motifs should be considered in low-read-depth studies (single-cell, ancient DNA, and pooled-sample population sequencing) and in scoring rare variants. Combining technologies should maximize sequencing accuracy in future studies of non-B DNA.
Collapse
Affiliation(s)
| | - Marzia A Cremona
- Department of Operations and Decision Systems, Université Laval, Quebec, Quebec G1V0A6, Canada
- Population Health and Optimal Health Practices, CHU de Québec-Université Laval Research Center, Québec, Quebec G1V4G2, Canada
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Wilfried M Guiblet
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Laboratory of Cell Biology, NCI-CCR, National Institutes of Health, Bethesda, Maryland 20892, USA
| | - Nicholas Stoler
- Department of Biochemistry and Molecular Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Robert S Harris
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Monika Cechova
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Faculty of Informatics, Masaryk University, 60200 Brno, Czech Republic
| | - Kristin A Eckert
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Pathology, The Pennsylvania State University, College of Medicine, Hershey, Pennsylvania 17033, USA
| | - Francesca Chiaromonte
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Department of Statistics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Institute of Economics and L'EMbeDS, Sant'Anna School of Advanced Studies, Pisa 56127, Italy
| | - Yi-Fei Huang
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| | - Kateryna D Makova
- Department of Biology, The Pennsylvania State University, University Park, Pennsylvania 16802, USA;
- Center for Medical Genomics, The Pennsylvania State University, University Park, Pennsylvania 16802, USA
| |
Collapse
|
16
|
Zhao N, Yin G, Liu C, Zhang W, Shen Y, Wang D, Lin Z, Yang J, Mao J, Guo R, Zhang Y, Wang F, Liu Z, Lu X, Liu L. Critically short telomeres derepress retrotransposons to promote genome instability in embryonic stem cells. Cell Discov 2023; 9:45. [PMID: 37130870 PMCID: PMC10154409 DOI: 10.1038/s41421-023-00538-y] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/21/2022] [Accepted: 03/08/2023] [Indexed: 05/04/2023] Open
Abstract
Telomeres, at the ends of chromosomes, protect chromosomes from fusion and preserve genomic stability. However, the molecular mechanisms underlying telomere attrition-induced genome instability remain to be understood. We systematically analyzed the expression of retrotransposons and performed genomic sequencing of different cell and tissue types with telomeres of varying lengths due to telomerase deficiency. We found that critically short telomeres altered retrotransposon activity to promote genomic instability in mouse embryonic stem cells, as evidenced by elevated numbers of single nucleotide variants, indels and copy number variations (CNVs). Transpositions of retrotransposons such as LINE1 resulting from the short telomeres can also be found in these genomes with elevated number of mutations and CNVs. Retrotransposon activation is linked to increased chromatin accessibility, and reduced heterochromatin abundance correlates with short telomeres. Re-elongation of telomeres upon recovery of telomerase partly represses retrotransposons and heterochromatin accumulation. Together, our findings suggest a potential mechanism by which telomeres maintain genomic stability by suppressing chromatin accessibility and retrotransposon activity.
Collapse
Affiliation(s)
- Nannan Zhao
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- Frontiers Science Center for Cell Responses, College of Life Sciences, Nankai University, Tianjin, China
| | - Guoxing Yin
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- Frontiers Science Center for Cell Responses, College of Life Sciences, Nankai University, Tianjin, China
| | - Chun Liu
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- Frontiers Science Center for Cell Responses, College of Life Sciences, Nankai University, Tianjin, China
| | - Weiyu Zhang
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- College of Pharmacy, Nankai University, Tianjin, China
| | - Yang Shen
- Genome Institute of Singapore, Singapore, Singapore
| | - Dan Wang
- 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Department of Immunology, Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Zhenzhen Lin
- 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Department of Immunology, Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Jiao Yang
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- Frontiers Science Center for Cell Responses, College of Life Sciences, Nankai University, Tianjin, China
| | - Jian Mao
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- Frontiers Science Center for Cell Responses, College of Life Sciences, Nankai University, Tianjin, China
| | - Renpeng Guo
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- Frontiers Science Center for Cell Responses, College of Life Sciences, Nankai University, Tianjin, China
| | - Yongwang Zhang
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China
- College of Pharmacy, Nankai University, Tianjin, China
| | - Feng Wang
- Department of Genetics, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China
| | - Zhe Liu
- 2011 Collaborative Innovation Center of Tianjin for Medical Epigenetics, Tianjin Key Laboratory of Medical Epigenetics, Department of Immunology, Biochemistry and Molecular Biology, School of Basic Medical Sciences, Tianjin Medical University, Tianjin, China.
| | - Xinyi Lu
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China.
- College of Pharmacy, Nankai University, Tianjin, China.
| | - Lin Liu
- State Key Laboratory of Medicinal Chemical Biology, Nankai University, Tianjin, China.
- Frontiers Science Center for Cell Responses, College of Life Sciences, Nankai University, Tianjin, China.
- Haihe Laboratory of Cell Ecosystem, Chinese Academy of Medical Sciences & Peking Union Medical College, Tianjin, China.
- Institute of Translational Medicine, Tianjin Union Medical Center, Nankai University, Tianjin, China.
| |
Collapse
|
17
|
Loaiza JR, Bennett KL, Miller MJ, De León LF. Unraveling the genomic complexity of sylvatic mosquitoes in changing Neotropical environments. Curr Opin Biotechnol 2023; 81:102944. [PMID: 37099930 DOI: 10.1016/j.copbio.2023.102944] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/24/2023] [Revised: 03/10/2023] [Accepted: 03/20/2023] [Indexed: 04/28/2023]
Abstract
Sylvatic New World mosquitoes (e.g. Old-growth Forest species) can transmit viruses among non-human primates. This could be a continuous source of viral cycling and spillover events from animals to humans, particularly in changing environments. However, most species of Neotropical sylvatic mosquitoes (genera Aedes, Haemagogus, and Sabethes), which include vector and non-vector species, currently lack genomic resources because there is no reliable and accurate approach for creating de novo reference genomes for these insects. This is a major knowledge gap in the biology of these mosquitoes, restricting our ability to predict and mitigate the emergence and spread of novel arboviruses in Neotropical regions. We discuss recent advances and potential solutions for generating hybrid de novo assemblies from vector and non-vector species using pools of consanguineous offspring. We also discussed research opportunities likely to emerge from these genomic resources.
Collapse
Affiliation(s)
- Jose R Loaiza
- Centro de Biodiversidad y Descubrimiento de Drogas, Instituto de Investigaciones Científicas y Servicios de Alta Tecnología (INDICASAT AIP), Ciudad del Saber, Clayton 0843-01103, Republic of Panama.
| | - Kelly L Bennett
- Wellcome Sanger Institute, Hinxton, Cambridge CB10 1SA, United Kingdom
| | - Matthew J Miller
- RENECO International Wildlife Consultants LLC, Al Reem Island, Abu Dhabi, UAE
| | - Luis F De León
- Centro de Biodiversidad y Descubrimiento de Drogas, Instituto de Investigaciones Científicas y Servicios de Alta Tecnología (INDICASAT AIP), Ciudad del Saber, Clayton 0843-01103, Republic of Panama; Department of Biology, University of Massachusetts Boston, Boston, MA 02125, USA
| |
Collapse
|
18
|
Weitao T, Grandinetti G, Guo P. Revolving ATPase motors as asymmetrical hexamers in translocating lengthy dsDNA via conformational changes and electrostatic interactions in phi29, T7, herpesvirus, mimivirus, E. coli, and Streptomyces. EXPLORATION (BEIJING, CHINA) 2023; 3:20210056. [PMID: 37324034 PMCID: PMC10191066 DOI: 10.1002/exp.20210056] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/17/2021] [Accepted: 04/28/2022] [Indexed: 06/17/2023]
Abstract
Investigations of the parallel architectures of biomotors in both prokaryotic and eukaryotic systems suggest a similar revolving mechanism in the use of ATP to drive translocation of the lengthy double-stranded (ds)DNA genomes. This mechanism is exemplified by the dsDNA packaging motor of bacteriophage phi29 that operates through revolving but not rotating dsDNA to "Push through a one-way valve". This unique and novel revolving mechanism discovered in phi29 DNA packaging motor was recently reported in other systems including the dsDNA packaging motor of herpesvirus, the dsDNA ejecting motor of bacteriophage T7, the plasmid conjugation machine TraB in Streptomyces, the dsDNA translocase FtsK of gram-negative bacteria, and the genome-packaging motor in mimivirus. These motors exhibit an asymmetrical hexameric structure for transporting the genome via an inch-worm sequential action. This review intends to delineate the revolving mechanism from a perspective of conformational changes and electrostatic interactions. In phi29, the positively charged residues Arg-Lys-Arg in the N-terminus of the connector bind the negatively charged interlocking domain of pRNA. ATP binding to an ATPase subunit induces the closed conformation of the ATPase. The ATPase associates with an adjacent subunit to form a dimer facilitated by the positively charged arginine finger. The ATP-binding induces a positive charging on its DNA binding surface via an allostery mechanism and thus the higher affinity for the negatively charged dsDNA. ATP hydrolysis induces an expanded conformation of the ATPase with a lower affinity for dsDNA due to the change of the surface charge, but the (ADP+Pi)-bound subunit in the dimer undergoes a conformational change that repels dsDNA. The positively charged lysine rings of the connector attract dsDNA stepwise and periodically to keep its revolving motion along the channel wall, thus maintaining the one-way translocation of dsDNA without reversal and sliding out. The finding of the presence of the asymmetrical hexameric architectures of many ATPases that use the revolving mechanism may provide insights into the understanding of translocation of the gigantic genomes including chromosomes in complicated systems without coiling and tangling to speed up dsDNA translocation and save energy.
Collapse
Affiliation(s)
- Tao Weitao
- UT Southwestern Medical CenterCenter for the Genetics of Host DefenseDallasTXUSA
- College of Science and MathematicsSouthwest Baptist UniversityBolivarMOUSA
| | - Giovanna Grandinetti
- Center for Electron Microscopy and AnalysisThe Ohio State UniversityColumbusOHUSA
| | - Peixuan Guo
- Center for RNA Nanobiotechnology and NanomedicineDivision of Pharmaceutics and Pharmacology, College of PharmacyDorothy M. Davis Heart and Lung Research Institute, James Comprehensive Cancer Center, College of MedicineThe Ohio State UniversityColumbusOHUSA
| |
Collapse
|
19
|
Siwy ZS, Bruening ML, Howorka S. Nanopores: synergy from DNA sequencing to industrial filtration - small holes with big impact. Chem Soc Rev 2023; 52:1983-1994. [PMID: 36794856 DOI: 10.1039/d2cs00894g] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/17/2023]
Abstract
Nanopores in thin membranes play important roles in science and industry. Single nanopores have provided a step-change in portable DNA sequencing and understanding nanoscale transport while multipore membranes facilitate food processing and purification of water and medicine. Despite the unifying use of nanopores, the fields of single nanopores and multipore membranes differ - to varying degrees - in terms of materials, fabrication, analysis, and applications. Such a partial disconnect hinders scientific progress as important challenges are best resolved together. This Viewpoint suggests how synergistic crosstalk between the two fields can provide considerable mutual benefits in fundamental understanding and the development of advanced membranes. We first describe the main differences including the atomistic definition of single pores compared to the less defined conduits in multipore membranes. We then outline steps to improve communication between the two fields such as harmonizing measurements and modelling of transport and selectivity. The resulting insight is expected to improve the rational design of porous membranes. The Viewpoint concludes with an outlook of other developments that can be best achieved by collaboration across the two fields to advance the understanding of transport in nanopores and create next-generation porous membranes tailored for sensing, filtration, and other applications.
Collapse
Affiliation(s)
- Zuzanna S Siwy
- Department of Physics and Astronomy, University of California, Irvine, USA.
| | - Merlin L Bruening
- Department of Chemical and Biomolecular Engineering, University of Notre Dame, USA.
| | - Stefan Howorka
- Department of Chemistry, Institute of Structural Molecular Biology, University College London, UK.
| |
Collapse
|
20
|
Bracamonte AG. Current Advances in Nanotechnology for the Next Generation of Sequencing (NGS). BIOSENSORS 2023; 13:260. [PMID: 36832027 PMCID: PMC9954403 DOI: 10.3390/bios13020260] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/29/2022] [Revised: 02/03/2023] [Accepted: 02/09/2023] [Indexed: 06/18/2023]
Abstract
This communication aims at discussing strategies based on developments from nanotechnology focused on the next generation of sequencing (NGS). In this regard, it should be noted that even in the advanced current situation of many techniques and methods accompanied with developments of technology, there are still existing challenges and needs focused on real samples and low concentrations of genomic materials. The approaches discussed/described adopt spectroscopical techniques and new optical setups. PCR bases are introduced to understand the role of non-covalent interactions by discussing about Nobel prizes related to genomic material detection. The review also discusses colorimetric methods, polymeric transducers, fluorescence detection methods, enhanced plasmonic techniques such as metal-enhanced fluorescence (MEF), semiconductors, and developments in metamaterials. In addition, nano-optics, challenges linked to signal transductions, and how the limitations reported in each technique could be overcome are considered in real samples. Accordingly, this study shows developments where optical active nanoplatforms generate signal detection and transduction with enhanced performances and, in many cases, enhanced signaling from single double-stranded deoxyribonucleic acid (DNA) interactions. Future perspectives on miniaturized instrumentation, chips, and devices aimed at detecting genomic material are analyzed. However, the main concept in this report derives from gained insights into nanochemistry and nano-optics. Such concepts could be incorporated into other higher-sized substrates and experimental and optical setups.
Collapse
Affiliation(s)
- Angel Guillermo Bracamonte
- Instituto de Investigaciones en Físicoquímica de Córdoba (INFIQC), Departamento de Química Orgánica, Facultad de Ciencias Químicas, Universidad Nacional de Córdoba, Ciudad Universitaria, 5000 Córdoba, Argentina; or
- Departement de Chimie et Centre d’Optique, Photonique et Laser (COPL), Université Laval, Québec, QC G1V 0A6, Canada
| |
Collapse
|
21
|
Marco-Sola S, Eizenga JM, Guarracino A, Paten B, Garrison E, Moreto M. Optimal gap-affine alignment in O(s) space. Bioinformatics 2023; 39:7030690. [PMID: 36749013 PMCID: PMC9940620 DOI: 10.1093/bioinformatics/btad074] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Revised: 01/02/2023] [Indexed: 02/08/2023] Open
Abstract
MOTIVATION Pairwise sequence alignment remains a fundamental problem in computational biology and bioinformatics. Recent advances in genomics and sequencing technologies demand faster and scalable algorithms that can cope with the ever-increasing sequence lengths. Classical pairwise alignment algorithms based on dynamic programming are strongly limited by quadratic requirements in time and memory. The recently proposed wavefront alignment algorithm (WFA) introduced an efficient algorithm to perform exact gap-affine alignment in O(ns) time, where s is the optimal score and n is the sequence length. Notwithstanding these bounds, WFA's O(s2) memory requirements become computationally impractical for genome-scale alignments, leading to a need for further improvement. RESULTS In this article, we present the bidirectional WFA algorithm, the first gap-affine algorithm capable of computing optimal alignments in O(s) memory while retaining WFA's time complexity of O(ns). As a result, this work improves the lowest known memory bound O(n) to compute gap-affine alignments. In practice, our implementation never requires more than a few hundred MBs aligning noisy Oxford Nanopore Technologies reads up to 1 Mbp long while maintaining competitive execution times. AVAILABILITY AND IMPLEMENTATION All code is publicly available at https://github.com/smarco/BiWFA-paper. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Santiago Marco-Sola
- Computer Sciences Department, Barcelona Supercomputing Center, Barcelona 08034, Spain.,Departament d'Arquitectura de Computadors i Sistemes Operatius, Universitat Autònoma de Barcelona, Barcelona 08193, Spain
| | - Jordan M Eizenga
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Andrea Guarracino
- Genomics Research Centre, Human Technopole, Milan 20157, Italy.,Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Benedict Paten
- Genomics Institute, University of California Santa Cruz, Santa Cruz, CA 95064, USA
| | - Erik Garrison
- Department of Genetics, Genomics and Informatics, University of Tennessee Health Science Center, Memphis, TN 38163, USA
| | - Miquel Moreto
- Computer Sciences Department, Barcelona Supercomputing Center, Barcelona 08034, Spain.,Departament d'Arquitectura de Computadors, Universitat Politècnica de Catalunya, Barcelona 08034, Spain
| |
Collapse
|
22
|
Holmes MJ, Mahjour B, Castro CP, Farnum GA, Diehl AG, Boyle AP. LRphase: an efficient method for assigning haplotype identity to long reads. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.01.18.524565. [PMID: 36712073 PMCID: PMC9882277 DOI: 10.1101/2023.01.18.524565] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 01/22/2023]
Abstract
Understanding the functional effects of sequence variation is among the primary goals of contemporary genomics. Individual human genomes contain millions of variants which are thought to contribute to phenotypic variability and differential disease risks at the population level. However, because variants rarely act in isolation, we cannot accurately predict functional effects without first considering the potential effects of other interacting variants on the same chromosome. This information can be obtained by phasing the read data from sequencing experiments. However, no standalone tools are available to simply phase reads based on known haplotypes. Here we present LRphase: a user-friendly utility for simple phasing of long sequencing reads.
Collapse
Affiliation(s)
- Monica J. Holmes
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA 48109
| | - Babak Mahjour
- Department of Medicinal Chemistry, University of Michigan, Ann Arbor, MI, USA 48109
| | - Christopher P. Castro
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA 48109
| | - Gregory A. Farnum
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA 48109
| | - Adam G. Diehl
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA 48109
| | - Alan P. Boyle
- Department of Computational Medicine and Bioinformatics, University of Michigan, Ann Arbor, MI, USA 48109
- Department of Human Genetics, University of Michigan, Ann Arbor, MI, USA 48109
| |
Collapse
|
23
|
Udine E, Jain A, van Blitterswijk M. Advances in sequencing technologies for amyotrophic lateral sclerosis research. Mol Neurodegener 2023; 18:4. [PMID: 36635726 PMCID: PMC9838075 DOI: 10.1186/s13024-022-00593-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Accepted: 12/23/2022] [Indexed: 01/14/2023] Open
Abstract
Amyotrophic lateral sclerosis (ALS) is caused by upper and lower motor neuron loss and has a fairly rapid disease progression, leading to fatality in an average of 2-5 years after symptom onset. Numerous genes have been implicated in this disease; however, many cases remain unexplained. Several technologies are being used to identify regions of interest and investigate candidate genes. Initial approaches to detect ALS genes include, among others, linkage analysis, Sanger sequencing, and genome-wide association studies. More recently, next-generation sequencing methods, such as whole-exome and whole-genome sequencing, have been introduced. While those methods have been particularly useful in discovering new ALS-linked genes, methodological advances are becoming increasingly important, especially given the complex genetics of ALS. Novel sequencing technologies, like long-read sequencing, are beginning to be used to uncover the contribution of repeat expansions and other types of structural variation, which may help explain missing heritability in ALS. In this review, we discuss how popular and/or upcoming methods are being used to discover ALS genes, highlighting emerging long-read sequencing platforms and their role in aiding our understanding of this challenging disease.
Collapse
Affiliation(s)
- Evan Udine
- grid.417467.70000 0004 0443 9942Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road S, Jacksonville, FL 32224 USA ,grid.417467.70000 0004 0443 9942Mayo Clinic Graduate School of Biomedical Sciences, 4500 San Pablo Road S, Jacksonville, FL 32224 USA
| | - Angita Jain
- grid.417467.70000 0004 0443 9942Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road S, Jacksonville, FL 32224 USA ,grid.417467.70000 0004 0443 9942Mayo Clinic Graduate School of Biomedical Sciences, 4500 San Pablo Road S, Jacksonville, FL 32224 USA ,grid.417467.70000 0004 0443 9942Center for Clinical and Translational Sciences, Mayo Clinic, 4500 San Pablo Road S, Jacksonville, FL 32224 USA
| | - Marka van Blitterswijk
- Department of Neuroscience, Mayo Clinic, 4500 San Pablo Road S, Jacksonville, FL, 32224, USA.
| |
Collapse
|
24
|
Tsukiyama S, Hasan MM, Kurata H. CNN6mA: Interpretable neural network model based on position-specific CNN and cross-interactive network for 6mA site prediction. Comput Struct Biotechnol J 2022; 21:644-654. [PMID: 36659917 PMCID: PMC9826936 DOI: 10.1016/j.csbj.2022.12.043] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/07/2022] [Revised: 12/26/2022] [Accepted: 12/27/2022] [Indexed: 12/29/2022] Open
Abstract
N6-methyladenine (6mA) plays a critical role in various epigenetic processing including DNA replication, DNA repair, silencing, transcription, and diseases such as cancer. To understand such epigenetic mechanisms, 6 mA has been detected by high-throughput technologies on a genome-wide scale at single-base resolution, together with conventional methods such as immunoprecipitation, mass spectrometry and capillary electrophoresis, but these experimental approaches are time-consuming and laborious. To complement these problems, we have developed a CNN-based 6 mA site predictor, named CNN6mA, which proposed two new architectures: a position-specific 1-D convolutional layer and a cross-interactive network. In the position-specific 1-D convolutional layer, position-specific filters with different window sizes were applied to an inquiry sequence instead of sharing the same filters over all positions in order to extract the position-specific features at different levels. The cross-interactive network explored the relationships between all the nucleotide patterns within the inquiry sequence. Consequently, CNN6mA outperformed the existing state-of-the-art models in many species and created the contribution score vector that intelligibly interpret the prediction mechanism. The source codes and web application in CNN6mA are freely accessible at https://github.com/kuratahiroyuki/CNN6mA.git and http://kurata35.bio.kyutech.ac.jp/CNN6mA/, respectively.
Collapse
Key Words
- 6mA, N6-methyladenine
- AUCs, Area under the curves
- BERT, Bidirectional Encoder Representations from Transformers
- CNN
- CNN, Convolutional neural network
- DNA modification
- Deep learning
- Interpretable prediction
- LSTM, Long short-term memory
- MCC, Matthews correlation coefficient
- Machine learning
- N6-methyladenine
- RF, Random forest
- SMRT, Single-molecule real-time
- SN, Sensitivity
- SP, Specificity
- UMAP, Uniform manifold approximation and projection
- t-SNE, t-distributed stochastic neighbor embedding
Collapse
Affiliation(s)
- Sho Tsukiyama
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680–4 Kawazu, Iizuka, Fukuoka 820-8502, Japan
| | - Md Mehedi Hasan
- Tulane Center for Aging and Department of Medicine, Tulane University Health Sciences Center, New Orleans, LA 70112, USA
| | - Hiroyuki Kurata
- Department of Bioscience and Bioinformatics, Kyushu Institute of Technology, 680–4 Kawazu, Iizuka, Fukuoka 820-8502, Japan,Corresponding author.
| |
Collapse
|
25
|
A narrative review of cancer molecular diagnostics: past, present, and future. JOURNAL OF BIO-X RESEARCH 2022. [DOI: 10.1097/jbr.0000000000000136] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
|
26
|
Ono Y, Hamada M, Asai K. PBSIM3: a simulator for all types of PacBio and ONT long reads. NAR Genom Bioinform 2022; 4:lqac092. [PMID: 36465498 PMCID: PMC9713900 DOI: 10.1093/nargab/lqac092] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2022] [Revised: 11/02/2022] [Accepted: 11/12/2022] [Indexed: 12/03/2022] Open
Abstract
Long-read sequencers, such as Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) sequencers, have improved their read length and accuracy, thereby opening up unprecedented research. Many tools and algorithms have been developed to analyze long reads, and rapid progress in PacBio and ONT has further accelerated their development. Together with the development of high-throughput sequencing technologies and their analysis tools, many read simulators have been developed and effectively utilized. PBSIM is one of the popular long-read simulators. In this study, we developed PBSIM3 with three new functions: error models for long reads, multi-pass sequencing for high-fidelity read simulation and transcriptome sequencing simulation. Therefore, PBSIM3 is now able to meet a wide range of long-read simulation requirements.
Collapse
Affiliation(s)
- Yukiteru Ono
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 277-8561, Japan
| | - Michiaki Hamada
- Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), 63-520, 3-4-1, Okubo, Shinjuku-ku, Tokyo 169-8555, Japan
- Institute for Medical-Oriented Structural Biology, Waseda University, 2-2, Wakamatsu-cho, Shinjuku-ku, Tokyo 162-8480, Japan
- Graduate School of Medicine, Nippon Medical School, 1-1-5, Sendagi, Bunkyo-ku, Tokyo, 113-8602, Japan
| | - Kiyoshi Asai
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa 277-8561, Japan
- Artificial Intelligence Research Center (AIRC), National Institute of Advanced Industrial Science and Technology (AIST), 2-3-26, Aomi, Koto-ku, 135-0064 Tokyo, Japan
| |
Collapse
|
27
|
Lizak A, Szweda R. Czy plastik może rozpocząć nową erę w archiwizacji danych? ARCHEION 2022. [DOI: 10.4467/26581264arc.22.014.16667] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/23/2022] Open
Abstract
With the rapid development of information technology, many aspects of our lives are undergoing a digital transformation. An increasing number of users are going online every year, and constantly improving artificial intelligence is gaining popularity, which leads to the growing production of information. Nowadays, information is usually stored in data centres, which will be forced to increase their space with the constant flow of new bits of information. Together with the increase in their space, energy consumption and associated maintenance costs are escalating. In 2021, global data centre power consumption was 220–320 TWh, which is about 0.9–1.3% of global power consumption. Continuous power supply for database operations is responsible for about 1% of total carbon dioxide emissions. Furthermore, it has already been reported that with the exponentially growing amount of data, in about 20 years, the amount of silicon for microprocessors will no longer be sufficient to store all the information. Therefore, scientists are looking for alternatives to the currently used data storage solutions and are developing new technologies using chemical molecules. Recently, even plastic has been explored as a data carrier. In this work, we present examples of new technologies for data storage in polymers. We have discussed polymers as data carriers in comparison with currently used solutions and deliberated whether plastic can become a future material for information archiving.
Collapse
|
28
|
Sedaghat-Hamedani F, Rebs S, Kayvanpour E, Zhu C, Amr A, Müller M, Haas J, Wu J, Steinmetz LM, Ehlermann P, Streckfuss-Bömeke K, Frey N, Meder B. Genotype Complements the Phenotype: Identification of the Pathogenicity of an LMNA Splice Variant by Nanopore Long-Read Sequencing in a Large DCM Family. Int J Mol Sci 2022; 23:ijms232012230. [PMID: 36293084 PMCID: PMC9602549 DOI: 10.3390/ijms232012230] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2022] [Revised: 10/07/2022] [Accepted: 10/10/2022] [Indexed: 11/16/2022] Open
Abstract
Dilated cardiomyopathy (DCM) is a common cause of heart failure (HF) and is of familial origin in 20−40% of cases. Genetic testing by next-generation sequencing (NGS) has yielded a definite diagnosis in many cases; however, some remain elusive. In this study, we used a combination of NGS, human-induced pluripotent-stem-cell-derived cardiomyocytes (iPSC-CMs) and nanopore long-read sequencing to identify the causal variant in a multi-generational pedigree of DCM. A four-generation family with familial DCM was investigated. Next-generation sequencing (NGS) was performed on 22 family members. Skin biopsies from two affected family members were used to generate iPSCs, which were then differentiated into iPSC-CMs. Short-read RNA sequencing was used for the evaluation of the target gene expression, and long-read RNA nanopore sequencing was used to evaluate the relevance of the splice variants. The pedigree suggested a highly penetrant, autosomal dominant mode of inheritance. The phenotype of the family was suggestive of laminopathy, but previous genetic testing using both Sanger and panel sequencing only yielded conflicting evidence for LMNA p.R644C (rs142000963), which was not fully segregated. By re-sequencing four additional affected family members, further non-coding LMNA variants could be detected: rs149339264, rs199686967, rs201379016, and rs794728589. To explore the roles of these variants, iPSC-CMs were generated. RNA sequencing showed the LMNA expression levels to be significantly lower in the iPSC-CMs of the LMNA variant carriers. We demonstrated a dysregulated sarcomeric structure and altered calcium homeostasis in the iPSC-CMs of the LMNA variant carriers. Using targeted nanopore long-read sequencing, we revealed the biological significance of the variant c.356+1G>A, which generates a novel 5′ splice site in exon 1 of the cardiac isomer of LMNA, causing a nonsense mRNA product with almost complete RNA decay and haploinsufficiency. Using novel molecular analysis and nanopore technology, we demonstrated the pathogenesis of the rs794728589 (c.356+1G>A) splice variant in LMNA. This study highlights the importance of precise diagnostics in the clinical management and workup of cardiomyopathies.
Collapse
Affiliation(s)
- Farbod Sedaghat-Hamedani
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Sabine Rebs
- Department of Cardiology and Pneumology, Georg-August-University Göttingen, 37073 Göttingen, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Göttingen, 37075 Göttingen, Germany
- Department of Pharmacology and Toxicology, University of Würzburg, 97070 Würzburg, Germany
| | - Elham Kayvanpour
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Chenchen Zhu
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Ali Amr
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Marion Müller
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Clinic for General and Interventional Cardiology/Angiology, Herz- und Diabeteszentrum NRW, University Hospital of the Ruhr-Universität Bochum, 32545 Bad Oeynhausen, Germany
| | - Jan Haas
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Jingyan Wu
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Lars M. Steinmetz
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
| | - Philipp Ehlermann
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Katrin Streckfuss-Bömeke
- Department of Cardiology and Pneumology, Georg-August-University Göttingen, 37073 Göttingen, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Göttingen, 37075 Göttingen, Germany
- Department of Pharmacology and Toxicology, University of Würzburg, 97070 Würzburg, Germany
| | - Norbert Frey
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, 69120 Heidelberg, Germany
| | - Benjamin Meder
- Institute for Cardiomyopathies Heidelberg (ICH), University Hospital Heidelberg, 69120 Heidelberg, Germany
- DZHK (German Centre for Cardiovascular Research), Partner Site Heidelberg and Mannheim, 69120 Heidelberg, Germany
- Department of Internal Medicine III, University Hospital Heidelberg, 69120 Heidelberg, Germany
- Department of Genetics, Stanford University, Stanford, CA 94305, USA
- Correspondence:
| |
Collapse
|
29
|
Cuenca-Guardiola J, de la Morena-Barrio B, García JL, Sanchis-Juan A, Corral J, Fernández-Breis JT. Improvement of large copy number variant detection by whole genome nanopore sequencing. J Adv Res 2022:S2090-1232(22)00241-7. [PMID: 36323370 PMCID: PMC10403694 DOI: 10.1016/j.jare.2022.10.012] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/12/2022] [Revised: 10/18/2022] [Accepted: 10/22/2022] [Indexed: 11/11/2022] Open
Abstract
INTRODUCTION Whole-genome sequencing using nanopore technologies can uncover structural variants, which are DNA rearrangements larger than 50 base pairs. Nanopore technologies can also characterize their boundaries with single-base accuracy, owing to the kilobase-long reads that encompass either full variants or their junctions. Other methods, such as next-generation short read sequencing or PCR assays, are limited in their capabilities to detect or characterize structural variants. However, the existing software for nanopore sequencing data analysis still reports incomplete variant sets, which also contain erroneous calls, a considerable obstacle for the molecular diagnosis or accurate genotyping of populations. METHODS We compared multiple factors affecting variant calling, such as reference genome version, aligner (minimap2, NGMLR, and lra) choice, and variant caller combinations (Sniffles, CuteSV, SVIM, and NanoVar), to find the optimal group of tools for calling large (>50 kb) deletions and duplications, using data from seven patients exhibiting gross gene defects on SERPINC1 and from a reference variant set as the control. The goal was to obtain the most complete, yet reasonably specific group of large variants using a single cell of PromethION sequencing, which yielded lower depth coverage than short-read sequencing. We also used a custom method for the statistical analysis of the coverage value to refine the resulting datasets. RESULTS We found that for large deletions and duplications (>50 kb), the existing software performed worse than for smaller ones, in terms of both sensitivity and specificity, and newer tools had not improved this. Our novel software, disCoverage, could polish variant callers' results, improving specificity by up to 62% and sensitivity by 15%, the latter requiring other data or samples. CONCLUSION We analyzed the current situation of >50-kb copy number variants with nanopore sequencing, which could be improved. The methods presented in this work could help to identify the known deletions and duplications in a set of patients, while also helping to filter out erroneous calls for these variants, which might aid the efforts to characterize a not-yet well-known fraction of genetic variability in the human genome.
Collapse
|
30
|
Ahmed YW, Alemu BA, Bekele SA, Gizaw ST, Zerihun MF, Wabalo EK, Teklemariam MD, Mihrete TK, Hanurry EY, Amogne TG, Gebrehiwot AD, Berga TN, Haile EA, Edo DO, Alemu BD. Epigenetic tumor heterogeneity in the era of single-cell profiling with nanopore sequencing. Clin Epigenetics 2022; 14:107. [PMID: 36030244 PMCID: PMC9419648 DOI: 10.1186/s13148-022-01323-6] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/27/2022] [Accepted: 08/12/2022] [Indexed: 11/29/2022] Open
Abstract
Nanopore sequencing has brought the technology to the next generation in the science of sequencing. This is achieved through research advancing on: pore efficiency, creating mechanisms to control DNA translocation, enhancing signal-to-noise ratio, and expanding to long-read ranges. Heterogeneity regarding epigenetics would be broad as mutations in the epigenome are sensitive to cause new challenges in cancer research. Epigenetic enzymes which catalyze DNA methylation and histone modification are dysregulated in cancer cells and cause numerous heterogeneous clones to evolve. Detection of this heterogeneity in these clones plays an indispensable role in the treatment of various cancer types. With single-cell profiling, the nanopore sequencing technology could provide a simple sequence at long reads and is expected to be used soon at the bedside or doctor's office. Here, we review the advancements of nanopore sequencing and its use in the detection of epigenetic heterogeneity in cancer.
Collapse
Affiliation(s)
- Yohannis Wondwosen Ahmed
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia.
| | - Berhan Ababaw Alemu
- Department of Medical Biochemistry, School of Medicine, St. Paul's Hospital, Millennium Medical College, Addis Ababa, Ethiopia
| | - Sisay Addisu Bekele
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Solomon Tebeje Gizaw
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Muluken Fekadie Zerihun
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Endriyas Kelta Wabalo
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Maria Degef Teklemariam
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Tsehayneh Kelemu Mihrete
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Endris Yibru Hanurry
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Tensae Gebru Amogne
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Assaye Desalegne Gebrehiwot
- Department of Medical Anatomy, School of Medicine, College of Health Sciences, Addis Ababa University, Addis Ababa, Ethiopia
| | - Tamirat Nida Berga
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Ebsitu Abate Haile
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Dessiet Oma Edo
- Department of Medical Biochemistry, School of Medicine, College of Health Sciences, Addis Ababa University, P.O. Box: 9086, Addis Ababa, Ethiopia
| | - Bizuwork Derebew Alemu
- Department of Statistics, College of Natural and Computational Sciences, Mizan Tepi University, Tepi, Ethiopia
| |
Collapse
|
31
|
Cechova M, Miga KH. Satellite DNAs and human sex chromosome variation. Semin Cell Dev Biol 2022; 128:15-25. [PMID: 35644878 DOI: 10.1016/j.semcdb.2022.04.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2022] [Revised: 04/26/2022] [Accepted: 04/27/2022] [Indexed: 11/17/2022]
Abstract
Satellite DNAs are present on every chromosome in the cell and are typically enriched in repetitive, heterochromatic parts of the human genome. Sex chromosomes represent a unique genomic and epigenetic context. In this review, we first report what is known about satellite DNA biology on human X and Y chromosomes, including repeat content and organization, as well as satellite variation in typical euploid individuals. Then, we review sex chromosome aneuploidies that are among the most common types of aneuploidies in the general population, and are better tolerated than autosomal aneuploidies. This is demonstrated also by the fact that aging is associated with the loss of the X, and especially the Y chromosome. In addition, supernumerary sex chromosomes enable us to study general processes in a cell, such as analyzing heterochromatin dosage (i.e. additional Barr bodies and long heterochromatin arrays on Yq) and their downstream consequences. Finally, genomic and epigenetic organization and regulation of satellite DNA could influence chromosome stability and lead to aneuploidy. In this review, we argue that the complete annotation of satellite DNA on sex chromosomes in human, and especially in centromeric regions, will aid in explaining the prevalence and the consequences of sex chromosome aneuploidies.
Collapse
Affiliation(s)
- Monika Cechova
- Faculty of Informatics, Masaryk University, Czech Republic
| | - Karen H Miga
- Department of Biomolecular Engineering, University of California Santa Cruz, CA, USA; UC Santa Cruz Genomics Institute, University of California Santa Cruz, CA 95064, USA
| |
Collapse
|
32
|
Schmidt J, Berghaus S, Blessing F, Herbeck H, Blessing J, Schierack P, Rödiger S, Roggenbuck D, Wenzel F. Genotyping of familial Mediterranean fever gene (MEFV)-Single nucleotide polymorphism-Comparison of Nanopore with conventional Sanger sequencing. PLoS One 2022; 17:e0265622. [PMID: 35298548 PMCID: PMC8929590 DOI: 10.1371/journal.pone.0265622] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/03/2021] [Accepted: 03/04/2022] [Indexed: 11/18/2022] Open
Abstract
Background Through continuous innovation and improvement, Nanopore sequencing has become a powerful technology. Because of its fast processing time, low cost, and ability to generate long reads, this sequencing technique would be particularly suitable for clinical diagnostics. However, its raw data accuracy is inferior in contrast to other sequencing technologies. This constraint still results in limited use of Nanopore sequencing in the field of clinical diagnostics and requires further validation and IVD certification. Methods We evaluated the performance of latest Nanopore sequencing in combination with a dedicated data-analysis pipeline for single nucleotide polymorphism (SNP) genotyping of the familial Mediterranean fever gene (MEFV) by amplicon sequencing of 47 clinical samples. Mutations in MEFV are associated with Mediterranean fever, a hereditary periodic fever syndrome. Conventional Sanger sequencing, which is commonly applied in clinical genetic diagnostics, was used as a reference method. Results Nanopore sequencing enabled the sequencing of 10 target regions within MEFV with high read depth (median read depth 7565x) in all samples and identified a total of 435 SNPs in the whole sample collective, of which 29 were unique. Comparison of both sequencing workflows showed a near perfect agreement with no false negative calls. Precision, Recall, and F1-Score of the Nanopore sequencing workflow were > 0.99, respectively. Conclusions These results demonstrated the great potential of current Nanopore sequencing for application in clinical diagnostics, at least for SNP genotyping by amplicon sequencing. Other more complex applications, especially structural variant identification, require further in-depth clinical validation.
Collapse
Affiliation(s)
- Jonas Schmidt
- Institute for Laboratory Medicine, Singen, Germany
- Faculty of Medical and Life Sciences, Furtwangen University, Villingen-Schwenningen, Germany
- Faculty Environment and Natural Sciences, Institute of Biotechnology, Brandenburg University of Technology Cottbus-Senftenberg, Senftenberg, Germany
| | | | - Frithjof Blessing
- Institute for Laboratory Medicine, Singen, Germany
- Faculty of Medical and Life Sciences, Furtwangen University, Villingen-Schwenningen, Germany
| | | | | | - Peter Schierack
- Faculty Environment and Natural Sciences, Institute of Biotechnology, Brandenburg University of Technology Cottbus-Senftenberg, Senftenberg, Germany
- Faculty of Health Sciences Brandenburg, Brandenburg University of Technology Cottbus–Senftenberg, Senftenberg, Germany
| | - Stefan Rödiger
- Faculty Environment and Natural Sciences, Institute of Biotechnology, Brandenburg University of Technology Cottbus-Senftenberg, Senftenberg, Germany
- Faculty of Health Sciences Brandenburg, Brandenburg University of Technology Cottbus–Senftenberg, Senftenberg, Germany
| | - Dirk Roggenbuck
- Faculty Environment and Natural Sciences, Institute of Biotechnology, Brandenburg University of Technology Cottbus-Senftenberg, Senftenberg, Germany
- Faculty of Health Sciences Brandenburg, Brandenburg University of Technology Cottbus–Senftenberg, Senftenberg, Germany
- * E-mail:
| | - Folker Wenzel
- Faculty of Medical and Life Sciences, Furtwangen University, Villingen-Schwenningen, Germany
| |
Collapse
|
33
|
Patel A, Dogan H, Payne A, Krause E, Sievers P, Schoebe N, Schrimpf D, Blume C, Stichel D, Holmes N, Euskirchen P, Hench J, Frank S, Rosenstiel-Goidts V, Ratliff M, Etminan N, Unterberg A, Dieterich C, Herold-Mende C, Pfister SM, Wick W, Loose M, von Deimling A, Sill M, Jones DTW, Schlesner M, Sahm F. Rapid-CNS 2: rapid comprehensive adaptive nanopore-sequencing of CNS tumors, a proof-of-concept study. Acta Neuropathol 2022; 143:609-612. [PMID: 35357562 PMCID: PMC9038836 DOI: 10.1007/s00401-022-02415-6] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/21/2022] [Revised: 03/21/2022] [Accepted: 03/22/2022] [Indexed: 01/01/2023]
Affiliation(s)
- Areeba Patel
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Helin Dogan
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Alexander Payne
- DeepSeq, School of Life Sciences, University of Nottingham, Nottingham, UK
| | - Elena Krause
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
| | - Philipp Sievers
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Natalie Schoebe
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Daniel Schrimpf
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Christina Blume
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Damian Stichel
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Nadine Holmes
- DeepSeq, School of Life Sciences, University of Nottingham, Nottingham, UK
| | - Philipp Euskirchen
- Department of Neurology, Charité-Universitätsmedizin Berlin, Berlin, Germany
| | - Jürgen Hench
- Division of Neuropathology, Institute of Pathology, University Hospital Basel, Basel, Switzerland
| | - Stephan Frank
- Division of Neuropathology, Institute of Pathology, University Hospital Basel, Basel, Switzerland
| | | | - Miriam Ratliff
- Department of Neurosurgery, University Hospital Mannheim, Mannheim, Germany
| | - Nima Etminan
- Department of Neurosurgery, University Hospital Mannheim, Mannheim, Germany
| | - Andreas Unterberg
- Department of Neurosurgery, University Hospital Heidelberg, Heidelberg, Germany
| | - Christoph Dieterich
- Department of Cardiology, Angiology, and Pneumology, University Hospital Heidelberg, University of Heidelberg, Heidelberg, Germany
| | | | - Stefan M Pfister
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
- Department of Pediatric Hematology and Oncology, Heidelberg University Hospital, Heidelberg, Germany
| | - Wolfgang Wick
- Clinical Cooperation Unit Neurooncology, German Consortium for Translational Cancer Research (DKTK), German Cancer Research Center (DKFZ), Heidelberg, Germany
- Department of Neurology and Neurooncology Program, National Center for Tumor Diseases, Heidelberg University Hospital, Heidelberg, Germany
| | - Matthew Loose
- DeepSeq, School of Life Sciences, University of Nottingham, Nottingham, UK
| | - Andreas von Deimling
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany
| | - Martin Sill
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Division of Pediatric Neurooncology, German Cancer Research Center (DKFZ), German Cancer Consortium (DKTK), Heidelberg, Germany
| | - David T W Jones
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany
- Pediatric Glioma Research Group, German Cancer Research Center (DKFZ), Heidelberg, Germany
| | - Matthias Schlesner
- Biomedical Informatics, Data Mining and Data Analytics, Augsburg University, Augsburg, Germany
| | - Felix Sahm
- Department of Neuropathology, University Hospital Heidelberg, Heidelberg, Germany.
- Clinical Cooperation Unit Neuropathology, German Cancer Consortium (DKTK), German Cancer Research Center (DKFZ), Im Neuenheimer Feld 224, 69120, Heidelberg, Germany.
- Hopp Children's Cancer Center (KiTZ), Heidelberg, Germany.
| |
Collapse
|
34
|
Uppuluri L, Wang Y, Young E, Wong JS, Abid HZ, Xiao M. Multiplex structural variant detection by whole-genome mapping and nanopore sequencing. Sci Rep 2022; 12:6512. [PMID: 35444207 PMCID: PMC9021263 DOI: 10.1038/s41598-022-10483-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2021] [Accepted: 04/08/2022] [Indexed: 11/26/2022] Open
Abstract
Identification of structural variants (SVs) breakpoints is important in studying mutations, mutagenic causes, and functional impacts. Next-generation sequencing and whole-genome optical mapping are extensively used in SV discovery and characterization. However, multiple platforms and computational approaches are needed for comprehensive analysis, making it resource-intensive and expensive. Here, we propose a strategy combining optical mapping and cas9-assisted targeted nanopore sequencing to analyze SVs. Optical mapping can economically and quickly detect SVs across a whole genome but does not provide sequence-level information or precisely resolve breakpoints. Furthermore, since only a subset of all SVs is known to affect biology, we attempted to type a subset of all SVs using targeted nanopore sequencing. Using our approach, we resolved the breakpoints of five deletions, five insertions, and an inversion, in a single experiment.
Collapse
Affiliation(s)
- Lahari Uppuluri
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, USA.,Department of Mechanical Engineering and Mechanics, Drexel University, Philadelphia, PA, USA
| | - Yilin Wang
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, USA
| | - Eleanor Young
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, USA
| | - Jessica S Wong
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, USA
| | - Heba Z Abid
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, USA
| | - Ming Xiao
- School of Biomedical Engineering, Science and Health Systems, Drexel University, Philadelphia, PA, USA. .,Center for Genomic Sciences, Institute of Molecular Medicine and Infectious Disease, Drexel University, Philadelphia, PA, USA.
| |
Collapse
|
35
|
Kashyap D, Pal D, Sharma R, Garg VK, Goel N, Koundal D, Zaguia A, Koundal S, Belay A. Global Increase in Breast Cancer Incidence: Risk Factors and Preventive Measures. BIOMED RESEARCH INTERNATIONAL 2022; 2022:9605439. [PMID: 35480139 PMCID: PMC9038417 DOI: 10.1155/2022/9605439] [Citation(s) in RCA: 132] [Impact Index Per Article: 66.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 01/17/2022] [Revised: 02/25/2022] [Accepted: 03/21/2022] [Indexed: 02/07/2023]
Abstract
Breast cancer is a global cause for concern owing to its high incidence around the world. The alarming increase in breast cancer cases emphasizes the management of disease at multiple levels. The management should start from the beginning that includes stringent cancer screening or cancer registry to effective diagnostic and treatment strategies. Breast cancer is highly heterogeneous at morphology as well as molecular levels and needs different therapeutic regimens based on the molecular subtype. Breast cancer patients with respective subtype have different clinical outcome prognoses. Breast cancer heterogeneity emphasizes the advanced molecular testing that will help on-time diagnosis and improved survival. Emerging fields such as liquid biopsy and artificial intelligence would help to under the complexity of breast cancer disease and decide the therapeutic regimen that helps in breast cancer management. In this review, we have discussed various risk factors and advanced technology available for breast cancer diagnosis to combat the worst breast cancer status and areas that need to be focused for the better management of breast cancer.
Collapse
Affiliation(s)
- Dharambir Kashyap
- Department of Histopathology, Postgraduate Institute of Medical Education and Research, Chandigarh 160012, India
| | - Deeksha Pal
- Department of Translational and Regenerative Medicine, Postgraduate Institute of Medical Education and Research, Chandigarh 160012, India
| | - Riya Sharma
- Department of Pulmonary Medicine, Postgraduate Institute of Medical Education and Research, Chandigarh 160012, India
| | - Vivek Kumar Garg
- Department of Medical Laboratory Technology, University Institute of Applied Health Sciences, Chandigarh University (Gharuan), Mohali 140313, India
| | - Neelam Goel
- Department of Information Technology, University Institute of Engineering & Technology, Panjab University, Chandigarh 160014, India
| | - Deepika Koundal
- Department of Systemics, School of Computer Science, University of Petroleum & Energy Studies, Dehradun, India
| | - Atef Zaguia
- Department of computer science, College of Computers and Information Technology, Taif University, P.O. BOX 11099, Taif 21944, Saudi Arabia
| | - Shubham Koundal
- Department of Medical Laboratory Technology, University Institute of Applied Health Sciences, Chandigarh University (Gharuan), Mohali 140313, India
| | - Assaye Belay
- Department of Statistics, Mizan-Tepi University, Ethiopia
| |
Collapse
|
36
|
Wang Y, Chen D, Zhu C, Zhao Z, Gao S, Gou J, Guo Y, Kong X. Genetic Surveillance of Five SARS-CoV-2 Clinical Samples in Henan Province Using Nanopore Sequencing. Front Immunol 2022; 13:814806. [PMID: 35444655 PMCID: PMC9013895 DOI: 10.3389/fimmu.2022.814806] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2021] [Accepted: 03/10/2022] [Indexed: 11/13/2022] Open
Abstract
Severe acute respiratory syndrome coronavirus 2 (SARS-CoV-2) has rapidly spread and poses a major threat to public health worldwide. The whole genome sequencing plays a crucial role in virus surveillance and evolutionary analysis. In this study, five genome sequences of SARS-CoV-2 were obtained from nasopharyngeal swab samples from Zhengzhou, China. Following RNA extraction and cDNA synthesis, multiplex PCR was performed with two primer pools to produce the overlapped amplicons of ~1,200 bp. The viral genomes were obtained with 96% coverage using nanopore sequencing. Forty-five missense nucleotide mutations were identified; out of these, 5 mutations located at Nsp2, Nsp3, Nsp14, and ORF10 genes occurred with a <0.1% frequency in the global dataset. On the basis of mutation profiles, five genomes were clustered into two sublineages (B.1.617.2 and AY.31) or subclades (21A and 21I). The phylogenetic analysis of viral genomes from several regions of China and Myanmar revealed that five patients had different viral transmission chains. Taken together, we established a nanopore sequencing platform for genetic surveillance of SARS-CoV-2 and identified the variants circulating in Zhengzhou during August 2021. Our study provided crucial support for government policymaking and prevention and control of COVID-19.
Collapse
Affiliation(s)
- Yanan Wang
- Genetic and Prenatal Diagnosis Center, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Duo Chen
- Genetic and Prenatal Diagnosis Center, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Chaofeng Zhu
- Genetic and Prenatal Diagnosis Center, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Zhenhua Zhao
- Genetic and Prenatal Diagnosis Center, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Shanshan Gao
- Genetic and Prenatal Diagnosis Center, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Jianjun Gou
- Department of Clinical Laboratory, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| | - Yongjun Guo
- Department of Pathology, Henan Academy of Medical Sciences, Zhengzhou, China
| | - Xiangdong Kong
- Genetic and Prenatal Diagnosis Center, the First Affiliated Hospital of Zhengzhou University, Zhengzhou, China
| |
Collapse
|
37
|
Stefan CP, Hall AT, Graham AS, Minogue TD. Comparison of Illumina and Oxford Nanopore Sequencing Technologies for Pathogen Detection from Clinical Matrices Using Molecular Inversion Probes. J Mol Diagn 2022; 24:395-405. [PMID: 35085783 DOI: 10.1016/j.jmoldx.2021.12.005] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2021] [Revised: 11/19/2021] [Accepted: 12/22/2021] [Indexed: 11/16/2022] Open
Abstract
Next-generation sequencing is rapidly finding footholds in numerous microbiological fields, including infectious disease diagnostics. Here, we describe a molecular inversion probe panel for the identification of bacterial, viral, and parasitic pathogens. We describe the ability of Illumina and Oxford Nanopore Technologies (ONT) to sequence small amplicons originating from this panel for the identification of pathogens in complex matrices. The panel correctly classified 31 bacterial pathogens directly from positive blood culture bottles with a genus-level concordance of 96.7% and 90.3% on the Illumina and ONT platforms, respectively. Both sequencing platforms detected 18 viral and parasitic organisms directly from mock clinical samples of plasma and whole blood at concentrations of 104 PFU/mL with few exceptions. In general, Illumina sequencing exhibited greater read counts with lower percent mapped reads; however, this resulted in no effect on limits of detection compared with ONT sequencing. Mock clinical evaluation of the probe panel on the Illumina and ONT platforms resulted in positive predictive values of 0.91 and 0.88 and negative predictive values of 1 and 1 from de-identified human chikungunya virus samples compared with gold standard quantitative RT-PCR. Overall, these data show that molecular inversion probes are an adaptable technology capable of pathogen detection from complex sample matrices on current next-generation sequencing platforms.
Collapse
Affiliation(s)
- Christopher P Stefan
- Diagnostic Systems Division, United States Army Medical Research Institute of Infectious Disease, Fort Detrick, Maryland
| | - Adrienne T Hall
- Diagnostic Systems Division, United States Army Medical Research Institute of Infectious Disease, Fort Detrick, Maryland
| | - Amanda S Graham
- Diagnostic Systems Division, United States Army Medical Research Institute of Infectious Disease, Fort Detrick, Maryland
| | - Timothy D Minogue
- Diagnostic Systems Division, United States Army Medical Research Institute of Infectious Disease, Fort Detrick, Maryland.
| |
Collapse
|
38
|
Marsili L, Duque KR, Bode RL, Kauffman MA, Espay AJ. Uncovering Essential Tremor Genetics: The Promise of Long-Read Sequencing. Front Neurol 2022; 13:821189. [PMID: 35401394 PMCID: PMC8983820 DOI: 10.3389/fneur.2022.821189] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 02/25/2022] [Indexed: 12/23/2022] Open
Abstract
Long-read sequencing (LRS) technologies have been recently introduced to overcome intrinsic limitations of widely-used next-generation sequencing (NGS) technologies, namely the sequencing limited to short-read fragments (150–300 base pairs). Since its introduction, LRS has permitted many successes in unraveling hidden mutational mechanisms. One area in clinical neurology in need of rethinking as it applies to genetic mechanisms is essential tremor (ET). This disorder, among the most common in neurology, is a syndrome often exhibiting an autosomal dominant pattern of inheritance whose large phenotypic spectrum suggest a multitude of genetic etiologies. Exome sequencing has revealed the genetic etiology only in rare ET families (FUS, SORT1, SCN4A, NOS3, KCNS2, HAPLN4/BRAL2, and USP46). We hypothesize that a reason for this shortcoming may be non-classical genetic mechanism(s) underpinning ET, among them trinucleotide, tetranucleotide, or pentanucleotide repeat disorders. In support of this hypothesis, trinucleotide (e.g., GGC repeats in NOTCH2NLC) and pentanucleotide repeat disorders (e.g., ATTTC repeats in STARD7) have been revealed as pathogenic in patients with a past history of what has come to be referred to as “ET plus,” bilateral hand tremor associated with epilepsy and/or leukoencephalopathy. A systematic review of LRS in neurodegenerative disorders showed that 10 of the 22 (45%) genetic etiologies ascertained by LRS include tremor in their phenotypic spectrum, suggesting that future clinical applications of LRS for tremor disorders may uncover genetic subtypes of familial ET that have eluded NGS, particularly those with associated leukoencephalopathy or family history of epilepsy. LRS provides a pathway for potentially uncovering novel genes and genetic mechanisms, helping narrow the large proportion of “idiopathic” ET.
Collapse
Affiliation(s)
- Luca Marsili
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Kevin R. Duque
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Rachel L. Bode
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Marcelo A. Kauffman
- Consultorio y Laboratorio de Neurogenética, Centro Universitario de Neurología José María Ramos Mejía, Buenos Aires, Argentina
| | - Alberto J. Espay
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
- *Correspondence: Alberto J. Espay
| |
Collapse
|
39
|
Zhao C, Li K, Mou X, Zhu Y, Chen C, Zhang M, Wang Y, Zhou K, Sheng Y, Liu H, Bai Y, Li X, Zhou C, Deng D, Wu J, Wu HC, Bao R, Geng J. High-fidelity biosensing of dNTPs and nucleic acids by controllable subnanometer channel PaMscS. Biosens Bioelectron 2022; 200:113894. [PMID: 34973563 DOI: 10.1016/j.bios.2021.113894] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2021] [Revised: 12/03/2021] [Accepted: 12/14/2021] [Indexed: 02/05/2023]
Abstract
Current tools for dNTP analysis mainly rely on expensive fluorescent labeling, mass spectrometry or electrochemistry. Single-molecule assay by protein nanopores with an internal diameter of ca. 1-3.6 nm provides a useful tool for dNTP sensing. However, the most commonly used protein nanopores require additional modifications to enable dNTP detection. In this study, the PaMscS channel (mechanosensitive channel of small conductance from Pseudomonas aeruginosa) embedded in the bilayer lipid membrane (BLM) of E. coli polar lipid extract was applied as a nanopore for single molecular sensing. Two mutants of PaMscS nanopores on the side portal region (PaMscS W130A and PaMscS K180R) were selected for direct dNTP or pyrophosphoric acid (PPi) detection without aptamer or protein modification. Notably, the PaMscS mutant pore can be adjusted by regulation of osmolarity differences, which is crucial for the optimal detection of specific molecules. In addition, we established a PaMscS-based diagnosis method for the rapid sensing of disease-associated nucleic acids by monitoring the consumption of dNTPs, with 86% specificity and 100% sensitivity among 22 clinical samples. This protein nanopore, without aptamer or modification, paves a new way for dNTPs, PPi direct sensing and nucleic acid detection with low cost but high versatility.
Collapse
Affiliation(s)
- Changjian Zhao
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Kaiju Li
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Xingyu Mou
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Yibo Zhu
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Chuan Chen
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China; School of Pharmacy, North Sichuan Medical College, Nanchong, 637000, China
| | - Ming Zhang
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Yu Wang
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Ke Zhou
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Analytical Chemistry for Living Biosystems, Institute of Chemistry, Chinese Academy of Sciences, Beijing, 100190, China
| | - Yingying Sheng
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Analytical Chemistry for Living Biosystems, Institute of Chemistry, Chinese Academy of Sciences, Beijing, 100190, China
| | - Hao Liu
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Yunjin Bai
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Xinqiong Li
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China
| | - Cuisong Zhou
- College of Chemistry, Sichuan University, Chengdu, 610041, China
| | - Dong Deng
- Division of Obstetrics, Key Laboratory of Birth Defects and Related Disease of Women and Children of MOE, State Key Laboratory of Biotherapy, West China Second Hospital, Sichuan University, Chengdu, 610041, Sichuan, China
| | - Jianping Wu
- Zhejiang Provincial Laboratory of Life Sciences and Biomedicine, Key Laboratory of Structural Biology of Zhejiang Province, School of Life Sciences, Westlake University, Hangzhou, Zhejiang, 310024, China; Institute of Biology, Westlake Institute for Advanced Study, Hangzhou, Zhejiang, 310024, China
| | - Hai-Chen Wu
- Beijing National Laboratory for Molecular Sciences, Key Laboratory of Analytical Chemistry for Living Biosystems, Institute of Chemistry, Chinese Academy of Sciences, Beijing, 100190, China.
| | - Rui Bao
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China.
| | - Jia Geng
- Department of Laboratory Medicine, State Key Laboratory of Biotherapy, Med-X Center for Manufacturing, West China Hospital, Sichuan University and Collaborative Innovation Center, Chengdu, 610041, China.
| |
Collapse
|
40
|
Leung AWS, Leung HCM, Wong CL, Zheng ZX, Lui WW, Luk HM, Lo IFM, Luo R, Lam TW. ECNano: A cost-effective workflow for target enrichment sequencing and accurate variant calling on 4800 clinically significant genes using a single MinION flowcell. BMC Med Genomics 2022; 15:43. [PMID: 35246132 PMCID: PMC8895767 DOI: 10.1186/s12920-022-01190-3] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/26/2021] [Accepted: 02/22/2022] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND The application of long-read sequencing using the Oxford Nanopore Technologies (ONT) MinION sequencer is getting more diverse in the medical field. Having a high sequencing error of ONT and limited throughput from a single MinION flowcell, however, limits its applicability for accurate variant detection. Medical exome sequencing (MES) targets clinically significant exon regions, allowing rapid and comprehensive screening of pathogenic variants. By applying MES with MinION sequencing, the technology can achieve a more uniform capture of the target regions, shorter turnaround time, and lower sequencing cost per sample. METHOD We introduced a cost-effective optimized workflow, ECNano, comprising a wet-lab protocol and bioinformatics analysis, for accurate variant detection at 4800 clinically important genes and regions using a single MinION flowcell. The ECNano wet-lab protocol was optimized to perform long-read target enrichment and ONT library preparation to stably generate high-quality MES data with adequate coverage. The subsequent variant-calling workflow, Clair-ensemble, adopted a fast RNN-based variant caller, Clair, and was optimized for target enrichment data. To evaluate its performance and practicality, ECNano was tested on both reference DNA samples and patient samples. RESULTS ECNano achieved deep on-target depth of coverage (DoC) at average > 100× and > 98% uniformity using one MinION flowcell. For accurate ONT variant calling, the generated reads sufficiently covered 98.9% of pathogenic positions listed in ClinVar, with 98.96% having at least 30× DoC. ECNano obtained an average read length of 1000 bp. The long reads of ECNano also covered the adjacent splice sites well, with 98.5% of positions having ≥ 30× DoC. Clair-ensemble achieved > 99% recall and accuracy for SNV calling. The whole workflow from wet-lab protocol to variant detection was completed within three days. CONCLUSION We presented ECNano, an out-of-the-box workflow comprising (1) a wet-lab protocol for ONT target enrichment sequencing and (2) a downstream variant detection workflow, Clair-ensemble. The workflow is cost-effective, with a short turnaround time for high accuracy variant calling in 4800 clinically significant genes and regions using a single MinION flowcell. The long-read exon captured data has potential for further development, promoting the application of long-read sequencing in personalized disease treatment and risk prediction.
Collapse
Affiliation(s)
- Amy Wing-Sze Leung
- Department of Computer Science, The University of Hong Kong, Hong Kong, China
| | | | - Chak-Lim Wong
- Department of Computer Science, The University of Hong Kong, Hong Kong, China
| | - Zhen-Xian Zheng
- Department of Computer Science, The University of Hong Kong, Hong Kong, China
| | - Wui-Wang Lui
- Department of Computer Science, The University of Hong Kong, Hong Kong, China
| | - Ho-Ming Luk
- Department of Health, Clinical Genetic Service, Hong Kong, SAR, China
| | - Ivan Fai-Man Lo
- Department of Health, Clinical Genetic Service, Hong Kong, SAR, China
| | - Ruibang Luo
- Department of Computer Science, The University of Hong Kong, Hong Kong, China.
| | - Tak-Wah Lam
- Department of Computer Science, The University of Hong Kong, Hong Kong, China.
| |
Collapse
|
41
|
Wan Y, Zong C, Li X, Wang A, Li Y, Yang T, Bao Q, Dubow M, Yang M, Rodrigo LA, Mao C. New Insights for Biosensing: Lessons from Microbial Defense Systems. Chem Rev 2022; 122:8126-8180. [PMID: 35234463 DOI: 10.1021/acs.chemrev.1c01063] [Citation(s) in RCA: 13] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Microorganisms have gained defense systems during the lengthy process of evolution over millions of years. Such defense systems can protect them from being attacked by invading species (e.g., CRISPR-Cas for establishing adaptive immune systems and nanopore-forming toxins as virulence factors) or enable them to adapt to different conditions (e.g., gas vesicles for achieving buoyancy control). These microorganism defense systems (MDS) have inspired the development of biosensors that have received much attention in a wide range of fields including life science research, food safety, and medical diagnosis. This Review comprehensively analyzes biosensing platforms originating from MDS for sensing and imaging biological analytes. We first describe a basic overview of MDS and MDS-inspired biosensing platforms (e.g., CRISPR-Cas systems, nanopore-forming proteins, and gas vesicles), followed by a critical discussion of their functions and properties. We then discuss several transduction mechanisms (optical, acoustic, magnetic, and electrical) involved in MDS-inspired biosensing. We further detail the applications of the MDS-inspired biosensors to detect a variety of analytes (nucleic acids, peptides, proteins, pathogens, cells, small molecules, and metal ions). In the end, we propose the key challenges and future perspectives in seeking new and improved MDS tools that can potentially lead to breakthrough discoveries in developing a new generation of biosensors with a combination of low cost; high sensitivity, accuracy, and precision; and fast detection. Overall, this Review gives a historical review of MDS, elucidates the principles of emulating MDS to develop biosensors, and analyzes the recent advancements, current challenges, and future trends in this field. It provides a unique critical analysis of emulating MDS to develop robust biosensors and discusses the design of such biosensors using elements found in MDS, showing that emulating MDS is a promising approach to conceptually advancing the design of biosensors.
Collapse
Affiliation(s)
- Yi Wan
- State Key Laboratory of Marine Resource Utilization in the South China Sea, School of Pharmaceutical Sciences, Marine College, Hainan University, Haikou 570228, P. R. China
| | - Chengli Zong
- State Key Laboratory of Marine Resource Utilization in the South China Sea, School of Pharmaceutical Sciences, Marine College, Hainan University, Haikou 570228, P. R. China
| | - Xiangpeng Li
- Department of Bioengineering and Therapeutic Sciences, Schools of Medicine and Pharmacy, University of California, San Francisco, 1700 Fourth Street, Byers Hall 303C, San Francisco, California 94158, United States
| | - Aimin Wang
- State Key Laboratory of Marine Resource Utilization in the South China Sea, School of Pharmaceutical Sciences, Marine College, Hainan University, Haikou 570228, P. R. China
| | - Yan Li
- College of Animal Science, Zhejiang University, Hangzhou, Zhejiang 310058, P. R. China
| | - Tao Yang
- School of Materials Science and Engineering, Zhejiang University, Hangzhou, Zhejiang 310058, P. R. China
| | - Qing Bao
- School of Materials Science and Engineering, Zhejiang University, Hangzhou, Zhejiang 310058, P. R. China
| | - Michael Dubow
- Institute for Integrative Biology of the Cell (I2BC), UMR 9198 CNRS, CEA, Université Paris-Saclay, Campus C.N.R.S, Bâtiment 12, Avenue de la Terrasse, 91190 Gif-sur-Yvette, France
| | - Mingying Yang
- College of Animal Science, Zhejiang University, Hangzhou, Zhejiang 310058, P. R. China
| | - Ledesma-Amaro Rodrigo
- Imperial College Centre for Synthetic Biology, Department of Bioengineering, Imperial College London, London SW7 2AZ, United Kingdom
| | - Chuanbin Mao
- Department of Chemistry & Biochemistry, Stephenson Life Science Research Center, University of Oklahoma, 101 Stephenson Parkway, Norman, Oklahoma 73019, United States.,School of Materials Science and Engineering, Zhejiang University, Hangzhou, Zhejiang 310058, P. R. China
| |
Collapse
|
42
|
Methods to Improve Molecular Diagnosis in Genomic Cold Cases in Pediatric Neurology. Genes (Basel) 2022; 13:genes13020333. [PMID: 35205378 PMCID: PMC8871714 DOI: 10.3390/genes13020333] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 02/06/2022] [Accepted: 02/07/2022] [Indexed: 02/04/2023] Open
Abstract
During the last decade, genetic testing has emerged as an important etiological diagnostic tool for Mendelian diseases, including pediatric neurological conditions. A genetic diagnosis has a considerable impact on disease management and treatment; however, many cases remain undiagnosed after applying standard diagnostic sequencing techniques. This review discusses various methods to improve the molecular diagnostic rates in these genomic cold cases. We discuss extended analysis methods to consider, non-Mendelian inheritance models, mosaicism, dual/multiple diagnoses, periodic re-analysis, artificial intelligence tools, and deep phenotyping, in addition to integrating various omics methods to improve variant prioritization. Last, novel genomic technologies, including long-read sequencing, artificial long-read sequencing, and optical genome mapping are discussed. In conclusion, a more comprehensive molecular analysis and a timely re-analysis of unsolved cases are imperative to improve diagnostic rates. In addition, our current understanding of the human genome is still limited due to restrictions in technologies. Novel technologies are now available that improve upon some of these limitations and can capture all human genomic variation more accurately. Last, we recommend a more routine implementation of high molecular weight DNA extraction methods that is coherent with the ability to use and/or optimally benefit from these novel genomic methods.
Collapse
|
43
|
Wang XQ, Goytain A, Dickson BC, Nielsen TO. Advances in Sarcoma Molecular Diagnostics. Genes Chromosomes Cancer 2022; 61:332-345. [DOI: 10.1002/gcc.23025] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2021] [Revised: 01/10/2022] [Accepted: 01/15/2022] [Indexed: 11/11/2022] Open
Affiliation(s)
- Xue Qi Wang
- Faculty of Medicine University of British Columbia Vancouver Canada
- Genetic Pathology Evaluation Centre, Department of Pathology and Laboratory Medicine University of British Columbia Vancouver Canada
| | - Angela Goytain
- Genetic Pathology Evaluation Centre, Department of Pathology and Laboratory Medicine University of British Columbia Vancouver Canada
| | - Brendan C. Dickson
- Department of Pathology & Laboratory Medicine, Mount Sinai Hospital; Department of Laboratory Medicine and Pathobiology University of Toronto Toronto ON Canada
| | - Torsten Owen Nielsen
- Genetic Pathology Evaluation Centre, Department of Pathology and Laboratory Medicine University of British Columbia Vancouver Canada
| |
Collapse
|
44
|
Reis ALM, Deveson IW, Madala BS, Wong T, Barker C, Xu J, Lennon N, Tong W, Mercer TR. Using synthetic chromosome controls to evaluate the sequencing of difficult regions within the human genome. Genome Biol 2022; 23:19. [PMID: 35022065 PMCID: PMC8753822 DOI: 10.1186/s13059-021-02579-6] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Accepted: 12/16/2021] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Next-generation sequencing (NGS) can identify mutations in the human genome that cause disease and has been widely adopted in clinical diagnosis. However, the human genome contains many polymorphic, low-complexity, and repetitive regions that are difficult to sequence and analyze. Despite their difficulty, these regions include many clinically important sequences that can inform the treatment of human diseases and improve the diagnostic yield of NGS. RESULTS To evaluate the accuracy by which these difficult regions are analyzed with NGS, we built an in silico decoy chromosome, along with corresponding synthetic DNA reference controls, that encode difficult and clinically important human genome regions, including repeats, microsatellites, HLA genes, and immune receptors. These controls provide a known ground-truth reference against which to measure the performance of diverse sequencing technologies, reagents, and bioinformatic tools. Using this approach, we provide a comprehensive evaluation of short- and long-read sequencing instruments, library preparation methods, and software tools and identify the errors and systematic bias that confound our resolution of these remaining difficult regions. CONCLUSIONS This study provides an analytical validation of diagnosis using NGS in difficult regions of the human genome and highlights the challenges that remain to resolve these difficult regions.
Collapse
Affiliation(s)
- Andre L M Reis
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Ira W Deveson
- Kinghorn Centre for Clinical Genomics, Garvan Institute of Medical Research, Sydney, NSW, Australia
- St Vincent's Clinical School, University of New South Wales, Sydney, NSW, Australia
| | - Bindu Swapna Madala
- Genomics and Epigenetics Theme, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Ted Wong
- Genomics and Epigenetics Theme, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Chris Barker
- Genomics and Epigenetics Theme, Garvan Institute of Medical Research, Sydney, NSW, Australia
| | - Joshua Xu
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, 72079, USA
| | - Niall Lennon
- Broad Institute of MIT and Harvard, Cambridge, MA, 02142, USA
| | - Weida Tong
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR, 72079, USA
| | - Tim R Mercer
- Genomics and Epigenetics Theme, Garvan Institute of Medical Research, Sydney, NSW, Australia
- Australian Institute for Biotechnology and Nanoengineering, University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
45
|
Artificial Intelligence in Blood Transcriptomics. Artif Intell Med 2022. [DOI: 10.1007/978-3-030-64573-1_262] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
|
46
|
Do T, Guran R, Adam V, Zitka O. Use of MALDI-TOF mass spectrometry for virus identification: a review. Analyst 2022; 147:3131-3154. [DOI: 10.1039/d2an00431c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
The possibilities of virus identification, including SARS-CoV-2, by MALDI-TOF mass spectrometry are discussed in this review.
Collapse
Affiliation(s)
- Tomas Do
- Department of Chemistry and Biochemistry, Faculty of AgriSciences, Mendel University in Brno, Zemedelska 1, CZ-613 00 Brno, Czech Republic
| | - Roman Guran
- Department of Chemistry and Biochemistry, Faculty of AgriSciences, Mendel University in Brno, Zemedelska 1, CZ-613 00 Brno, Czech Republic
- Central European Institute of Technology, Brno University of Technology, Purkynova 656/123, CZ-612 00 Brno, Czech Republic
| | - Vojtech Adam
- Department of Chemistry and Biochemistry, Faculty of AgriSciences, Mendel University in Brno, Zemedelska 1, CZ-613 00 Brno, Czech Republic
- Central European Institute of Technology, Brno University of Technology, Purkynova 656/123, CZ-612 00 Brno, Czech Republic
| | - Ondrej Zitka
- Department of Chemistry and Biochemistry, Faculty of AgriSciences, Mendel University in Brno, Zemedelska 1, CZ-613 00 Brno, Czech Republic
- Central European Institute of Technology, Brno University of Technology, Purkynova 656/123, CZ-612 00 Brno, Czech Republic
| |
Collapse
|
47
|
Application of long-read sequencing to elucidate complex pharmacogenomic regions: a proof of principle. THE PHARMACOGENOMICS JOURNAL 2022; 22:75-81. [PMID: 34741133 PMCID: PMC8794781 DOI: 10.1038/s41397-021-00259-z] [Citation(s) in RCA: 12] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Abstract
The use of pharmacogenomics in clinical practice is becoming standard of care. However, due to the complex genetic makeup of pharmacogenes, not all genetic variation is currently accounted for. Here, we show the utility of long-read sequencing to resolve complex pharmacogenes by analyzing a well-characterised sample. This data consists of long reads that were processed to resolve phased haploblocks. 73% of pharmacogenes were fully covered in one phased haploblock, including 9/15 genes that are 100% complex. Variant calling accuracy in the pharmacogenes was high, with 99.8% recall and 100% precision for SNVs and 98.7% precision and 98.0% recall for Indels. For the majority of gene-drug interactions in the DPWG and CPIC guidelines, the associated genes could be fully resolved (62% and 63% respectively). Together, these findings suggest that long-read sequencing data offers promising opportunities in elucidating complex pharmacogenes and haplotype phasing while maintaining accurate variant calling.
Collapse
|
48
|
Christopher H, Burns A, Josephat E, Makani J, Schuh A, Nkya S. Using DNA testing for the precise, definite, and low-cost diagnosis of sickle cell disease and other Haemoglobinopathies: findings from Tanzania. BMC Genomics 2021; 22:902. [PMID: 34915846 PMCID: PMC8679995 DOI: 10.1186/s12864-021-08220-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2021] [Accepted: 11/22/2021] [Indexed: 11/30/2022] Open
Abstract
Background Sickle cell disease (SCD) is an important cause of under-five mortality. Tanzania is the 5th country in the world with the highest births prevalence of SCD individuals. Significant advances in the neonatal diagnosis of SCD using rapid point-of-care testing have been made. However genetic confirmation is still required for positive cases, in uncertain cases, in multiply transfused patients, to resolve compound heterozygosity (Hb S/ β0 Thal or Hb S/ β+ thal) not uncommon in the coastal regions of East Africa and increasingly also for pre-marital counselling and potentially for future curative approaches such as gene therapy. The currently available DNA tests are prohibitively expensive. Here, we describe an easy-to-use, affordable and accurate β-globin sequencing approach that can be easily integrated within existing NBS for SCD and other haemoglobinopathies especially in Low- and Middle-income Countries. Aim To evaluate an affordable DNA technology for the diagnosis of Sickle cell disease and other haemoglobinopathies in a resource-limited setting. Methods Laboratory-based validation study was conducted by Muhimbili University of Health and Allied Sciences and the University of Oxford involving sequencing of the entire β -haemoglobin locus using the Oxford Nanopore MinION platform. A total number of 36 Dried blood spots and whole blood samples were subjected to conventional protein-based methods (isoelectric focusing, HPLC), and/or sequenced by the Sanger method as comparators. Results Sequencing results for SCD using the MinION were 100% concordant with those from the Sanger method. In addition, the long-read DNA sequencing method enabled the resolution of cases with unusual phenotypes which make up 1% of all children in Tanzania. The cost is £11/ sample for consumables, which is cheaper compared to other sequencing platforms. Conclusions This is the first report of a comprehensive single DNA assay as a definitive diagnostic test for SCD and other haemoglobinopathies. The test is fast, precise, accurate and affordable.
Collapse
Affiliation(s)
- Heavenlight Christopher
- Sickle cell programme, Department of haematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania.
| | - Adam Burns
- Oxford Molecular Diagnostics Centre, University of Oxford, Oxford, UK
| | - Emmanuel Josephat
- Sickle cell programme, Department of haematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
| | - Julie Makani
- Sickle cell programme, Department of haematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania
| | - Anna Schuh
- Sickle cell programme, Department of haematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania.,Oxford Molecular Diagnostics Centre, University of Oxford, Oxford, UK
| | - Siana Nkya
- Sickle cell programme, Department of haematology and Blood Transfusion, Muhimbili University of Health and Allied Sciences, Dar es Salaam, Tanzania.,Department of Biological Sciences, Dar es Salaam University College of Education (DUCE), Dar es Salaam, Tanzania
| |
Collapse
|
49
|
Fukuda H, Yamaguchi D, Nyquist K, Yabuki Y, Miyatake S, Uchiyama Y, Hamanaka K, Saida K, Koshimizu E, Tsuchida N, Fujita A, Mitsuhashi S, Ohbo K, Satake Y, Sone J, Doi H, Morihara K, Okamoto T, Takahashi Y, Wenger AM, Shioda N, Tanaka F, Matsumoto N, Mizuguchi T. Father-to-offspring transmission of extremely long NOTCH2NLC repeat expansions with contractions: genetic and epigenetic profiling with long-read sequencing. Clin Epigenetics 2021; 13:204. [PMID: 34774111 PMCID: PMC8590777 DOI: 10.1186/s13148-021-01192-5] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 10/27/2021] [Indexed: 12/11/2022] Open
Abstract
Background GGC repeat expansions in NOTCH2NLC are associated with neuronal intranuclear inclusion disease. Very recently, asymptomatic carriers with NOTCH2NLC repeat expansions were reported. In these asymptomatic individuals, the CpG island in NOTCH2NLC is hypermethylated, suggesting that two factors repeat length and DNA methylation status should be considered to evaluate pathogenicity. Long-read sequencing can be used to simultaneously profile genomic and epigenomic alterations. We analyzed four sporadic cases with NOTCH2NLC repeat expansion and their phenotypically normal parents. The native genomic DNA that retains base modification was sequenced on a per-trio basis using both PacBio and Oxford Nanopore long-read sequencing technologies. A custom workflow was developed to evaluate DNA modifications. With these two technologies combined, long-range DNA methylation information was integrated with complete repeat DNA sequences to investigate the genetic origins of expanded GGC repeats in these sporadic cases. Results In all four families, asymptomatic fathers had longer expansions (median: 522, 390, 528 and 650 repeats) compared with their affected offspring (median: 93, 117, 162 and 140 repeats, respectively). These expansions are much longer than the disease-causing range previously reported (in general, 41–300 repeats). Repeat lengths were extremely variable in the father, suggesting somatic mosaicism. Instability is more frequent in alleles with uninterrupted pure GGCs. Single molecule epigenetic analysis revealed complex DNA methylation patterns and epigenetic heterogeneity. We identified an aberrant gain-of-methylation region (2.2 kb in size beyond the CpG island and GGC repeats) in asymptomatic fathers. This methylated region was unmethylated in the normal allele with bilateral transitional zones with both methylated and unmethylated CpG dinucleotides, which may be protected from methylation to ensure NOTCH2NLC expression. Conclusions We clearly demonstrate that the four sporadic NOTCH2NLC-related cases are derived from the paternal GGC repeat contraction associated with demethylation. The entire genetic and epigenetic landscape of the NOTCH2NLC region was uncovered using the custom workflow of long-read sequence data, demonstrating the utility of this method for revealing epigenetic/mutational changes in repetitive elements, which are difficult to characterize by conventional short-read/bisulfite sequencing methods. Our approach should be useful for biomedical research, aiding the discovery of DNA methylation abnormalities through the entire genome. Supplementary Information The online version contains supplementary material available at 10.1186/s13148-021-01192-5.
Collapse
Affiliation(s)
- Hiromi Fukuda
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan.,Department of Neurology and Stroke Medicine, Yokohama City University Graduate School of Medicine, Yokohama, Japan
| | | | | | - Yasushi Yabuki
- Department of Genomic Neurology, Institute of Molecular Embryology and Genetics (IMEG), Kumamoto University, Kumamoto, Japan.,Graduate School of Pharmaceutical Sciences, Kumamoto University, Kumamoto, Japan
| | - Satoko Miyatake
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan.,Clinical Genetics Department, Yokohama City University Hospital, Yokohama, Japan
| | - Yuri Uchiyama
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan.,Department of Rare Disease Genomics, Yokohama City University Hospital, Yokohama, Japan
| | - Kohei Hamanaka
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Ken Saida
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Eriko Koshimizu
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Naomi Tsuchida
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan.,Department of Rare Disease Genomics, Yokohama City University Hospital, Yokohama, Japan
| | - Atsushi Fujita
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan
| | - Satomi Mitsuhashi
- Department of Genomic Function and Diversity, Medical Research Institute Tokyo Medical and Dental University, Tokyo, Japan
| | - Kazuyuki Ohbo
- Department of Histology and Cell Biology, Yokohama City University Graduate School of Medicine, Yokohama, Japan
| | - Yuki Satake
- Department of Neurology, Yokkaichi Municipal Hospital, Yokkaichi, Japan
| | - Jun Sone
- Department of Neuropathology, Institute for Medical Science of Aging, Aichi Medical University, Nagakute, Japan.,Department of Neurology, National Hospital Organization Suzuka National Hospital, Suzuka, Japan
| | - Hiroshi Doi
- Department of Neurology and Stroke Medicine, Yokohama City University Graduate School of Medicine, Yokohama, Japan
| | - Keisuke Morihara
- Department of Neurology and Stroke Medicine, Yokohama City University Graduate School of Medicine, Yokohama, Japan
| | - Tomoko Okamoto
- Department of Neurology, National Center Hospital, National Center of Neurology and Psychiatry, Tokyo, Japan
| | - Yuji Takahashi
- Department of Neurology, National Center Hospital, National Center of Neurology and Psychiatry, Tokyo, Japan
| | | | - Norifumi Shioda
- Department of Genomic Neurology, Institute of Molecular Embryology and Genetics (IMEG), Kumamoto University, Kumamoto, Japan.,Graduate School of Pharmaceutical Sciences, Kumamoto University, Kumamoto, Japan
| | - Fumiaki Tanaka
- Department of Neurology and Stroke Medicine, Yokohama City University Graduate School of Medicine, Yokohama, Japan
| | - Naomichi Matsumoto
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan.
| | - Takeshi Mizuguchi
- Department of Human Genetics, Yokohama City University Graduate School of Medicine, 3-9 Fukuura, Kanazawa-ku, Yokohama, 236-0004, Japan.
| |
Collapse
|
50
|
Yu Y, Chen L, Miao X, Li SC. SpecHap: a diploid phasing algorithm based on spectral graph theory. Nucleic Acids Res 2021; 49:e114. [PMID: 34403470 PMCID: PMC8565328 DOI: 10.1093/nar/gkab709] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/16/2020] [Revised: 07/25/2021] [Accepted: 08/02/2021] [Indexed: 11/30/2022] Open
Abstract
Haplotype phasing plays an important role in understanding the genetic data of diploid eukaryotic organisms. Different sequencing technologies (such as next-generation sequencing or third-generation sequencing) produce various genetic data that require haplotype assembly. Although multiple diploid haplotype phasing algorithms exist, only a few will work equally well across all sequencing technologies. In this work, we propose SpecHap, a novel haplotype assembly tool that leverages spectral graph theory. On both in silico and whole-genome sequencing datasets, SpecHap consumed less memory and required less CPU time, yet achieved comparable accuracy with state-of-art methods across all the test instances, which comprises sequencing data from next-generation sequencing, linked-reads, high-throughput chromosome conformation capture, PacBio single-molecule real-time, and Oxford Nanopore long-reads. Furthermore, SpecHap successfully phased an individual Ambystoma mexicanum, a species with gigantic diploid genomes, within 6 CPU hours and 945MB peak memory usage, while other tools failed to yield results either due to memory overflow (40GB) or time limit exceeded (5 days). Our results demonstrated that SpecHap is scalable, efficient, and accurate for diploid phasing across many sequencing platforms.
Collapse
Affiliation(s)
- Yonghan Yu
- Computer Science, City University of Hong Kong, Kowloon, Hong Kong 999077, China
| | - Lingxi Chen
- Computer Science, City University of Hong Kong, Kowloon, Hong Kong 999077, China
| | - Xinyao Miao
- Computer Science, City University of Hong Kong, Kowloon, Hong Kong 999077, China
| | - Shuai Cheng Li
- Computer Science, City University of Hong Kong, Kowloon, Hong Kong 999077, China
| |
Collapse
|