1
|
Tafazoli A, Hemmati M, Rafigh M, Alimardani M, Khaghani F, Korostyński M, Karnes JH. Leveraging long-read sequencing technologies for pharmacogenomic testing: applications, analytical strategies, challenges, and future perspectives. Front Genet 2025; 16:1435416. [PMID: 40370700 PMCID: PMC12075302 DOI: 10.3389/fgene.2025.1435416] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/20/2024] [Accepted: 04/07/2025] [Indexed: 05/16/2025] Open
Abstract
Long-read sequencing (LRS) was introduced as the third generation of next-generation sequencing technologies with a high accuracy rate in genomic variant identification for some of its platforms. Due to the structural complexity of many pharmacogenes, the presence of rare variants, and the limitations of genotyping and short-read sequencing approaches in detecting pharmacovariants, LRS methods are likely to become increasingly utilized in the near future. In this review, we aim to provide a comprehensive discussion of current and future applications of long-read genotyping methods by introducing the opportunities and advantages as well as the challenges and disadvantages of state-of-the-art LRS platforms for the implementation of pharmacogenomic tests in clinical and research settings. New approaches to data processing, as well as the challenges and pitfalls of performing such tests in daily practice, will be explored in detail. We provide references to resources for those who are interested or intend to employ LRS in pharmacogenomics screening, both in clinical and research settings.
Collapse
Affiliation(s)
- Alireza Tafazoli
- Department of Pharmacology and Toxicology, University of Toronto, Toronto, ON, Canada
| | - Mahboobeh Hemmati
- Department of Medical Genetics and Molecular Medicine, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Mahboobeh Rafigh
- Medical Genetics Research Center, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Maliheh Alimardani
- Department of Medical Genetics and Molecular Medicine, School of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
- Student Research Committee, Faculty of Medicine, Mashhad University of Medical Sciences, Mashhad, Iran
| | - Faeze Khaghani
- Department of Pharmaceutical Biotechnology, School of Pharmacy, Guilan University of Medical Sciences, Rasht, Iran
| | - Michał Korostyński
- Laboratory of Pharmacogenomics, Department of Molecular Neuropharmacology, Maj Institute of Pharmacology Polish Academy of Sciences, Kraków, Poland
| | - Jason H. Karnes
- Department of Pharmacy Practice and Science, University of Arizona R. Ken Coit College of Pharmacy, Tucson, AZ, United States
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, TN, United States
| |
Collapse
|
2
|
García-González P, Puerta R, Cano A, Olivè C, Marquié M, Valero S, Rosende-Roca M, Alegret M, Sanz P, Brosseron F, Martino-Adami P, de Rojas I, Heneka M, Ramírez A, Navarro A, Sáez ME, Tárraga L, Cavazos JE, Boada M, Fernandez MV, Cabrera-Socorro A, Ruiz A. APOE Haplotype Phasing Using ONT Long-Read Sequencing Reveals Two Common ε3 and ε4 intragenic haplotypes in the Spanish Population. MEDRXIV : THE PREPRINT SERVER FOR HEALTH SCIENCES 2025:2025.03.25.25324541. [PMID: 40196265 PMCID: PMC11974914 DOI: 10.1101/2025.03.25.25324541] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Subscribe] [Scholar Register] [Indexed: 04/09/2025]
Abstract
Background The apolipoprotein E (APOE) gene is a key genetic determinant of Alzheimer's disease (AD) risk, with the ε4 allele significantly increasing susceptibility. While the pathogenic effects of the ε4 allele are well established, the functional impact of distinct haplotype configurations within the broader ε3 and ε4 backgrounds remains poorly understood. This study investigates the role of intragenic sub haplotypes in modulating APOE expression and their potential influence on AD progression. Methods We utilized Oxford Nanopore Technology (ONT) long-read sequencing to phase variants within a 4-kilobase comprising the APOE locus in a cohort of 1,265 individuals with known APOE genotypes. We evaluated the impact of the identified intragenic haplotypes on APOE protein levels in cerebrospinal fluid (CSF) using the Olink platform, adjusting for demographic and molecular covariates. Statistical modeling was employed to assess the independent effects of these haplotypes alongside traditional APOE genotypes. Additionally, their influence on dementia progression in mild cognitive impairment (MCI) subjects was analyzed using adjusted Cox proportional hazards models. Results Our analysis identified 48 Single Nucleotide Variants (SNVs) within a 4-kilobase region containing the APOE gene, including nine novel variants. Phasing of variants within the APOE locus revealed 59 unique haplotypes in the Spanish population, which were grouped into five major haplogroups-ε2, ε3A, ε3B, ε4A, and ε4B-including two common haplogroups for each of the ε3 and ε4 isoforms. The ε4A haplogroup was associated with a significant decrease in APOE ε4 protein levels in CSF (p = 0.004), suggesting a regulatory mechanism that may mitigate the toxic gain-of-function effect typically attributed to the ε4 allele. Conversely, the ε3B haplogroup was linked to increased APOE ε3 protein levels in ε3/ε4 carriers (p = 0.025), potentially serving a compensatory role.These effects were independent of overall APOE genotype and remained significant after adjusting for covariates. Both haplogroups (ε4A and ε3B) demonstrated protective effects in the progression from MCI to dementia, underscoring their potential relevance in Alzheimer's disease. Conclusions This study provides new insights into the intragenic allelic variability of the APOE gene, demonstrating that intragenic APOE haplogroups within the ε3 and ε4 backgrounds can modulate APOE isoform expression in ways that might modulate AD. Our findings highlight the importance of considering haplotype-specific effects when interpreting the functional impact of APOE and in designing targeted therapeutic strategies. Further research is needed to explore the broader regulatory network of the APOE locus and its interaction with neighboring loci in the 19q13 region.
Collapse
Affiliation(s)
- Pablo García-González
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
- PhD program in Biotecnology, Faculty of Pharmacy and Food Sciences, University of Barcelona, 08028- Barcelona, Spain
| | - Raquel Puerta
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- PhD program in Biotecnology, Faculty of Pharmacy and Food Sciences, University of Barcelona, 08028- Barcelona, Spain
| | - Amanda Cano
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
| | - Claudia Olivè
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
| | - Marta Marquié
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
| | - Sergi Valero
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
| | - Maitee Rosende-Roca
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
| | - Montserrat Alegret
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
| | - Pilar Sanz
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
| | - Frederik Brosseron
- Universitätsklinikum Bonn & Deutsches Zentrum für Neurodegenerative Erkrankungen (DZNE) Bonn, Germany
| | - Pamela Martino-Adami
- Division of Neurogenetics and Molecular Psychiatry, Department of Psychiatry and Psychotherapy, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
| | - Itziar de Rojas
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Eschsur-Alzette/Belvaux, Luxembourg
| | - Michael Heneka
- Luxembourg Centre for Systems Biomedicine (LCSB), University of Luxembourg, Eschsur-Alzette/Belvaux, Luxembourg
| | - Alfredo Ramírez
- Division of Neurogenetics and Molecular Psychiatry, Department of Psychiatry and Psychotherapy, Faculty of Medicine and University Hospital Cologne, University of Cologne, 50937 Cologne, Germany
- Department of Neurodegenerative Diseases and Geriatric Psychiatry, University Hospital Bonn, Medical Faculty, 53127 Bonn, Germany
- German Center for Neurodegenerative Diseases (DZNE), 53127 Bonn, Germany
- Cluster of Excellence Cellular Stress Responses in Aging-associated Diseases (CECAD), University of Cologne, 50931 Cologne, Germany
- Department of Psychiatry and Glenn, Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, San An TX USA
| | - Arcadi Navarro
- Institució Catalana de Recerca i Estudis Avançats (ICREA) and Universitat Pompeu Fabra. Pg. Lluís Companys 23, 08010, Barcelona, Spain
- IBE, Institute of Evolutionary Biology (UPF-CSIC), Department of Medicine and Life Sciences, Universitat Pompeu Fabra. PRBB, C. Doctor Aiguader N88, 08003 Barcelona, Spain
- Center for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, Av. Doctor Aiguader, N88, 08003 Barcelona, Spain
- BarcelonaBeta Brain Research Center, Pasqual Maragall Foundation, C. Wellington 30, 08005, Barcelona, Spain
| | - María Eugenia Sáez
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CAEBI, Centro Andaluz de Estudios Bioinformáticos, Sevilla, Spain
| | - Lluís Tárraga
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
| | - José E. Cavazos
- South Texas Medical Science Training Program, University of Texas Health San Antonio, San Antonio, TX, USA
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, San Antonio, TX, USA
| | - Mercè Boada
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
| | | | | | - Agustín Ruiz
- ACE Alzheimer Center Barcelona – Universitat Internacional de Catalunya, Barcelona, Spain
- CIBERNED, Network Center for Biomedical Research in Neurodegenerative Diseases, National Institute of Health Carlos III, Madrid, Spain
- Glenn Biggs Institute for Alzheimer’s and Neurodegenerative Diseases, San Antonio, TX, USA
- Department of Microbiology, Immunology and Molecular Genetics. Long School of Medicine. University of Texas Health Science Center, San Antonio, TX, USA
| |
Collapse
|
3
|
El-Lagta N, Truong L, Ayora F, Mobegi F, Bruce S, Martinez P, D'Orsogna L, De Santis D. Revolutionising High Resolution HLA Genotyping for Transplant Assessment: Validation, Implementation and Challenges of Oxford Nanopore Technologies' Q20 + Sequencing. HLA 2024; 104:e15725. [PMID: 39435968 DOI: 10.1111/tan.15725] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/29/2024] [Revised: 09/24/2024] [Accepted: 10/01/2024] [Indexed: 10/23/2024]
Abstract
The advent of third-generation sequencing (TGS) represents a significant shift in the field of genetic sequencing, enabling single-molecule sequencing to overcome limitations of short-read NGS platforms. Several studies have assessed the utilisation of TGS in HLA genotyping, though many of these studies have described the high error rate as an obstacle to achieving a robust and highly accurate HLA typing assay. In 2021, Oxford Nanopore Technologies (ONT) released the high-accuracy sequencing Kit 14 and the MinION flow cell model R10.4.1, which were reported to achieve sequencing accuracies up to 99%. The aim of this study was to validate this novel high-accuracy sequencing kit for HLA genotyping coupled with a full-gene HLA PCR assay. Comparison with historical data obtained using legacy flow cell models such as R9.4, R10.3 and R10.4 was also done to assess progressive improvement in sequencing performance with each sequential release. The workflow was validated based on data throughput, sequence quality and accuracy, and HLA genotyping resolution. An initial validation was performed using an internal reference panel of 42 samples representing common HLA allele groups, followed by an analysis of data obtained from 111 sequencing batch runs since the implementation, to assess assay performance and define quality control metrics to assess inter-run variability and monitor quality. Furthermore, challenges arising from MinION flow cell stability and use, and assessment of barcode contamination are discussed. The findings of this study highlight advantages of ONT sequencing kit 14/R10.4.1 for HLA genotyping and the implementation considerations for the routine diagnostic HLA laboratory.
Collapse
Affiliation(s)
- Naser El-Lagta
- Department of Clinical Immunology, PathWest Laboratory Medicine WA, Fiona Stanley Hospital, Perth, Australia
- School of Biomedical Sciences, the University of Western Australia, Perth, Australia
| | - Linh Truong
- Department of Clinical Immunology, PathWest Laboratory Medicine WA, Fiona Stanley Hospital, Perth, Australia
| | - Felipe Ayora
- Research and Advanced Computing, BizData, Wellington, New Zealand
| | - Fredrick Mobegi
- Department of Clinical Immunology, PathWest Laboratory Medicine WA, Fiona Stanley Hospital, Perth, Australia
- School of Biomedical Sciences, the University of Western Australia, Perth, Australia
| | - Samuel Bruce
- Department of Clinical Immunology, PathWest Laboratory Medicine WA, Fiona Stanley Hospital, Perth, Australia
| | - Patricia Martinez
- Department of Clinical Immunology, PathWest Laboratory Medicine WA, Fiona Stanley Hospital, Perth, Australia
- The University of Western Australia Medical School, The University of Western Australia, Perth, Australia
| | - Lloyd D'Orsogna
- Department of Clinical Immunology, PathWest Laboratory Medicine WA, Fiona Stanley Hospital, Perth, Australia
- The University of Western Australia Medical School, The University of Western Australia, Perth, Australia
| | - Dianne De Santis
- Department of Clinical Immunology, PathWest Laboratory Medicine WA, Fiona Stanley Hospital, Perth, Australia
- School of Biomedical Sciences, the University of Western Australia, Perth, Australia
| |
Collapse
|
4
|
Ambrodji A, Sadlon A, Amstutz U, Hoch D, Berger MD, Bastian S, Offer SM, Largiadèr CR. Approach for Phased Sequence-Based Genotyping of the Critical Pharmacogene Dihydropyrimidine Dehydrogenase ( DPYD). Int J Mol Sci 2024; 25:7599. [PMID: 39062841 PMCID: PMC11277299 DOI: 10.3390/ijms25147599] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2024] [Revised: 07/04/2024] [Accepted: 07/08/2024] [Indexed: 07/28/2024] Open
Abstract
Pre-treatment genotyping of four well-characterized toxicity risk-variants in the dihydropyrimidine dehydrogenase gene (DPYD) has been widely implemented in Europe to prevent serious adverse effects in cancer patients treated with fluoropyrimidines. Current genotyping practices are largely limited to selected commonly studied variants and are unable to determine phasing when more than one variant allele is detected. Recent evidence indicates that common DPYD variants modulate the functional impact of deleterious variants in a phase-dependent manner, where a cis- or a trans-configuration translates into different toxicity risks and dosing recommendations. DPYD is a large gene with 23 exons spanning nearly a mega-base of DNA, making it a challenging candidate for full-gene sequencing in the diagnostic setting. Herein, we present a time- and cost-efficient long-read sequencing approach for capturing the complete coding region of DPYD. We demonstrate that this method can reliably produce phased genotypes, overcoming a major limitation with current methods. This method was validated using 21 subjects, including two cancer patients, each of whom carried multiple DPYD variants. Genotype assignments showed complete concordance with conventional approaches. Furthermore, we demonstrate that the method is robust to technical challenges inherent in long-range sequencing of PCR products, including reference alignment bias and PCR chimerism.
Collapse
Affiliation(s)
- Alisa Ambrodji
- Department of Clinical Chemistry, Inselspital, University Hospital of Bern, University of Bern, INO-F, 3010 Bern, Switzerland; (A.A.); (A.S.); (U.A.)
- Graduate School for Cellular and Biomedical Sciences, University of Bern, 3012 Bern, Switzerland
| | - Angélique Sadlon
- Department of Clinical Chemistry, Inselspital, University Hospital of Bern, University of Bern, INO-F, 3010 Bern, Switzerland; (A.A.); (A.S.); (U.A.)
| | - Ursula Amstutz
- Department of Clinical Chemistry, Inselspital, University Hospital of Bern, University of Bern, INO-F, 3010 Bern, Switzerland; (A.A.); (A.S.); (U.A.)
| | - Dennis Hoch
- Department of Medical Oncology, Inselspital, University Hospital of Bern, 3010 Bern, Switzerland; (D.H.); (M.D.B.)
| | - Martin D. Berger
- Department of Medical Oncology, Inselspital, University Hospital of Bern, 3010 Bern, Switzerland; (D.H.); (M.D.B.)
| | - Sara Bastian
- Department of Medical Oncology, Cantonal Hospital Graubünden, 7000 Chur, Switzerland;
| | - Steven M. Offer
- Department of Pathology, Carver College of Medicine, University of Iowa, Iowa City, IA 52242, USA;
| | - Carlo R. Largiadèr
- Department of Clinical Chemistry, Inselspital, University Hospital of Bern, University of Bern, INO-F, 3010 Bern, Switzerland; (A.A.); (A.S.); (U.A.)
| |
Collapse
|
5
|
Mikhaylova V, Rzepka M, Kawamura T, Xia Y, Chang PL, Zhou S, Paasch A, Pham L, Modi N, Yao L, Perez-Agustin A, Pagans S, Boles TC, Lei M, Wang Y, Garcia-Bassets I, Chen Z. Targeted phasing of 2-200 kilobase DNA fragments with a short-read sequencer and a single-tube linked-read library method. Sci Rep 2024; 14:7988. [PMID: 38580715 PMCID: PMC10997766 DOI: 10.1038/s41598-024-58733-0] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2023] [Accepted: 04/02/2024] [Indexed: 04/07/2024] Open
Abstract
In the human genome, heterozygous sites refer to genomic positions with a different allele or nucleotide variant on the maternal and paternal chromosomes. Resolving these allelic differences by chromosomal copy, also known as phasing, is achievable on a short-read sequencer when using a library preparation method that captures long-range genomic information. TELL-Seq is a library preparation that captures long-range genomic information with the aid of molecular identifiers (barcodes). The same barcode is used to tag the reads derived from the same long DNA fragment within a range of up to 200 kilobases (kb), generating linked-reads. This strategy can be used to phase an entire genome. Here, we introduce a TELL-Seq protocol developed for targeted applications, enabling the phasing of enriched loci of varying sizes, purity levels, and heterozygosity. To validate this protocol, we phased 2-200 kb loci enriched with different methods: CRISPR/Cas9-mediated excision coupled with pulse-field electrophoresis for the longest fragments, CRISPR/Cas9-mediated protection from exonuclease digestion for mid-size fragments, and long PCR for the shortest fragments. All selected loci have known clinical relevance: BRCA1, BRCA2, MLH1, MSH2, MSH6, APC, PMS2, SCN5A-SCN10A, and PKI3CA. Collectively, the analyses show that TELL-Seq can accurately phase 2-200 kb targets using a short-read sequencer.
Collapse
Affiliation(s)
| | - Madison Rzepka
- Universal Sequencing Technology Corp., Carlsbad, CA, 92011, USA
| | | | - Yu Xia
- Universal Sequencing Technology Corp., Carlsbad, CA, 92011, USA
| | - Peter L Chang
- Universal Sequencing Technology Corp., Carlsbad, CA, 92011, USA
| | | | - Amber Paasch
- Universal Sequencing Technology Corp., Carlsbad, CA, 92011, USA
| | - Long Pham
- Universal Sequencing Technology Corp., Carlsbad, CA, 92011, USA
| | - Naisarg Modi
- Universal Sequencing Technology Corp., Carlsbad, CA, 92011, USA
| | - Likun Yao
- Department of Medicine, University of California, San Diego, La Jolla, CA, 92093, USA
| | - Adrian Perez-Agustin
- Department of Medical Sciences, School of Medicine, University of Girona, Girona, Spain
| | - Sara Pagans
- Department of Medical Sciences, School of Medicine, University of Girona, Girona, Spain
| | | | - Ming Lei
- Universal Sequencing Technology Corp., Canton, MA, 02021, USA
| | - Yong Wang
- Universal Sequencing Technology Corp., Canton, MA, 02021, USA
| | | | - Zhoutao Chen
- Universal Sequencing Technology Corp., Carlsbad, CA, 92011, USA.
| |
Collapse
|
6
|
Backlund MP, Repo P, Kangas H, Donner K, Sankila EM, Krootila J, Paavo M, Wartiovaara K, Kivelä TT, Turunen JA. Characterisation of a LINE-1 Insertion in the RP1 Gene by Targeted Adaptive Nanopore Sequencing in a Family with Retinitis Pigmentosa. Hum Mutat 2024; 2024:6580561. [PMID: 40225919 PMCID: PMC11919084 DOI: 10.1155/2024/6580561] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2023] [Revised: 01/24/2024] [Accepted: 01/31/2024] [Indexed: 04/15/2025]
Abstract
Retinitis pigmentosa (RP) is a group of inherited degenerative retinal disorders affecting more than 1.5 million people worldwide. For 30-50% of individuals with RP, the genetic cause remains unresolved by current clinical diagnostic gene panels. It is likely explained by variants in novel RP-associated genes or noncoding regulatory regions, or by complex genetic alterations such as large structural variants. Recent developments in long-read sequencing techniques have opened an opportunity for efficient analysis of complex genetic variants. We analysed a Finnish family with dominantly inherited RP affecting six individuals in three generations. Two affected individuals underwent a comprehensive clinical examination in combination with a clinical diagnostic gene panel, followed by whole exome sequencing in our laboratory. They exhibited typical signs of RP, yet initial sequence analysis found no causative variants. Reanalysis of the sequencing data detected a LINE-1 (L1) retrotransposon insertion of unknown size in exon 4 of the RP1 axonemal microtubule-associated (RP1) gene. The large chimeric L1 insertion that segregated with the disease was further characterised using targeted adaptive nanopore sequencing of RP1, allowing us to identify a 5.6 kb L1 transposable element insertion in RP1 as the cause of RP in this family with dominantly inherited RP.
Collapse
Affiliation(s)
- Michael P. Backlund
- Eye Genetics Group, Folkhälsan Research Center, Biomedicum Helsinki, Helsinki, Finland
| | - Pauliina Repo
- Eye Genetics Group, Folkhälsan Research Center, Biomedicum Helsinki, Helsinki, Finland
- Department of Ophthalmology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Harri Kangas
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Kati Donner
- Institute for Molecular Medicine Finland (FIMM), Helsinki Institute of Life Science (HiLIFE), University of Helsinki, Helsinki, Finland
| | - Eeva-Marja Sankila
- Department of Ophthalmology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Julia Krootila
- Eye Genetics Group, Folkhälsan Research Center, Biomedicum Helsinki, Helsinki, Finland
| | - Maarjaliis Paavo
- Department of Ophthalmology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Kirmo Wartiovaara
- Department of Clinical Genetics, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Tero T. Kivelä
- Department of Ophthalmology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| | - Joni A. Turunen
- Eye Genetics Group, Folkhälsan Research Center, Biomedicum Helsinki, Helsinki, Finland
- Department of Ophthalmology, University of Helsinki and Helsinki University Hospital, Helsinki, Finland
| |
Collapse
|
7
|
Damaraju N, Miller AL, Miller DE. Long-Read DNA and RNA Sequencing to Streamline Clinical Genetic Testing and Reduce Barriers to Comprehensive Genetic Testing. J Appl Lab Med 2024; 9:138-150. [PMID: 38167773 DOI: 10.1093/jalm/jfad107] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/18/2023] [Accepted: 10/24/2023] [Indexed: 01/05/2024]
Abstract
BACKGROUND Obtaining a precise molecular diagnosis through clinical genetic testing provides information about disease prognosis or progression, allows accurate counseling about recurrence risk, and empowers individuals to benefit from precision therapies or take part in N-of-1 trials. Unfortunately, more than half of individuals with a suspected Mendelian condition remain undiagnosed after a comprehensive clinical evaluation, and the results of any individual clinical genetic test ordered during a typical evaluation may take weeks or months to return. Furthermore, commonly used technologies, such as short-read sequencing, are limited in the types of disease-causing variation they can identify. New technologies, such as long-read sequencing (LRS), are poised to solve these problems. CONTENT Recent technical advances have improved accuracy, increased throughput, and decreased the costs of commercially available LRS technologies. This has resolved many historical concerns about the use of LRS in the clinical environment and opened the door to widespread clinical adoption of LRS. Here, we review LRS technology, how it has been used in the research setting to clarify complex variants or identify disease-causing variation missed by prior clinical testing, and how it may be used clinically in the near future. SUMMARY LRS is unique in that, as a single data source, it has the potential to replace nearly every other clinical genetic test offered today. When analyzed in a stepwise fashion, LRS will simplify laboratory processes, reduce barriers to comprehensive genetic testing, increase the rate of genetic diagnoses, and shorten the amount of time required to make a molecular diagnosis.
Collapse
Affiliation(s)
- Nikhita Damaraju
- Institute for Public Health Genetics, University of Washington, Seattle, WA 98195, United States
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, United States
| | - Angela L Miller
- Department of Pediatrics, University of Washington, Seattle, WA 98195, United States
| | - Danny E Miller
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, WA 98195, United States
- Department of Pediatrics, University of Washington, Seattle, WA 98195, United States
- Brotman Baty Institute for Precision Medicine, University of Washington, Seattle, WA 98195, United States
| |
Collapse
|
8
|
Deserranno K, Tilleman L, Rubben K, Deforce D, Van Nieuwerburgh F. Targeted haplotyping in pharmacogenomics using Oxford Nanopore Technologies' adaptive sampling. Front Pharmacol 2023; 14:1286764. [PMID: 38026945 PMCID: PMC10679755 DOI: 10.3389/fphar.2023.1286764] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2023] [Accepted: 10/30/2023] [Indexed: 12/01/2023] Open
Abstract
Pharmacogenomics (PGx) studies the impact of interindividual genomic variation on drug response, allowing the opportunity to tailor the dosing regimen for each patient. Current targeted PGx testing platforms are mainly based on microarray, polymerase chain reaction, or short-read sequencing. Despite demonstrating great value for the identification of single nucleotide variants (SNVs) and insertion/deletions (INDELs), these assays do not permit identification of large structural variants, nor do they allow unambiguous haplotype phasing for star-allele assignment. Here, we used Oxford Nanopore Technologies' adaptive sampling to enrich a panel of 1,036 genes with well-documented PGx relevance extracted from the Pharmacogenomics Knowledge Base (PharmGKB). By evaluating concordance with existing truth sets, we demonstrate accurate variant and star-allele calling for five Genome in a Bottle reference samples. We show that up to three samples can be multiplexed on one PromethION flow cell without a significant drop in variant calling performance, resulting in 99.35% and 99.84% recall and precision for the targeted variants, respectively. This work advances the use of nanopore sequencing in clinical PGx settings.
Collapse
Affiliation(s)
| | | | | | | | - Filip Van Nieuwerburgh
- Laboratory of Pharmaceutical Biotechnology, Faculty of Pharmaceutical Sciences, Ghent University, Ghent, Belgium
| |
Collapse
|
9
|
McClinton B, Watson CM, Crinnion LA, McKibbin M, Ali M, Inglehearn CF, Toomes C. Haplotyping Using Long-Range PCR and Nanopore Sequencing to Phase Variants: Lessons Learned From the ABCA4 Locus. J Transl Med 2023; 103:100160. [PMID: 37088464 DOI: 10.1016/j.labinv.2023.100160] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Revised: 01/11/2023] [Accepted: 04/17/2023] [Indexed: 04/25/2023] Open
Abstract
Short-read next-generation sequencing has revolutionized our ability to identify variants underlying inherited diseases; however, it does not allow the phasing of variants to clarify their diagnostic interpretation. The advent of widespread, increasingly accurate long-read sequencing has opened up new applications not currently available through short-read next-generation sequencing. One such use is the ability to phase variants to clarify their diagnostic interpretation and to investigate the increasingly prevalent role of cis-acting variants in the pathogenesis of the inherited disease, so-called complex alleles. Complex alleles are becoming an increasingly prevalent part of the study of genes associated with inherited diseases, for example, in ABCA4-related diseases. We sought to establish a cost-effective method to phase contiguous segments of the 130-kb ABCA4 locus by long-read sequencing of overlapping amplification products. Using the comprehensively characterized CEPH sample, NA12878, we verified the accuracy and robustness of our assay. However, in-field assessment of its utility using clinical test cases was hampered by the paucity and distribution of identified variants and by PCR chimerism, particularly where the number of PCR cycles was high. Despite this, we were able to construct robust phase blocks of up to 94.9 kb, representing 73% of the ABCA4 locus. We conclude that, although haplotype analysis of variants located within discrete amplification products was robust and informative, the stitching together of larger phase blocks using overlapping single-molecule reads remained practically challenging.
Collapse
Affiliation(s)
- Benjamin McClinton
- Leeds Institute of Medical Research, University of Leeds, St James's University Hospital, Leeds, UK
| | - Christopher M Watson
- Leeds Institute of Medical Research, University of Leeds, St James's University Hospital, Leeds, UK; North East and Yorkshire Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, UK
| | - Laura A Crinnion
- Leeds Institute of Medical Research, University of Leeds, St James's University Hospital, Leeds, UK; North East and Yorkshire Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, UK
| | - Martin McKibbin
- Leeds Institute of Medical Research, University of Leeds, St James's University Hospital, Leeds, UK; Department of Ophthalmology, St. James's University Hospital, Leeds, UK
| | - Manir Ali
- Leeds Institute of Medical Research, University of Leeds, St James's University Hospital, Leeds, UK
| | - Chris F Inglehearn
- Leeds Institute of Medical Research, University of Leeds, St James's University Hospital, Leeds, UK
| | - Carmel Toomes
- Leeds Institute of Medical Research, University of Leeds, St James's University Hospital, Leeds, UK.
| |
Collapse
|
10
|
Mikhaylova V, Rzepka M, Kawamura T, Xia Y, Chang PL, Zhou S, Pham L, Modi N, Yao L, Perez-Agustin A, Pagans S, Boles TC, Lei M, Wang Y, Garcia-Bassets I, Chen Z. Targeted Phasing of 2-200 Kilobase DNA Fragments with a Short-Read Sequencer and a Single-Tube Linked-Read Library Method. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.03.05.531179. [PMID: 36945366 PMCID: PMC10028795 DOI: 10.1101/2023.03.05.531179] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/11/2023]
Abstract
In the human genome, heterozygous sites are genomic positions with different alleles inherited from each parent. On average, there is a heterozygous site every 1-2 kilobases (kb). Resolving whether two alleles in neighboring heterozygous positions are physically linked-that is, phased-is possible with a short-read sequencer if the sequencing library captures long-range information. TELL-Seq is a library preparation method based on millions of barcoded micro-sized beads that enables instrument-free phasing of a whole human genome in a single PCR tube. TELL-Seq incorporates a unique molecular identifier (barcode) to the short reads generated from the same high-molecular-weight (HMW) DNA fragment (known as 'linked-reads'). However, genome-scale TELL-Seq is not cost-effective for applications focusing on a single locus or a few loci. Here, we present an optimized TELL-Seq protocol that enables the cost-effective phasing of enriched loci (targets) of varying sizes, purity levels, and heterozygosity. Targeted TELL-Seq maximizes linked-read efficiency and library yield while minimizing input requirements, fragment collisions on microbeads, and sequencing burden. To validate the targeted protocol, we phased seven 180-200 kb loci enriched by CRISPR/Cas9-mediated excision coupled with pulse-field electrophoresis, four 20 kb loci enriched by CRISPR/Cas9-mediated protection from exonuclease digestion, and six 2-13 kb loci amplified by PCR. The selected targets have clinical and research relevance (BRCA1, BRCA2, MLH1, MSH2, MSH6, APC, PMS2, SCN5A-SCN10A, and PKI3CA). These analyses reveal that targeted TELL-Seq provides a reliable way of phasing allelic variants within targets (2-200 kb in length) with the low cost and high accuracy of short-read sequencing.
Collapse
Affiliation(s)
| | - Madison Rzepka
- Universal Sequencing Technology Corp., Carlsbad, CA 92011, USA
| | | | - Yu Xia
- Universal Sequencing Technology Corp., Carlsbad, CA 92011, USA
| | - Peter L. Chang
- Universal Sequencing Technology Corp., Carlsbad, CA 92011, USA
| | | | - Long Pham
- Universal Sequencing Technology Corp., Carlsbad, CA 92011, USA
| | - Naisarg Modi
- Universal Sequencing Technology Corp., Carlsbad, CA 92011, USA
| | - Likun Yao
- Department of Medicine, University of California, San Diego, La Jolla, CA 92093 USA
| | - Adrian Perez-Agustin
- Department of Medical Sciences, School of Medicine, University of Girona, Girona, Spain
| | - Sara Pagans
- Department of Medical Sciences, School of Medicine, University of Girona, Girona, Spain
| | | | - Ming Lei
- Universal Sequencing Technology Corp., Canton, MA 02021, USA
| | - Yong Wang
- Universal Sequencing Technology Corp., Canton, MA 02021, USA
| | | | - Zhoutao Chen
- Universal Sequencing Technology Corp., Carlsbad, CA 92011, USA
| |
Collapse
|
11
|
Bai H, Zhang X, Bush WS. Pharmacogenomic and Statistical Analysis. Methods Mol Biol 2023; 2629:305-330. [PMID: 36929083 DOI: 10.1007/978-1-0716-2986-4_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023]
Abstract
Genetic variants can alter response to drugs and other therapeutic interventions. The study of this phenomenon, called pharmacogenomics, is similar in many ways to other types of genetic studies but has distinct methodological and statistical considerations. Genetic variants involved in the processing of exogenous compounds exhibit great diversity and complexity, and the phenotypes studied in pharmacogenomics are also more complex than typical genetic studies. In this chapter, we review basic concepts in pharmacogenomic study designs, data generation techniques, statistical analysis approaches, and commonly used methods and briefly discuss the ultimate translation of findings to clinical care.
Collapse
Affiliation(s)
- Haimeng Bai
- Department of Population and Quantitative Health Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
- Department of Nutrition, Case Western Reserve University School of Medicine, Cleveland, OH, USA
| | - Xueyi Zhang
- Department of Population and Quantitative Health Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA
| | - William S Bush
- Department of Population and Quantitative Health Sciences, Cleveland Institute for Computational Biology, Case Western Reserve University, Cleveland, OH, USA.
| |
Collapse
|
12
|
Rubben K, Tilleman L, Deserranno K, Tytgat O, Deforce D, Van Nieuwerburgh F. Cas9 targeted nanopore sequencing with enhanced variant calling improves CYP2D6-CYP2D7 hybrid allele genotyping. PLoS Genet 2022; 18:e1010176. [PMID: 36149915 PMCID: PMC9534437 DOI: 10.1371/journal.pgen.1010176] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2022] [Revised: 10/05/2022] [Accepted: 09/10/2022] [Indexed: 11/19/2022] Open
Abstract
CYP2D6 is a very important pharmacogene as it is responsible for the metabolization or bioactivation of 20 to 30% of the clinically used drugs. However, despite its relatively small length of only 4.4 kb, it is one of the most challenging pharmacogenes to genotype due to the high similarity with its neighboring pseudogenes and the frequent occurrence of CYP2D6-CYP2D7 hybrids. Unfortunately, most current genotyping methods are therefore not able to correctly determine the complete CYP2D6-CYP2D7 sequence. Therefore, we developed a genotyping assay to generate complete allele-specific consensus sequences of complex regions by optimizing the PCR-free nanopore Cas9-targeted sequencing (nCATS) method combined with adaptive sequencing, and developing a new comprehensive long read genotyping (CoLoRGen) pipeline. The CoLoRGen pipeline first generates consensus sequences of both alleles and subsequently determines both large structural and small variants to ultimately assign the correct star-alleles. In reference samples, our genotyping assay confirms the presence of CYP2D6-CYP2D7 large structural variants, single nucleotide variants (SNVs), and small insertions and deletions (INDELs) that go undetected by most current assays. Moreover, our results provide direct evidence that the CYP2D6 genotype of the NA12878 DNA should be updated to include the CYP2D6-CYP2D7 *68 hybrid and several additional single nucleotide variants compared to existing references. Ultimately, the nCATS-CoLoRGen genotyping assay additionally allows for more accurate gene function predictions by enabling the possibility to detect and phase de novo mutations in addition to known large structural and small variants. During the last decades, the usefulness of personalized medicine has become increasingly apparent. Directly linked to that is the need for accurate genotyping assays to determine the pharmacogenetic profile of patients. Continuing research has led to the development of genotyping assays that perform quite robustly. However, complex genes remain an issue when it comes to determining the complete sequence correctly. An example of such a complex but very important pharmacogene is CYP2D6. Therefore, we developed a genotyping assay in an attempt to generate complete allele-specific consensus sequences of CYP2D6, by optimizing a targeted amplification-free long-read sequencing method and developing a new analysis pipeline. In reference samples, we showed that our genotyping assay performed accurately and confirmed the presence of variants that go undetected by most current assays. However, the implementation of this assay in practice is still hampered as the selected enrichment strategies inherently lead to a low percentage of on-target reads, resulting in low on-target sequencing depths. Further optimization and validation of the assay is thus needed, but definitely worth considering for follow-up research as we already demonstrated the added value for generating more complete genotypes, which on its turn will result in more accurate gene function predictions.
Collapse
Affiliation(s)
- Kaat Rubben
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - Laurentijn Tilleman
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - Koen Deserranno
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | - Olivier Tytgat
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
- Department of Life Science Technologies, Imec, Leuven, Belgium
| | - Dieter Deforce
- Laboratory of Pharmaceutical Biotechnology, Ghent University, Ghent, Belgium
| | | |
Collapse
|
13
|
Intragenomic variation in nuclear ribosomal markers and its implication in species delimitation, identification and barcoding in fungi. FUNGAL BIOL REV 2022. [DOI: 10.1016/j.fbr.2022.04.002] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022]
|
14
|
Watson CM, Jackson L, Crinnion LA, Bonthron DT, Sheridan E. Long-read sequencing to resolve the parent of origin of a de novo pathogenic UBE3A variant. J Med Genet 2022; 59:1082-1086. [PMID: 35414530 DOI: 10.1136/jmedgenet-2021-108314] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/19/2022] [Accepted: 03/24/2022] [Indexed: 11/04/2022]
Abstract
BackgroundThe ever-increasing capacity of short-read sequencing instruments is driving the adoption of whole genome sequencing (WGS) as a universal approach to the diagnosis of rare genetic disorders. However, many challenging genomic regions remain, for which alternative technologies must be deployed in order to address the clinical question satisfactorily.MethodsHere we report the use of long-read sequencing to resolve ambiguity over a suspected diagnosis of Angelman syndrome.ResultsDespite a normal chromosomal microarray result and methylation studies at the imprinted 15q11q13 locus, the continued clinical suspicion of Angelman Syndrome prompted trio WGS of the proband and his parents. A de novo heterozygous frameshift variant, c.2370_2373del (NM_130838.2) p.(Asp790Glufs*7), in UBE3A was identified. To determine the parental allele on which this variant arose, long-read sequencing of the flanking genomic region was performed. Comparison of the resulting haplotypes allowed us to determine that the pathogenic frameshift variant arose on the maternal allele, confirming a diagnosis of Angelman syndrome in this case.ConclusionLong-read nanopore sequencing provides significant clinical utility when assessing the parental origin of de novo variants.
Collapse
Affiliation(s)
- Christopher Mark Watson
- North East and Yorkshire Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, UK .,Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, UK
| | - Lucy Jackson
- North East and Yorkshire Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, UK
| | - Laura A Crinnion
- North East and Yorkshire Genomic Laboratory Hub, Central Lab, St. James's University Hospital, Leeds, UK.,Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, UK
| | - David T Bonthron
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, UK.,Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, UK
| | - Eamonn Sheridan
- Leeds Institute of Medical Research, University of Leeds, St. James's University Hospital, Leeds, UK.,Yorkshire Regional Genetics Service, Chapel Allerton Hospital, Leeds, UK
| |
Collapse
|
15
|
Marsili L, Duque KR, Bode RL, Kauffman MA, Espay AJ. Uncovering Essential Tremor Genetics: The Promise of Long-Read Sequencing. Front Neurol 2022; 13:821189. [PMID: 35401394 PMCID: PMC8983820 DOI: 10.3389/fneur.2022.821189] [Citation(s) in RCA: 17] [Impact Index Per Article: 5.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/23/2021] [Accepted: 02/25/2022] [Indexed: 12/23/2022] Open
Abstract
Long-read sequencing (LRS) technologies have been recently introduced to overcome intrinsic limitations of widely-used next-generation sequencing (NGS) technologies, namely the sequencing limited to short-read fragments (150–300 base pairs). Since its introduction, LRS has permitted many successes in unraveling hidden mutational mechanisms. One area in clinical neurology in need of rethinking as it applies to genetic mechanisms is essential tremor (ET). This disorder, among the most common in neurology, is a syndrome often exhibiting an autosomal dominant pattern of inheritance whose large phenotypic spectrum suggest a multitude of genetic etiologies. Exome sequencing has revealed the genetic etiology only in rare ET families (FUS, SORT1, SCN4A, NOS3, KCNS2, HAPLN4/BRAL2, and USP46). We hypothesize that a reason for this shortcoming may be non-classical genetic mechanism(s) underpinning ET, among them trinucleotide, tetranucleotide, or pentanucleotide repeat disorders. In support of this hypothesis, trinucleotide (e.g., GGC repeats in NOTCH2NLC) and pentanucleotide repeat disorders (e.g., ATTTC repeats in STARD7) have been revealed as pathogenic in patients with a past history of what has come to be referred to as “ET plus,” bilateral hand tremor associated with epilepsy and/or leukoencephalopathy. A systematic review of LRS in neurodegenerative disorders showed that 10 of the 22 (45%) genetic etiologies ascertained by LRS include tremor in their phenotypic spectrum, suggesting that future clinical applications of LRS for tremor disorders may uncover genetic subtypes of familial ET that have eluded NGS, particularly those with associated leukoencephalopathy or family history of epilepsy. LRS provides a pathway for potentially uncovering novel genes and genetic mechanisms, helping narrow the large proportion of “idiopathic” ET.
Collapse
Affiliation(s)
- Luca Marsili
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Kevin R. Duque
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Rachel L. Bode
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
| | - Marcelo A. Kauffman
- Consultorio y Laboratorio de Neurogenética, Centro Universitario de Neurología José María Ramos Mejía, Buenos Aires, Argentina
| | - Alberto J. Espay
- James J. and Joan A. Gardner Center for Parkinson's Disease and Movement Disorders, Department of Neurology, University of Cincinnati, Cincinnati, OH, United States
- *Correspondence: Alberto J. Espay
| |
Collapse
|
16
|
Charnaud S, Munro JE, Semenec L, Mazhari R, Brewster J, Bourke C, Ruybal-Pesántez S, James R, Lautu-Gumal D, Karunajeewa H, Mueller I, Bahlo M. PacBio long-read amplicon sequencing enables scalable high-resolution population allele typing of the complex CYP2D6 locus. Commun Biol 2022; 5:168. [PMID: 35217695 PMCID: PMC8881578 DOI: 10.1038/s42003-022-03102-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2021] [Accepted: 02/01/2022] [Indexed: 01/31/2023] Open
Abstract
The CYP2D6 enzyme is estimated to metabolize 25% of commonly used pharmaceuticals and is of intense pharmacogenetic interest due to the polymorphic nature of the CYP2D6 gene. Accurate allele typing of CYP2D6 has proved challenging due to frequent copy number variants (CNVs) and paralogous pseudogenes. SNP-arrays, qPCR and short-read sequencing have been employed to interrogate CYP2D6, however these technologies are unable to capture longer range information. Long-read sequencing using the PacBio Single Molecule Real Time (SMRT) sequencing platform has yielded promising results for CYP2D6 allele typing. However, previous studies have been limited in scale and have employed nascent data processing pipelines. We present a robust data processing pipeline "PLASTER" for accurate allele typing of SMRT sequenced amplicons. We demonstrate the pipeline by typing CYP2D6 alleles in a large cohort of 377 Solomon Islanders. This pharmacogenetic method will improve drug safety and efficacy through screening prior to drug administration.
Collapse
Affiliation(s)
- Sarah Charnaud
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Jacob E. Munro
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Lucie Semenec
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia ,grid.1004.50000 0001 2158 5405ARC Centre of Excellence in Synthetic Biology, Department of Molecular Sciences, Macquarie University, Sydney, NSW Australia
| | - Ramin Mazhari
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Jessica Brewster
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Caitlin Bourke
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Shazia Ruybal-Pesántez
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia ,grid.1056.20000 0001 2224 8486Burnet Institute, Melbourne, VIC Australia
| | - Robert James
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Dulcie Lautu-Gumal
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Harin Karunajeewa
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Ivo Mueller
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| | - Melanie Bahlo
- grid.1042.70000 0004 0432 4889Walter and Eliza Hall Institute of Medical Research, Melbourne, VIC Australia ,grid.1008.90000 0001 2179 088XDepartment of Medical Biology, University of Melbourne, Melbourne, VIC Australia
| |
Collapse
|
17
|
Murdock DR, Rosenfeld JA, Lee B. What Has the Undiagnosed Diseases Network Taught Us About the Clinical Applications of Genomic Testing? Annu Rev Med 2022; 73:575-585. [PMID: 35084988 PMCID: PMC10874501 DOI: 10.1146/annurev-med-042120-014904] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Genetic testing has undergone a revolution in the last decade, particularly with the advent of next-generation sequencing and its associated reductions in costs and increases in efficiencies. The Undiagnosed Diseases Network (UDN) has been a leader in the application of such genomic testing for rare disease diagnosis. This review discusses the current state of genomic testing performed within the UDN, with a focus on the strengths and limitations of whole-exome and whole-genome sequencing in clinical diagnostics and the importance of ongoing data reanalysis. The role of emerging technologies such as RNA and long-read sequencing to further improve diagnostic rates in the UDN is also described. This review concludes with a discussion of the challenges faced in insurance coverage of comprehensive genomic testing as well as the opportunities for a larger role of testing in clinical medicine.
Collapse
Affiliation(s)
- David R Murdock
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA;
| | - Jill A Rosenfeld
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA;
| | - Brendan Lee
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas 77030, USA;
- Texas Children's Hospital, Houston, Texas 77030, USA
| |
Collapse
|
18
|
Zhang J, Qi G, Han C, Zhou Y, Yang Y, Wang X, Liu S, Zhang X. The Landscape of Clinical Implementation of Pharmacogenetic Testing in Central China: A Single-Center Study. Pharmgenomics Pers Med 2021; 14:1619-1628. [PMID: 34934339 PMCID: PMC8684419 DOI: 10.2147/pgpm.s338198] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/24/2021] [Accepted: 12/02/2021] [Indexed: 11/23/2022] Open
Abstract
Purpose Pharmacogenetic testing is recognized as the major method for the individualized pharmacotherapy in clinical pharmacy practice, but information about the clinical implementation of pharmacogenetic testing in China is limited. The present study aimed to determine the situation of clinical implementation for pharmacogenetic testing in central China. Methods The study is conducted in the department of clinical pharmacy in The First Affiliated Hospital of Zhengzhou University, Zhengzhou, China. We collected and analyzed pharmacogenetic testing results from November 1, 2013 to November 2, 2018 in our hospital, which were checked in the electronic medical record system. The main outcome measures were the number and type of pharmacogenetic testing across five years. Results A total of 47,265 (56.9% male, mean age = 51.5 years) pharmacogenetic testing results were obtained with an average annual rate of growth of 63.0% across five years. A 50.2% (23,748/47,265) of all the pharmacogenetic testing results were for the determination of cytochrome P450 2C19 (CYP2C19) *2, *3 genotypes, and 41.7% were for the methylene tetrahydrofolate reductase (MTHFR) C677T genotype. The number of departments performing the pharmacogenetic testing was 35, 63, 55, 52, 52 and 39 for 2013–2018, respectively, and the main top five departments were cardiology, psychiatry, ICU, cardiac surgery and intervention. Conclusion Clinical implementation of pharmacogenetic testing in China is growing rapidly, but the types and implementing departments of pharmacogenetic testing were limited. Our present study reported the real-world implementation modality of pharmacogenomic tests in China. It will help us to understand the testing of pharmacogenetics in China in order to promote the rational development of pharmacogenetics.
Collapse
Affiliation(s)
- Jingmin Zhang
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China.,Henan Key Laboratory for Precision Clinical Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| | - Guangzhao Qi
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China.,Henan Key Laboratory for Precision Clinical Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| | - Chao Han
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China.,Henan Key Laboratory for Precision Clinical Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| | - Yubing Zhou
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China.,Henan Key Laboratory for Precision Clinical Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| | - Yongjie Yang
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| | - Xinru Wang
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China.,Henan Key Laboratory for Precision Clinical Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| | - Suna Liu
- Newborn Screening Center, Department of Pediatrics, The Third Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| | - Xiaojian Zhang
- Department of Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China.,Henan Key Laboratory for Precision Clinical Pharmacy, The First Affiliated Hospital of Zhengzhou University, Zhengzhou, People's Republic of China
| |
Collapse
|
19
|
CStone: A de novo transcriptome assembler for short-read data that identifies non-chimeric contigs based on underlying graph structure. PLoS Comput Biol 2021; 17:e1009631. [PMID: 34813594 PMCID: PMC8651127 DOI: 10.1371/journal.pcbi.1009631] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/23/2021] [Revised: 12/07/2021] [Accepted: 11/11/2021] [Indexed: 11/19/2022] Open
Abstract
With the exponential growth of sequence information stored over the last decade, including that of de novo assembled contigs from RNA-Seq experiments, quantification of chimeric sequences has become essential when assembling read data. In transcriptomics, de novo assembled chimeras can closely resemble underlying transcripts, but patterns such as those seen between co-evolving sites, or mapped read counts, become obscured. We have created a de Bruijn based de novo assembler for RNA-Seq data that utilizes a classification system to describe the complexity of underlying graphs from which contigs are created. Each contig is labelled with one of three levels, indicating whether or not ambiguous paths exist. A by-product of this is information on the range of complexity of the underlying gene families present. As a demonstration of CStones ability to assemble high-quality contigs, and to label them in this manner, both simulated and real data were used. For simulated data, ten million read pairs were generated from cDNA libraries representing four species, Drosophila melanogaster, Panthera pardus, Rattus norvegicus and Serinus canaria. These were assembled using CStone, Trinity and rnaSPAdes; the latter two being high-quality, well established, de novo assembers. For real data, two RNA-Seq datasets, each consisting of ≈30 million read pairs, representing two adult D. melanogaster whole-body samples were used. The contigs that CStone produced were comparable in quality to those of Trinity and rnaSPAdes in terms of length, sequence identity of aligned regions and the range of cDNA transcripts represented, whilst providing additional information on chimerism. Here we describe the details of CStones assembly and classification process, and propose that similar classification systems can be incorporated into other de novo assembly tools. Within a related side study, we explore the effects that chimera’s within reference sets have on the identification of differentially expression genes. CStone is available at: https://sourceforge.net/projects/cstone/. Within transcriptome reference sets, non-chimeric sequences are representations of transcribed genes, while artificially generated chimeric ones are mosaics of two or more pieces of DNA incorrectly pieced together. One area where such sets are utilized is in the quantification of gene expression patterns; where RNA-Seq reads are mapped to the sequences within, and subsequent count values reflect expression levels. Artificial chimeras can have a negative impact on count values by erroneously increasing variation in relation to the reads being mapped. Reference sets can be created from de novo assembled contigs, but chimeras can be introduced during the assembly process via the required traversal of graphs, representing gene families, constructed from the RNA-Seq data. Graph complexity determines how likely chimeras will arise. We have created CStone, a de novo assembler that utilizes a classification system to describe such complexity. Contigs created by CStone are labelled in a manner that indicates whether or not they are non-chimeric. This encourages contig dependent results to be presented with increased objectivity by maintaining the context of ambiguity associated with the assembly process. CStone has been tested extensively. Additionally, we have quantified the relationship between chimeras within reference sets and the identification of differentially expressed genes.
Collapse
|
20
|
Bhat JA, Yu D, Bohra A, Ganie SA, Varshney RK. Features and applications of haplotypes in crop breeding. Commun Biol 2021; 4:1266. [PMID: 34737387 PMCID: PMC8568931 DOI: 10.1038/s42003-021-02782-y] [Citation(s) in RCA: 57] [Impact Index Per Article: 14.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/21/2021] [Accepted: 10/09/2021] [Indexed: 12/17/2022] Open
Abstract
Climate change with altered pest-disease dynamics and rising abiotic stresses threatens resource-constrained agricultural production systems worldwide. Genomics-assisted breeding (GAB) approaches have greatly contributed to enhancing crop breeding efficiency and delivering better varieties. Fast-growing capacity and affordability of DNA sequencing has motivated large-scale germplasm sequencing projects, thus opening exciting avenues for mining haplotypes for breeding applications. This review article highlights ways to mine haplotypes and apply them for complex trait dissection and in GAB approaches including haplotype-GWAS, haplotype-based breeding, haplotype-assisted genomic selection. Improvement strategies that efficiently deploy superior haplotypes to hasten breeding progress will be key to safeguarding global food security.
Collapse
Affiliation(s)
- Javaid Akhter Bhat
- National Center for Soybean Improvement, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Deyue Yu
- National Center for Soybean Improvement, State Key Laboratory of Crop Genetics and Germplasm Enhancement, Nanjing Agricultural University, Nanjing, China
| | - Abhishek Bohra
- Crop Improvement Division, ICAR- Indian Institute of Pulses Research (ICAR- IIPR), Kanpur, India
| | - Showkat Ahmad Ganie
- Department of Biotechnology, Visva-Bharati, Santiniketan, 731235, WB, India.
| | - Rajeev K Varshney
- Center of Excellence in Genomics & Systems Biology, International Crops Research Institute for the Semi-Arid Tropics (ICRISAT), Hyderabad, 502324, India.
- State Agricultural Biotechnology Centre, Centre for Crop & Food Research Innovation, Food Futures Institute, Murdoch University, Murdoch, WA, Australia.
| |
Collapse
|
21
|
Stephens Z, Milosevic D, Kipp B, Grebe S, Iyer RK, Kocher JPA. PB-Motif-A Method for Identifying Gene/Pseudogene Rearrangements With Long Reads: An Application to CYP21A2 Genotyping. Front Genet 2021; 12:716586. [PMID: 34394200 PMCID: PMC8355628 DOI: 10.3389/fgene.2021.716586] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2021] [Accepted: 07/05/2021] [Indexed: 12/30/2022] Open
Abstract
Long read sequencing technologies have the potential to accurately detect and phase variation in genomic regions that are difficult to fully characterize with conventional short read methods. These difficult to sequence regions include several clinically relevant genes with highly homologous pseudogenes, many of which are prone to gene conversions or other types of complex structural rearrangements. We present PB-Motif, a new method for identifying rearrangements between two highly homologous genomic regions using PacBio long reads. PB-Motif leverages clustering and filtering techniques to efficiently report rearrangements in the presence of sequencing errors and other systematic artifacts. Supporting reads for each high-confidence rearrangement can then be used for copy number estimation and phased variant calling. First, we demonstrate PB-Motif's accuracy with simulated sequence rearrangements of PMS2 and its pseudogene PMS2CL using simulated reads sweeping over a range of sequencing error rates. We then apply PB-Motif to 26 clinical samples, characterizing CYP21A2 and its pseudogene CYP21A1P as part of a diagnostic assay for congenital adrenal hyperplasia. We successfully identify damaging variation and patient carrier status concordant with clinical diagnosis obtained from multiplex ligation-dependent amplification (MLPA) and Sanger sequencing. The source code is available at: github.com/zstephens/pb-motif.
Collapse
Affiliation(s)
- Zachary Stephens
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, United States
| | | | | | | | - Ravishankar K Iyer
- Department of Electrical and Computer Engineering, University of Illinois Urbana-Champaign, Urbana, IL, United States
| | | |
Collapse
|
22
|
Langenfeld K, Chin K, Roy A, Wigginton K, Duhaime MB. Comparison of ultrafiltration and iron chloride flocculation in the preparation of aquatic viromes from contrasting sample types. PeerJ 2021; 9:e11111. [PMID: 33996275 PMCID: PMC8106395 DOI: 10.7717/peerj.11111] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 02/23/2021] [Indexed: 12/24/2022] Open
Abstract
Viral metagenomes (viromes) are a valuable untargeted tool for studying viral diversity and the central roles viruses play in host disease, ecology, and evolution. Establishing effective methods to concentrate and purify viral genomes prior to sequencing is essential for high quality viromes. Using virus spike-and-recovery experiments, we stepwise compared two common approaches for virus concentration, ultrafiltration and iron chloride flocculation, across diverse matrices: wastewater influent, wastewater secondary effluent, river water, and seawater. Viral DNA was purified by removing cellular DNA via chloroform cell lysis, filtration, and enzymatic degradation of extra-viral DNA. We found that viral genomes were concentrated 1-2 orders of magnitude more with ultrafiltration than iron chloride flocculation for all matrices and resulted in higher quality DNA suitable for amplification-free and long-read sequencing. Given its widespread use and utility as an inexpensive field method for virome sampling, we nonetheless sought to optimize iron flocculation. We found viruses were best concentrated in seawater with five-fold higher iron concentrations than the standard used, inhibition of DNase activity reduced purification effectiveness, and five-fold more iron was needed to flocculate viruses from freshwater than seawater—critical knowledge for those seeking to apply this broadly used method to freshwater virome samples. Overall, our results demonstrated that ultrafiltration and purification performed better than iron chloride flocculation and purification in the tested matrices. Given that the method performance depended on the solids content and salinity of the samples, we suggest spike-and-recovery experiments be applied when concentrating and purifying sample types that diverge from those tested here.
Collapse
Affiliation(s)
- Kathryn Langenfeld
- Department of Civil and Environmental Engineering, University of Michigan - Ann Arbor, Ann Arbor, MI, United States of America
| | - Kaitlyn Chin
- Department of Civil and Environmental Engineering, University of Michigan - Ann Arbor, Ann Arbor, MI, United States of America
| | - Ariel Roy
- Department of Civil and Environmental Engineering, University of Michigan - Ann Arbor, Ann Arbor, MI, United States of America
| | - Krista Wigginton
- Department of Civil and Environmental Engineering, University of Michigan - Ann Arbor, Ann Arbor, MI, United States of America
| | - Melissa B Duhaime
- Department of Ecology and Evolutionary Biology, University of Michigan - Ann Arbor, Ann Arbor, MI, United States of America
| |
Collapse
|
23
|
Harris AM, DeGiorgio M. A Likelihood Approach for Uncovering Selective Sweep Signatures from Haplotype Data. Mol Biol Evol 2021; 37:3023-3046. [PMID: 32392293 PMCID: PMC7530616 DOI: 10.1093/molbev/msaa115] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Abstract
Selective sweeps are frequent and varied signatures in the genomes of natural populations, and detecting them is consequently important in understanding mechanisms of adaptation by natural selection. Following a selective sweep, haplotypic diversity surrounding the site under selection decreases, and this deviation from the background pattern of variation can be applied to identify sweeps. Multiple methods exist to locate selective sweeps in the genome from haplotype data, but none leverages the power of a model-based approach to make their inference. Here, we propose a likelihood ratio test statistic T to probe whole-genome polymorphism data sets for selective sweep signatures. Our framework uses a simple but powerful model of haplotype frequency spectrum distortion to find sweeps and additionally make an inference on the number of presently sweeping haplotypes in a population. We found that the T statistic is suitable for detecting both hard and soft sweeps across a variety of demographic models, selection strengths, and ages of the beneficial allele. Accordingly, we applied the T statistic to variant calls from European and sub-Saharan African human populations, yielding primarily literature-supported candidates, including LCT, RSPH3, and ZNF211 in CEU, SYT1, RGS18, and NNT in YRI, and HLA genes in both populations. We also searched for sweep signatures in Drosophila melanogaster, finding expected candidates at Ace, Uhg1, and Pimet. Finally, we provide open-source software to compute the T statistic and the inferred number of presently sweeping haplotypes from whole-genome data.
Collapse
Affiliation(s)
- Alexandre M Harris
- Department of Biology, Pennsylvania State University, University Park, PA.,Molecular, Cellular, and Integrative Biosciences, Huck Institutes of the Life Sciences, Pennsylvania State University, University Park, PA
| | - Michael DeGiorgio
- Department of Computer and Electrical Engineering and Computer Science, Florida Atlantic University, Boca Raton, FL
| |
Collapse
|
24
|
Zhao A, Shen J, Ding Y, Sheng M, Zuo M, Lv H, Wang J, Shen Y, Wang H, Sun L. Long-read sequencing identified a novel nonsense and a de novo missense of PPA2 in trans in a Chinese patient with autosomal recessive infantile sudden cardiac failure. Clin Chim Acta 2021; 519:163-171. [PMID: 33826954 DOI: 10.1016/j.cca.2021.03.029] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2021] [Revised: 03/30/2021] [Accepted: 03/31/2021] [Indexed: 10/21/2022]
Abstract
BACKGROUND AND AIMS Biallelic missense variants in PPA2 gene cause infantile sudden cardiac failure (SCFI; OMIM #617222) characterized by sudden cardiac failure, sudden cardiac death in infants. Here, we present an unusual survivor with one inherited plus one de novo variant in PPA2. Since next-generation sequencing (NGS) fails to resolve variant phasing, which require long-read sequencing to clarify the diagnosis. MATERIALS AND METHODS Whole exome and Sanger sequencing were initially performed to identify the causative variants. PCR-based short tandem repeats (STRs) analysis and long-read single molecule real-time (SMRT) sequencing were further implemented for paternity testing and variant phasing. Pathogenicity evaluation of the biallelic variants in PPA2 was conducted according to the American College of Medical Genetics and Genomics (ACMG) and the Association for Molecular Pathology (AMP) guidelines based on VarSome. RESULTS Whole exome and Sanger sequencing revealed two variants in PPA2, with one novel nonsense variant (c.524C > G; p.Ser175*) inherited from the mother and one de novo missense variant (c.379C > T; p.Arg127Cys). PCR-based STRs analysis verified the paternity. And long-read SMRT sequencing phased the two variants in trans and identified the paternal origin of the de novo variant. The genetic diagnosis clarified the genetic etiology of the proband and assisted in patient management and counseling. CONCLUSION We identified a rare combination of one inherited plus one de novo variant of PPA2 in a patient with autosomal recessive SCFI, which expanded the mutation spectrum of PPA2 and demonstrated the power of target long-read sequencing to make up the diagnostic gap of prevailing NGS.
Collapse
Affiliation(s)
- Arman Zhao
- Department of Clinical Laboratory, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| | - Jie Shen
- Department of Cardiology, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| | - Yueyue Ding
- Department of Cardiology, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| | - Mao Sheng
- Department of Radiology, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| | - Mengying Zuo
- Department of Cardiology, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| | - Haitao Lv
- Department of Cardiology, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| | - Jian Wang
- Department of Pediatric Surgery, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| | - Yiping Shen
- Department of Genetics and Metabolism, Maternal and Child Health Hospital of Guangxi Zhuang Autonomous Region, Nanning 530003, Guangxi, China; Shanghai Children's Medical Center, Shanghai Jiao Tong University School of Medicine, Shanghai 200127, China; Division of Genetics and Genomics, Boston Children's Hospital, Boston, MA, United States; Department of Neurology, Harvard Medical School, 300 Longwood Avenue, Boston, MA 02115, United States.
| | - Hongying Wang
- Department of Clinical Laboratory, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China; Department of Clinical Laboratory, Children's Hospital of Wujiang District, Suzhou, 169 Park Road, Wujiang District, Suzhou 215234, Jiangsu, China.
| | - Ling Sun
- Department of Cardiology, Children's Hospital of Soochow University, 92 Zhongnan Street, Suzhou Industrial Park, Suzhou 215025, Jiangsu, China.
| |
Collapse
|
25
|
Noninvasive prenatal testing for β-thalassemia by targeted nanopore sequencing combined with relative haplotype dosage (RHDO): a feasibility study. Sci Rep 2021; 11:5714. [PMID: 33707551 PMCID: PMC7952549 DOI: 10.1038/s41598-021-85128-2] [Citation(s) in RCA: 20] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2020] [Accepted: 02/22/2021] [Indexed: 01/31/2023] Open
Abstract
Noninvasive prenatal testing (NIPT) for single gene disorders remains challenging. One approach that allows for accurate detection of the slight increase of the maternally inherited allele is the relative haplotype dosage (RHDO) analysis, which requires the construction of parental haplotypes. Recently, the nanopore sequencing technologies have become available and may be an ideal tool for direct construction of haplotypes. Here, we explored the feasibility of combining nanopore sequencing with the RHDO analysis in NIPT of β-thalassemia. Thirteen families at risk for β-thalassemia were recruited. Targeted region of parental genomic DNA was amplified by long-range PCR of 10 kb and 20 kb amplicons. Parental haplotypes were constructed using nanopore sequencing and next generation sequencing data. Fetal inheritance of parental haplotypes was classified by the RHDO analysis using data from maternal plasma DNA sequencing. Haplotype phasing was achieved in 12 families using data from 10 kb library. While data from the 20 kb library gave a better performance that haplotype phasing was achieved in all 13 families. Fetal status was correctly classified in 12 out of 13 families. Thus, targeted nanopore sequencing combined with the RHDO analysis is feasible to NIPT for β-thalassemia.
Collapse
|
26
|
MinION technology for microbiome sequencing applications for the conservation of cultural heritage. Microbiol Res 2021; 247:126727. [PMID: 33652267 DOI: 10.1016/j.micres.2021.126727] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2020] [Revised: 02/04/2021] [Accepted: 02/10/2021] [Indexed: 02/02/2023]
Abstract
The MinION single-molecule sequencing system has been attracting the attention of the community of microbiologists involved in the conservation of cultural heritage. The use of MinION for the conservation of cultural heritage is extremely recent, but surprisingly the only few applications available have been exploring many different substrates: stone, textiles, paintings and wax. The use of MinION sequencing is mainly used to address the metataxonomy (with special emphasis on non-cultivable microorganisms) with the effort to identify species involved in the degradation of the substrates. In this review, we show the current applications available on different artworks, showing how this technology can be a useful tool for microbiologists and conservators also in light of its low cost and the easy chemistry.
Collapse
|
27
|
Tong H, Phan NVT, Nguyen TT, Nguyen DV, Vo NS, Le L. Review on Databases and Bioinformatic Approaches on Pharmacogenomics of Adverse Drug Reactions. PHARMACOGENOMICS & PERSONALIZED MEDICINE 2021; 14:61-75. [PMID: 33469342 PMCID: PMC7812041 DOI: 10.2147/pgpm.s290781] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/06/2020] [Accepted: 12/26/2020] [Indexed: 12/27/2022]
Abstract
Pharmacogenomics has been used effectively in studying adverse drug reactions by determining the person-specific genetic factors associated with individual response to a drug. Current approaches have revealed the significant importance of sequencing technologies and sequence analysis strategies for interpreting the contribution of genetic variation in developing adverse reactions. Advance in next generation sequencing and platform brings new opportunities in validating the genetic candidates in certain reactions, and could be used to develop the preemptive tests to predict the outcome of the variation in a personal response to a drug. With the highly accumulated available data recently, the in silico approach with data analysis and modeling plays as other important alternatives which significantly support the final decisions in the transformation from research to clinical applications such as diagnosis and treatments for various types of adverse responses.
Collapse
Affiliation(s)
- Hang Tong
- School of Biotechnology, International University, Ho Chi Minh City, Vietnam.,Vietnam National University, Ho Chi Minh City, Vietnam
| | - Nga V T Phan
- School of Biotechnology, International University, Ho Chi Minh City, Vietnam.,Vietnam National University, Ho Chi Minh City, Vietnam
| | - Thanh T Nguyen
- Department of Translational Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam
| | - Dinh V Nguyen
- Department of Respiratory, Allergy and Clinical Immunology, Vinmec International Hospital, Hanoi, Vietnam.,College of Health Sciences, VinUniversity, Hanoi, Vietnam
| | - Nam S Vo
- Department of Translational Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam
| | - Ly Le
- School of Biotechnology, International University, Ho Chi Minh City, Vietnam.,Vietnam National University, Ho Chi Minh City, Vietnam.,Department of Translational Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam
| |
Collapse
|
28
|
Maestri S, Maturo MG, Cosentino E, Marcolungo L, Iadarola B, Fortunati E, Rossato M, Delledonne M. A Long-Read Sequencing Approach for Direct Haplotype Phasing in Clinical Settings. Int J Mol Sci 2020; 21:E9177. [PMID: 33271988 PMCID: PMC7731377 DOI: 10.3390/ijms21239177] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2020] [Revised: 11/20/2020] [Accepted: 11/27/2020] [Indexed: 11/17/2022] Open
Abstract
The reconstruction of individual haplotypes can facilitate the interpretation of disease risks; however, high costs and technical challenges still hinder their assessment in clinical settings. Second-generation sequencing is the gold standard for variant discovery but, due to the production of short reads covering small genomic regions, allows only indirect haplotyping based on statistical methods. In contrast, third-generation methods such as the nanopore sequencing platform developed by Oxford Nanopore Technologies (ONT) generate long reads that can be used for direct haplotyping, with fewer drawbacks. However, robust standards for variant phasing in ONT-based target resequencing efforts are not yet available. In this study, we presented a streamlined proof-of-concept workflow for variant calling and phasing based on ONT data in a clinically relevant 12-kb region of the APOE locus, a hotspot for variants and haplotypes associated with aging-related diseases and longevity. Starting with sequencing data from simple amplicons of the target locus, we demonstrated that ONT data allow for reliable single-nucleotide variant (SNV) calling and phasing from as little as 60 reads, although the recognition of indels is less efficient. Even so, we identified the best combination of ONT read sets (600) and software (BWA/Minimap2 and HapCUT2) that enables full haplotype reconstruction when both SNVs and indels have been identified previously using a highly-accurate sequencing platform. In conclusion, we established a rapid and inexpensive workflow for variant phasing based on ONT long reads. This allowed for the analysis of multiple samples in parallel and can easily be implemented in routine clinical practice, including diagnostic testing.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Marzia Rossato
- Department of Biotechnology, University of Verona, 37134 Verona, Italy; (S.M.); (M.G.M.); (E.C.); (L.M.); (B.I.); (E.F.); (M.D.)
| | | |
Collapse
|
29
|
Schultzhaus Z, Wang Z, Stenger D. CRISPR-based enrichment strategies for targeted sequencing. Biotechnol Adv 2020; 46:107672. [PMID: 33253795 DOI: 10.1016/j.biotechadv.2020.107672] [Citation(s) in RCA: 20] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/22/2020] [Revised: 10/31/2020] [Accepted: 11/22/2020] [Indexed: 12/26/2022]
Abstract
The ability to easily produce or procure sequencing data has expanded to be within the reach of most clinics and research laboratories, but the complexity of sequence analysis remains a hurdle for many scientists, and a decline in sequencing cost means that the generation of gratuitous information in a given experiment is a challenge that is more and more often being encountered. To address this issue, methods have been present, some dating to the advent of nucleic acid sequencing, for capturing, targeting, or otherwise enriching specific nucleic acids in order to obtain greater depth of reads from a small portion of sequences within a complex sample. However, many of these methods have been complicated and laborious, relying on the design of hundreds to thousands of oligonucleotide probes, fabrication of microarray chips, and long hybridization times. Here, we review these methods, their benefits and uses, and catalog and discuss the implications of a recent development that has enabled a more efficient and expanded set of tools for enriching nucleic acids - the application of CRISPR technology. This introduction and analysis of the capabilities of new CRISPR-based enrichment strategies shows that it has the potential to expand the scope of enrichment to new possibilities, including the coupling of DNA and RNA targeting with long-read, portable sequencing platforms. Moreover, there are several areas where CRISPR-enrichment is a logical next step to more powerful and simplified sequencing for applications such as diagnostics and environmental monitoring.
Collapse
Affiliation(s)
- Zachary Schultzhaus
- Center for Biomolecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue, SW, Washington, DC 20375, USA.
| | - Zheng Wang
- Center for Biomolecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue, SW, Washington, DC 20375, USA.
| | - David Stenger
- Center for Biomolecular Science and Engineering, Naval Research Laboratory, 4555 Overlook Avenue, SW, Washington, DC 20375, USA.
| |
Collapse
|
30
|
Gillingham MAF, Montero BK, Wihelm K, Grudzus K, Sommer S, Santos PSC. A novel workflow to improve genotyping of multigene families in wildlife species: An experimental set-up with a known model system. Mol Ecol Resour 2020; 21:982-998. [PMID: 33113273 DOI: 10.1111/1755-0998.13290] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2020] [Revised: 10/19/2020] [Accepted: 10/22/2020] [Indexed: 12/30/2022]
Abstract
Genotyping complex multigene families in novel systems is particularly challenging. Target primers frequently amplify simultaneously multiple loci leading to high PCR and sequencing artefacts such as chimeras and allele amplification bias. Most genotyping pipelines have been validated in nonmodel systems whereby the real genotype is unknown and the generation of artefacts may be highly repeatable. Further hindering accurate genotyping, the relationship between artefacts and genotype complexity (i.e. number of alleles per genotype) within a PCR remains poorly described. Here, we investigated the latter by experimentally combining multiple known major histocompatibility complex (MHC) haplotypes of a model organism (chicken, Gallus gallus, 43 artificial genotypes with 2-13 alleles per amplicon). In addition to well-defined 'optimal' primers, we simulated a nonmodel species situation by designing 'cross-species' primers based on sequence data from closely related Galliform species. We applied a novel open-source genotyping pipeline (ACACIA; https://gitlab.com/psc_santos/ACACIA), and compared its performance with another, previously published pipeline (AmpliSAS). Allele calling accuracy was higher when using ACACIA (98.5% versus 97% and 77.8% versus 75% for the 'optimal' and 'cross-species' data sets, respectively). Systematic allele dropout of three alleles owing to primer mismatch in the 'cross-species' data set explained high allele calling repeatability (100% when using ACACIA) despite low accuracy, demonstrating that repeatability can be misleading when evaluating genotyping workflows. Genotype complexity was positively associated with nonchimeric artefacts, chimeric artefacts (nonlinearly by levelling when amplifying more than 4-6 alleles) and allele amplification bias. Our study exemplifies and demonstrates pitfalls researchers should avoid to reliably genotype complex multigene families.
Collapse
Affiliation(s)
- Mark A F Gillingham
- Institute of Evolutionary Ecology and Conservation Genomics, Ulm Universität, Ulm, Germany
| | - B Karina Montero
- Institute of Evolutionary Ecology and Conservation Genomics, Ulm Universität, Ulm, Germany.,Zoological Institute, Animal Ecology and Conservation, Biocenter Grindel, Universität Hamburg, Hamburg,, Germany
| | - Kerstin Wihelm
- Institute of Evolutionary Ecology and Conservation Genomics, Ulm Universität, Ulm, Germany
| | - Kara Grudzus
- Institute of Evolutionary Ecology and Conservation Genomics, Ulm Universität, Ulm, Germany
| | - Simone Sommer
- Institute of Evolutionary Ecology and Conservation Genomics, Ulm Universität, Ulm, Germany
| | - Pablo S C Santos
- Institute of Evolutionary Ecology and Conservation Genomics, Ulm Universität, Ulm, Germany
| |
Collapse
|
31
|
Zhuang X, Lee Yu HL, Hsing IM. Toehold probe-based interrogation for haplotype phasing of long nucleic acid strands. ANALYTICAL METHODS : ADVANCING METHODS AND APPLICATIONS 2020; 12:4185-4190. [PMID: 32785301 DOI: 10.1039/d0ay00946f] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/22/2023]
Abstract
The arrangement of multiple single nucleotide polymorphisms (SNPs) in a gene, called a haplotype phase, is increasingly recognized as critical for accurate determination of disease risk and severity. However, conventional toehold-mediated strand displacement reactions are only able to interrogate SNPs, but not phase them since it is not known whether two SNPs in the same copy of the gene (cis) or in different copies of the same gene (trans) will give the same readout. While the rational introduction of an enzyme enables haplotype phasing, the complicated and stable secondary structure of long, single-stranded DNA sequences at room temperature limits its use. Complex nucleic acid structures make the hybridization of the probes difficult. Thus, we designed a molecular method to reveal the relative positions of SNPs located 1.4 kb apart in two copies of a gene by employing a competitive toehold probes and sink strategy at an elevated temperature. As such, we have successfully differentiated 20 nM of the 10 possible diplotypes in a long DNA target with two SNP sites located 1.4 kb apart within an hour without any additional amplification step. This offers a promising technology for accurate and fast haplotype phasing of SNPs that are over multiple kilobases away from each other.
Collapse
Affiliation(s)
- Xinyu Zhuang
- Department of Chemical and Biological Engineering, The Hong Kong University of Science and Technology, Clear Water Bay, Kowloon, Hong Kong.
| | | | | |
Collapse
|
32
|
Liau Y, Maggo S, Miller AL, Pearson JF, Kennedy MA, Cree SL. Nanopore sequencing of the pharmacogene CYP2D6 allows simultaneous haplotyping and detection of duplications. Pharmacogenomics 2020; 20:1033-1047. [PMID: 31559921 DOI: 10.2217/pgs-2019-0080] [Citation(s) in RCA: 25] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Aim: Long read sequencing offers the promise of overcoming some of the challenges in accurate genotyping of complex genes, along with the advantage of straightforward variant phasing. We have established methods for sequencing and haplotyping of the whole CYP2D6 gene using nanopore sequencing. Materials and methods: 32 samples covering various haplotypes including gene duplication were sequenced on the GridION platform. Results: Haplotypes of 52 alleles matched accurately to known star (*) allele subvariants, with the remaining 12 being assigned as new alleles, or new subvariants of known alleles. Duplicated alleles could be detected by analyzing the allelic balance. Conclusion: Nanopore sequencing of CYP2D6 offers a high throughput method for accurate haplotyping, detection of new variants and determination of duplicated alleles.
Collapse
Affiliation(s)
- Yusmiati Liau
- Department of Pathology & Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Simran Maggo
- Department of Pathology & Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Allison L Miller
- Department of Pathology & Biomedical Science, University of Otago, Christchurch, New Zealand
| | - John F Pearson
- Department of Pathology & Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Martin A Kennedy
- Department of Pathology & Biomedical Science, University of Otago, Christchurch, New Zealand
| | - Simone L Cree
- Department of Pathology & Biomedical Science, University of Otago, Christchurch, New Zealand
| |
Collapse
|
33
|
Magdy T, Kuo HH, Burridge PW. Precise and Cost-Effective Nanopore Sequencing for Post-GWAS Fine-Mapping and Causal Variant Identification. iScience 2020; 23:100971. [PMID: 32203907 PMCID: PMC7096756 DOI: 10.1016/j.isci.2020.100971] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2019] [Revised: 01/13/2020] [Accepted: 03/05/2020] [Indexed: 01/01/2023] Open
Abstract
Fine-mapping of interesting loci discovered by genome-wide association study (GWAS) is mandatory to pinpoint causal variants. Traditionally, this fine-mapping is completed through increasing the genotyping density at candidate loci, for which imputation is the current standard approach. Although imputation is a useful technique, it has a number of limitations that impede accuracy. In this work, we describe the development of a precise and cost-effective Nanopore sequencing-based pipeline that provides comprehensive and accurate information at candidate loci to identify potential causal single-nucleotide polymorphisms (SNPs). We demonstrate the utility of this technique via the fine-mapping of a GWAS positive hit comprising a synonymous SNP that is associated with doxorubicin-induced cardiotoxicity. In this work, we provide a proof of principle for the application of Nanopore sequencing in post-GWAS fine-mapping and pinpointing of potential causal SNPs with a minimal cost of just ~$10/100 kb/sample.
Collapse
Affiliation(s)
- Tarek Magdy
- Department of Pharmacology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA; Center for Pharmacogenomics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Hui-Hsuan Kuo
- Department of Pharmacology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA; Center for Pharmacogenomics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA
| | - Paul W Burridge
- Department of Pharmacology, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA; Center for Pharmacogenomics, Northwestern University Feinberg School of Medicine, Chicago, IL 60611, USA.
| |
Collapse
|
34
|
Using Nanopore Whole-Transcriptome Sequencing for Human Leukocyte Antigen Genotyping and Correlating Donor Human Leukocyte Antigen Expression with Flow Cytometric Crossmatch Results. J Mol Diagn 2020; 22:101-110. [DOI: 10.1016/j.jmoldx.2019.09.005] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2019] [Revised: 08/27/2019] [Accepted: 09/11/2019] [Indexed: 01/07/2023] Open
|
35
|
Snellings DA, Gallione CJ, Clark DS, Vozoris NT, Faughnan ME, Marchuk DA. Somatic Mutations in Vascular Malformations of Hereditary Hemorrhagic Telangiectasia Result in Bi-allelic Loss of ENG or ACVRL1. Am J Hum Genet 2019; 105:894-906. [PMID: 31630786 DOI: 10.1016/j.ajhg.2019.09.010] [Citation(s) in RCA: 91] [Impact Index Per Article: 15.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2019] [Accepted: 09/09/2019] [Indexed: 12/22/2022] Open
Abstract
Hereditary hemorrhagic telangiectasia (HHT) is a Mendelian disease characterized by vascular malformations (VMs) including visceral arteriovenous malformations and mucosal telangiectasia. HHT is caused by loss-of-function (LoF) mutations in one of three genes, ENG, ACVRL1, or SMAD4, and is inherited as an autosomal-dominant condition. Intriguingly, the constitutional mutation causing HHT is present throughout the body, yet the multiple VMs in individuals with HHT occur focally, rather than manifesting as a systemic vascular defect. This disconnect between genotype and phenotype suggests that a local event is necessary for the development of VMs. We investigated the hypothesis that local somatic mutations seed the formation HHT-related telangiectasia in a genetic two-hit mechanism. We identified low-frequency somatic mutations in 9/19 telangiectasia through the use of next-generation sequencing. We established phase for seven of nine samples, which confirms that the germline and somatic mutations in all seven samples exist in trans configuration; this is consistent with a genetic two-hit mechanism. These combined data suggest that bi-allelic loss of ENG or ACVRL1 may be a required event in the development of telangiectasia, and that rather than haploinsufficiency, VMs in HHT are caused by a Knudsonian two-hit mechanism.
Collapse
Affiliation(s)
- Daniel A Snellings
- Molecular Genetics and Microbiology, Duke University, Durham, NC 27710, USA
| | - Carol J Gallione
- Molecular Genetics and Microbiology, Duke University, Durham, NC 27710, USA
| | - Dewi S Clark
- Toronto HHT Centre, Division of Respirology, St. Michael's Hospital and Li Ka Shing Knowledge Institute, Toronto, ON M5B 1A6, Canada
| | - Nicholas T Vozoris
- Toronto HHT Centre, Division of Respirology, St. Michael's Hospital and Li Ka Shing Knowledge Institute, Toronto, ON M5B 1A6, Canada; Department of Medicine, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Marie E Faughnan
- Toronto HHT Centre, Division of Respirology, St. Michael's Hospital and Li Ka Shing Knowledge Institute, Toronto, ON M5B 1A6, Canada; Department of Medicine, University of Toronto, Toronto, ON M5S 1A8, Canada
| | - Douglas A Marchuk
- Molecular Genetics and Microbiology, Duke University, Durham, NC 27710, USA.
| |
Collapse
|
36
|
Midha MK, Wu M, Chiu KP. Long-read sequencing in deciphering human genetics to a greater depth. Hum Genet 2019; 138:1201-1215. [PMID: 31538236 DOI: 10.1007/s00439-019-02064-y] [Citation(s) in RCA: 63] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/22/2019] [Accepted: 09/13/2019] [Indexed: 12/12/2022]
Abstract
Through four decades' development, DNA sequencing has inched into the era of single-molecule sequencing (SMS), or the third-generation sequencing (TGS), as represented by two distinct technical approaches developed independently by Pacific Bioscience (PacBio) and Oxford Nanopore Technologies (ONT). Historically, each generation of sequencing technologies was marked by innovative technological achievements and novel applications. Long reads (LRs) are considered as the most advantageous feature of SMS shared by both PacBio and ONT to distinguish SMS from next-generation sequencing (NGS, or the second-generation sequencing) and Sanger sequencing (the first-generation sequencing). Long reads overcome the limitations of NGS and drastically improves the quality of genome assembly. Besides, ONT also contributes several unique features including ultra-long reads (ULRs) with read length above 300 kb and some close to 1 million bp, direct RNA sequencing and superior portability as made possible by pocket-sized MinION sequencer. Here, we review the history of DNA sequencing technologies and associated applications, with a special focus on the advantages as well as the limitations of ULR sequencing in genome assembly.
Collapse
Affiliation(s)
- Mohit K Midha
- Genomics Research Center, Academia Sinica, 128 Academia Road, Sec. 2, Nankang District, Taipei, 115, Taiwan.,Institute of Biochemistry and Molecular Biology, National Yang-Ming University, Taipei, Taiwan
| | - Mengchu Wu
- Health GeneTech, 22F No. 99, Xin Pu 6th St., Taoyuan, Taiwan
| | - Kuo-Ping Chiu
- Genomics Research Center, Academia Sinica, 128 Academia Road, Sec. 2, Nankang District, Taipei, 115, Taiwan. .,Institute of Biochemistry and Molecular Biology, National Yang-Ming University, Taipei, Taiwan. .,Department of Life Sciences, College of Life Sciences, National Taiwan University, Taipei, Taiwan.
| |
Collapse
|
37
|
Šoltys K, Planý M, Biocca P, Vianello V, Bučková M, Puškárová A, Sclocchi MC, Colaizzi P, Bicchieri M, Pangallo D, Pinzari F. Lead soaps formation and biodiversity in a XVIII Century wax seal coloured with minium. Environ Microbiol 2019; 22:1517-1534. [PMID: 31291047 DOI: 10.1111/1462-2920.14735] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/14/2019] [Revised: 06/24/2019] [Accepted: 07/08/2019] [Indexed: 01/30/2023]
Abstract
A multidisciplinary approach was carried out in order to study the biodeterioration and the associated microbiome of a XVIII Century wax seal coloured with minium. A small wax seal fragment was observed by scanning electron microscopy combined with energy dispersive spectroscopy in non-destructive mode. The same object was analysed by Raman and Fourier-transform infrared spectroscopy. The identification of the microbiota growing on the seal was performed with both a culture-dependent strategy, combined with hydrolytic assays, and high-throughput sequencing using the MinION platform. The whole bacterial 16S rRNA gene and the fungal markers ITS and 28S rRNA were targeted. It was observed that the carnauba wax coloured with lead tetroxide (minium) was covered by a biofilm consisting of a network of filaments and other structures of microbial origin. The culture-dependent and culture-independent investigations showed the presence of a complex microbiota composed mainly by fungal members, which demonstrated interesting properties related to lipids and lead processing. The formation of lead soaps and secondary biogenic minerals was also described.
Collapse
Affiliation(s)
- Katarína Šoltys
- Comenius University in Bratislava, Comenius University Science Park, Ilkovičova 8, Bratislava, 84104, Slovakia.,Faculty of Natural Sciences, Department of Molecular Biology, Comenius University in Bratislava, Mlynská dolina Ilkovičova 6, Bratislava, 84215, Slovakia
| | - Matej Planý
- Institute of Molecular Biology, Slovak Academy of Sciences, Dúbravská cesta 21, Bratislava, 84551, Slovakia
| | - Paola Biocca
- Department of Chemistry, Istituto centrale restauro conservazione patrimonio archivistico librario, Via Milano 76, 00184, Roma, Italy
| | - Valentina Vianello
- Department of Restoration, Istituto centrale restauro conservazione patrimonio archivistico librario, Via Milano 76, 00184, Roma, Italy
| | - Mária Bučková
- Institute of Molecular Biology, Slovak Academy of Sciences, Dúbravská cesta 21, Bratislava, 84551, Slovakia
| | - Andrea Puškárová
- Institute of Molecular Biology, Slovak Academy of Sciences, Dúbravská cesta 21, Bratislava, 84551, Slovakia
| | - Maria Carla Sclocchi
- Department of Biology, Istituto centrale restauro conservazione patrimonio archivistico librario, Via Milano 76, 00184, Roma, Italy
| | - Piero Colaizzi
- Department of Biology, Istituto centrale restauro conservazione patrimonio archivistico librario, Via Milano 76, 00184, Roma, Italy
| | - Marina Bicchieri
- Department of Chemistry, Istituto centrale restauro conservazione patrimonio archivistico librario, Via Milano 76, 00184, Roma, Italy
| | - Domenico Pangallo
- Institute of Molecular Biology, Slovak Academy of Sciences, Dúbravská cesta 21, Bratislava, 84551, Slovakia.,Caravella s.r.o., Tupolevova 2, Bratislava, 85101, Slovakia
| | - Flavia Pinzari
- Council for Agricultural Research and Economics, Research Centre for Agriculture and Environment, Via della Navicella 2-4, 00184, Rome, Italy.,Life Sciences Department, Natural History Museum, Cromwell Road, SW7 5BD, London, UK
| |
Collapse
|
38
|
Barsakis K, Babrzadeh F, Chi A, Mallempati K, Pickle W, Mindrinos M, Fernández-Viña MA. Complete nucleotide sequence characterization of DRB5 alleles reveals a homogeneous allele group that is distinct from other DRB genes. Hum Immunol 2019; 80:437-448. [PMID: 30954494 PMCID: PMC6622178 DOI: 10.1016/j.humimm.2019.04.001] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2019] [Revised: 03/23/2019] [Accepted: 04/01/2019] [Indexed: 01/28/2023]
Abstract
Next Generation Sequencing allows for testing and typing of entire genes of the HLA region. A better and comprehensive sequence assessment can be achieved by the inclusion of full gene sequences of all the common alleles at a given locus. The common alleles of DRB5 are under-characterized with the full exon-intron sequence of two alleles available. In the present study the DRB5 genes from 18 subjects alleles were cloned and sequenced; haplotype analysis showed that 17 of them had a single copy of DRB5 and one consanguineous subject was homozygous at all HLA loci. Methodological approaches including robust and efficient long-range PCR amplification, molecular cloning, nucleotide sequencing and de novo sequence assembly were combined to characterize DRB5 alleles. DRB5 sequences covering from 5'UTR to the end of intron 5 were obtained for DRB5*01:01, 01:02 and 02:02; partial coverage including a segment spanning exon 2 to exon 6 was obtained for DRB5*01:03, 01:08N and 02:03. Phylogenetic analysis of the generated sequences showed that the DRB5 alleles group together and have distinctive differences with other DRB loci. Novel intron variants of DRB5*01:01:01, 01:02 and 02:02 were identified. The newly characterized DRB5 intron variants of each DRB5 allele were found in subjects harboring distinct associations with alleles of DRB1, B and/or ethnicity. The new information provided by this study provides reference sequences for HLA typing methodologies. Extending sequence coverage may lead to identify the disease susceptibility factors of DRB5 containing haplotypes while the unexpected intron variations may shed light on understanding of the evolution of the DRB region.
Collapse
Affiliation(s)
- Konstantinos Barsakis
- Stanford Blood Center, Stanford University School of Medicine, Palo Alto, CA 94304, USA; Department of Biology, University of Crete, Heraklion, Crete 71003, Greece
| | - Farbod Babrzadeh
- Stanford Genome Technology Center, Stanford University School of Medicine, Palo Alto, CA 94304, USA
| | - Anjo Chi
- Stanford Genome Technology Center, Stanford University School of Medicine, Palo Alto, CA 94304, USA
| | - Kalyan Mallempati
- Stanford Blood Center, Stanford University School of Medicine, Palo Alto, CA 94304, USA
| | - William Pickle
- Stanford Blood Center, Stanford University School of Medicine, Palo Alto, CA 94304, USA
| | - Michael Mindrinos
- Stanford Genome Technology Center, Stanford University School of Medicine, Palo Alto, CA 94304, USA
| | | |
Collapse
|
39
|
Mantere T, Kersten S, Hoischen A. Long-Read Sequencing Emerging in Medical Genetics. Front Genet 2019; 10:426. [PMID: 31134132 PMCID: PMC6514244 DOI: 10.3389/fgene.2019.00426] [Citation(s) in RCA: 257] [Impact Index Per Article: 42.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/25/2018] [Accepted: 04/18/2019] [Indexed: 12/12/2022] Open
Abstract
The wide implementation of next-generation sequencing (NGS) technologies has revolutionized the field of medical genetics. However, the short read lengths of currently used sequencing approaches pose a limitation for the identification of structural variants, sequencing repetitive regions, phasing of alleles and distinguishing highly homologous genomic regions. These limitations may significantly contribute to the diagnostic gap in patients with genetic disorders who have undergone standard NGS, like whole exome or even genome sequencing. Now, the emerging long-read sequencing (LRS) technologies may offer improvements in the characterization of genetic variation and regions that are difficult to assess with the prevailing NGS approaches. LRS has so far mainly been used to investigate genetic disorders with previously known or strongly suspected disease loci. While these targeted approaches already show the potential of LRS, it remains to be seen whether LRS technologies can soon enable true whole genome sequencing routinely. Ultimately, this could allow the de novo assembly of individual whole genomes used as a generic test for genetic disorders. In this article, we summarize the current LRS-based research on human genetic disorders and discuss the potential of these technologies to facilitate the next major advancements in medical genetics.
Collapse
Affiliation(s)
- Tuomo Mantere
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Laboratory of Cancer Genetics and Tumor Biology, Cancer and Translational Medicine Research Unit and Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Simone Kersten
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| | - Alexander Hoischen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, Netherlands
- Department of Internal Medicine, Center for Infectious Diseases (RCI), Radboud University Medical Center, Nijmegen, Netherlands
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, Netherlands
| |
Collapse
|
40
|
Bethune K, Mariac C, Couderc M, Scarcelli N, Santoni S, Ardisson M, Martin J, Montúfar R, Klein V, Sabot F, Vigouroux Y, Couvreur TLP. Long-fragment targeted capture for long-read sequencing of plastomes. APPLICATIONS IN PLANT SCIENCES 2019; 7:e1243. [PMID: 31139509 PMCID: PMC6526642 DOI: 10.1002/aps3.1243] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/14/2018] [Accepted: 03/21/2019] [Indexed: 05/09/2023]
Abstract
PREMISE Third-generation sequencing methods generate significantly longer reads than those produced using alternative sequencing methods. This provides increased possibilities for the study of biodiversity, phylogeography, and population genetics. We developed a protocol for in-solution enrichment hybridization capture of long DNA fragments applicable to complete plastid genomes. METHODS AND RESULTS The protocol uses cost-effective in-house probes developed via long-range PCR and was used in six non-model monocot species (Poaceae: African rice, pearl millet, fonio; and three palm species). DNA was extracted from fresh and silica gel-dried leaves. Our protocol successfully captured long-read plastome fragments (3151 bp median on average), with an enrichment rate ranging from 15% to 98%. DNA extracted from silica gel-dried leaves led to low-quality plastome assemblies when compared to DNA extracted from fresh tissue. CONCLUSIONS Our protocol could also be generalized to capture long sequences from specific nuclear fragments.
Collapse
Affiliation(s)
| | | | | | | | - Sylvain Santoni
- UMR AGAP, Equipe Diversité et Adaptation de la Vigne et des Espèces MéditerranéennesINRA2 Place Viala34060MontpellierFrance
| | - Morgane Ardisson
- UMR AGAP, Equipe Diversité et Adaptation de la Vigne et des Espèces MéditerranéennesINRA2 Place Viala34060MontpellierFrance
| | | | - Rommel Montúfar
- Facultad de Ciencias Exactas y NaturalesPontificia Universidad Católica del EcuadorQuitoEcuador
| | | | | | | | | |
Collapse
|
41
|
Warwick-Dugdale J, Solonenko N, Moore K, Chittick L, Gregory AC, Allen MJ, Sullivan MB, Temperton B. Long-read viral metagenomics captures abundant and microdiverse viral populations and their niche-defining genomic islands. PeerJ 2019; 7:e6800. [PMID: 31086738 PMCID: PMC6487183 DOI: 10.7717/peerj.6800] [Citation(s) in RCA: 81] [Impact Index Per Article: 13.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2018] [Accepted: 03/14/2019] [Indexed: 01/18/2023] Open
Abstract
Marine viruses impact global biogeochemical cycles via their influence on host community structure and function, yet our understanding of viral ecology is constrained by limitations in host culturing and a lack of reference genomes and 'universal' gene markers to facilitate community surveys. Short-read viral metagenomic studies have provided clues to viral function and first estimates of global viral gene abundance and distribution, but their assemblies are confounded by populations with high levels of strain evenness and nucleotide diversity (microdiversity), limiting assembly of some of the most abundant viruses on Earth. Such features also challenge assembly across genomic islands containing niche-defining genes that drive ecological speciation. These populations and features may be successfully captured by single-virus genomics and fosmid-based approaches, at least in abundant taxa, but at considerable cost and technical expertise. Here we established a low-cost, low-input, high throughput alternative sequencing and informatics workflow to improve viral metagenomic assemblies using short-read and long-read technology. The 'VirION' (Viral, long-read metagenomics via MinION sequencing) approach was first validated using mock communities where it was found to be as relatively quantitative as short-read methods and provided significant improvements in recovery of viral genomes. We then then applied VirION to the first metagenome from a natural viral community from the Western English Channel. In comparison to a short-read only approach, VirION: (i) increased number and completeness of assembled viral genomes; (ii) captured abundant, highly microdiverse virus populations, and (iii) captured more and longer genomic islands. Together, these findings suggest that VirION provides a high throughput and cost-effective alternative to fosmid and single-virus genomic approaches to more comprehensively explore viral communities in nature.
Collapse
Affiliation(s)
- Joanna Warwick-Dugdale
- Plymouth Marine Laboratory, Plymouth, Devon, United Kingdom
- School of Biosciences, University of Exeter, Exeter, Devon, United Kingdom
| | - Natalie Solonenko
- Department of Microbiology, Ohio State University, Columbus, OH, United States of America
| | - Karen Moore
- School of Biosciences, University of Exeter, Exeter, Devon, United Kingdom
| | - Lauren Chittick
- Department of Microbiology, Ohio State University, Columbus, OH, United States of America
| | - Ann C. Gregory
- Department of Microbiology, Ohio State University, Columbus, OH, United States of America
| | - Michael J. Allen
- Plymouth Marine Laboratory, Plymouth, Devon, United Kingdom
- School of Biosciences, University of Exeter, Exeter, Devon, United Kingdom
| | - Matthew B. Sullivan
- Department of Microbiology, Ohio State University, Columbus, OH, United States of America
- Civil, Environmental and Geodetic Engineering, Ohio State University, Columbus, OH, United States of America
| | - Ben Temperton
- School of Biosciences, University of Exeter, Exeter, Devon, United Kingdom
| |
Collapse
|
42
|
Challenging the proposed causes of the PCR plateau phase. BIOMOLECULAR DETECTION AND QUANTIFICATION 2019; 17:100082. [PMID: 30886826 PMCID: PMC6403077 DOI: 10.1016/j.bdq.2019.100082] [Citation(s) in RCA: 18] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/05/2018] [Revised: 01/08/2019] [Accepted: 02/15/2019] [Indexed: 12/13/2022]
Abstract
Despite the wide-spread use of the polymerase chain reaction (PCR) in various life-science applications, the causes of arrested amplicon generation in late cycles have not been confidently identified. This so-called plateau phase has been attributed to depletion or thermal break-down of primers or nucleotides, thermal inactivation of the DNA polymerase, and product accumulation resulting in competition between primer annealing and product re-hybridization as well as blocking of DNA polymerase by double-stranded amplicons. In the current study, we experimentally investigate the proposed limiting factors of PCR product formation. By applying robust and validated qPCR assays, we elucidate the impact of adding non-target and target amplicons to the reactions, mimicking the high amount of products in late PCR cycles. Further, the impact of increased primer concentrations and thermal stability of reagents are explored. Our results show that high amounts of non-target amplicons inhibit amplification by binding to the DNA polymerase, but that this effect is counteracted by addition of more DNA polymerase or prolonged annealing/extension times. Adding high amounts of target amplicons that also act as templates in the reaction is far less inhibitory to amplification, although a decrease in amplification rate is seen. When primer concentrations are increased, both amplification rates and end-product yields are elevated. Taken together, our results suggest that the main cause of PCR plateau formation is primer depletion and not product accumulation or degradation of reagents. We stress that a PCR plateau caused by primer depletion is assay-dependent, i.e. dependent on the primer design and primer characteristics such as the probability of primer-dimer formation. Our findings contribute to an improved understanding of the major parameters controlling the PCR dynamics at later cycles and the limitations of continued product formation, which in the end can facilitate PCR optimization.
Collapse
|
43
|
Leija‐Salazar M, Sedlazeck FJ, Toffoli M, Mullin S, Mokretar K, Athanasopoulou M, Donald A, Sharma R, Hughes D, Schapira AH, Proukakis C. Evaluation of the detection of GBA missense mutations and other variants using the Oxford Nanopore MinION. Mol Genet Genomic Med 2019; 7:e564. [PMID: 30637984 PMCID: PMC6418358 DOI: 10.1002/mgg3.564] [Citation(s) in RCA: 60] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2018] [Revised: 11/23/2018] [Accepted: 12/13/2018] [Indexed: 01/09/2023] Open
Abstract
BACKGROUND Mutations in GBA cause Gaucher disease when biallelic and are strong risk factors for Parkinson's disease when heterozygous. GBA analysis is complicated by the nearby pseudogene. We aimed to design and validate a method for sequencing GBA using long reads. METHODS We sequenced GBA on the Oxford Nanopore MinION as an 8.9 kb amplicon from 102 individuals, including patients with Parkinson's and Gaucher diseases. We used NanoOK for quality metrics, NGMLR to align data (after comparing with GraphMap), Nanopolish and Sniffles to call variants, and WhatsHap for phasing. RESULTS We detected all known missense mutations in these samples, including the common p.N409S (N370S) and p.L483P (L444P) in multiple samples, and nine rarer ones, as well as a splicing and a truncating mutation, and intronic SNPs. We demonstrated the ability to phase mutations, confirm compound heterozygosity, and assign haplotypes. We also detected two known risk variants in some Parkinson's patients. Rare false positives were easily identified and filtered, with the Nanopolish quality score adjusted for the number of reads a very robust discriminator. In two individuals carrying a recombinant allele, we were able to detect and fully define it in one carrier, where it included a 55-base pair deletion, but not in another one, suggesting a limitation of the PCR enrichment method. Missense mutations were detected at the correct zygosity, except for the case where the RecNciI one was missed. CONCLUSION The Oxford Nanopore MinION can detect missense mutations and an exonic deletion in this difficult gene, with the added advantages of phasing and intronic analysis. It can be used as an efficient research tool, but additional work is required to exclude all recombinants.
Collapse
Affiliation(s)
- Melissa Leija‐Salazar
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Institute of NeurologyUniversity College LondonLondonUK
| | | | - Marco Toffoli
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Institute of NeurologyUniversity College LondonLondonUK
| | - Stephen Mullin
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Institute of NeurologyUniversity College LondonLondonUK
- Institute of Translational and Stratified MedicinePlymouth University Peninsula School of MedicinePlymouthUK
| | - Katya Mokretar
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Institute of NeurologyUniversity College LondonLondonUK
| | - Maria Athanasopoulou
- Department of Molecular Neuroscience, Institute of NeurologyUniversity College LondonLondonUK
| | - Aimee Donald
- Department of PaediatricsRoyal Manchester Children’s HospitalManchesterUK
| | - Reena Sharma
- The Mark Holland Metabolic Unit, Salford Royal Foundation NHS TrustSalfordUK
| | - Derralynn Hughes
- Institute of Immunity and TransplantationLysosomal Storage Disorders Unit, Royal Free HospitalLondonUK
| | - Anthony H.V. Schapira
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Institute of NeurologyUniversity College LondonLondonUK
| | - Christos Proukakis
- Department of Clinical and Movement Neurosciences, Royal Free Campus, Institute of NeurologyUniversity College LondonLondonUK
| |
Collapse
|
44
|
Detection and Classification of Hard and Soft Sweeps from Unphased Genotypes by Multilocus Genotype Identity. Genetics 2018; 210:1429-1452. [PMID: 30315068 DOI: 10.1534/genetics.118.301502] [Citation(s) in RCA: 51] [Impact Index Per Article: 7.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/03/2018] [Accepted: 10/08/2018] [Indexed: 11/18/2022] Open
Abstract
Positive natural selection can lead to a decrease in genomic diversity at the selected site and at linked sites, producing a characteristic signature of elevated expected haplotype homozygosity. These selective sweeps can be hard or soft. In the case of a hard selective sweep, a single adaptive haplotype rises to high population frequency, whereas multiple adaptive haplotypes sweep through the population simultaneously in a soft sweep, producing distinct patterns of genetic variation in the vicinity of the selected site. Measures of expected haplotype homozygosity have previously been used to detect sweeps in multiple study systems. However, these methods are formulated for phased haplotype data, typically unavailable for nonmodel organisms, and some may have reduced power to detect soft sweeps due to their increased genetic diversity relative to hard sweeps. To address these limitations, we applied the H12 and H2/H1 statistics proposed in 2015 by Garud et al., which have power to detect both hard and soft sweeps, to unphased multilocus genotypes, denoting them as G12 and G2/G1. G12 (and the more direct expected homozygosity analog to H12, denoted G123) has comparable power to H12 for detecting both hard and soft sweeps. G2/G1 can be used to classify hard and soft sweeps analogously to H2/H1, conditional on a genomic region having high G12 or G123 values. The reason for this power is that, under random mating, the most frequent haplotypes will yield the most frequent multilocus genotypes. Simulations based on parameters compatible with our recent understanding of human demographic history suggest that expected homozygosity methods are best suited for detecting recent sweeps, and increase in power under recent population expansions. Finally, we find candidates for selective sweeps within the 1000 Genomes CEU, YRI, GIH, and CHB populations, which corroborate and complement existing studies.
Collapse
|
45
|
Heeger F, Bourne EC, Baschien C, Yurkov A, Bunk B, Spröer C, Overmann J, Mazzoni CJ, Monaghan MT. Long-read DNA metabarcoding of ribosomal RNA in the analysis of fungi from aquatic environments. Mol Ecol Resour 2018; 18:1500-1514. [DOI: 10.1111/1755-0998.12937] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/03/2018] [Revised: 07/05/2018] [Accepted: 07/28/2018] [Indexed: 01/06/2023]
Affiliation(s)
- Felix Heeger
- Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB); Berlin Germany
- Berlin Center for Genomics in Biodiversity Research; Berlin Germany
| | - Elizabeth C. Bourne
- Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB); Berlin Germany
- Berlin Center for Genomics in Biodiversity Research; Berlin Germany
| | - Christiane Baschien
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures; Braunschweig Germany
| | - Andrey Yurkov
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures; Braunschweig Germany
| | - Boyke Bunk
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures; Braunschweig Germany
| | - Cathrin Spröer
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures; Braunschweig Germany
| | - Jörg Overmann
- Leibniz Institute DSMZ-German Collection of Microorganisms and Cell Cultures; Braunschweig Germany
| | - Camila J. Mazzoni
- Berlin Center for Genomics in Biodiversity Research; Berlin Germany
- Leibniz Institute of Zoo- and Wildlife Research (IZW); Berlin Germany
| | - Michael T. Monaghan
- Leibniz Institute of Freshwater Ecology and Inland Fisheries (IGB); Berlin Germany
- Berlin Center for Genomics in Biodiversity Research; Berlin Germany
| |
Collapse
|
46
|
Blighe K, DeDionisio L, Christie KA, Chawes B, Shareef S, Kakouli-Duarte T, Chao-Shern C, Harding V, Kelly RS, Castellano L, Stebbing J, Lasky-Su JA, Nesbit MA, Moore CBT. Gene editing in the context of an increasingly complex genome. BMC Genomics 2018; 19:595. [PMID: 30086710 PMCID: PMC6081867 DOI: 10.1186/s12864-018-4963-8] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2017] [Accepted: 07/26/2018] [Indexed: 12/15/2022] Open
Abstract
The reporting of the first draft of the human genome in 2000 brought with it much hope for the future in what was felt as a paradigm shift toward improved health outcomes. Indeed, we have now mapped the majority of variation across human populations with landmark projects such as 1000 Genomes; in cancer, we have catalogued mutations across the primary carcinomas; whilst, for other diseases, we have identified the genetic variants with strongest association. Despite this, we are still awaiting the genetic revolution in healthcare to materialise and translate itself into the health benefits for which we had hoped. A major problem we face relates to our underestimation of the complexity of the genome, and that of biological mechanisms, generally. Fixation on DNA sequence alone and a 'rigid' mode of thinking about the genome has meant that the folding and structure of the DNA molecule -and how these relate to regulation- have been underappreciated. Projects like ENCODE have additionally taught us that regulation at the level of RNA is just as important as that at the spatiotemporal level of chromatin.In this review, we chart the course of the major advances in the biomedical sciences in the era pre- and post the release of the first draft sequence of the human genome, taking a focus on technology and how its development has influenced these. We additionally focus on gene editing via CRISPR/Cas9 as a key technique, in particular its use in the context of complex biological mechanisms. Our aim is to shift the mode of thinking about the genome to that which encompasses a greater appreciation of the folding of the DNA molecule, DNA- RNA/protein interactions, and how these regulate expression and elaborate disease mechanisms.Through the composition of our work, we recognise that technological improvement is conducive to a greater understanding of biological processes and life within the cell. We believe we now have the technology at our disposal that permits a better understanding of disease mechanisms, achievable through integrative data analyses. Finally, only with greater understanding of disease mechanisms can techniques such as gene editing be faithfully conducted.
Collapse
Affiliation(s)
- K Blighe
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, 181 Longwood Avenue, Boston, MA, USA.
- Department of Cancer Studies and Molecular Medicine, Robert Kilpatrick Clinical Sciences Building, Leicester Royal Infirmary, Leicester, LE2 7LX, UK.
- Bill Lyons Informatics Centre, UCL Cancer Institute, University College London, WC1E 6DD, London, UK.
| | - L DeDionisio
- Avellino Laboratories, Menlo Park, CA, 94025, USA
| | - K A Christie
- Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK
| | - B Chawes
- COPSAC, Copenhagen Prospective Studies on Asthma in Childhood, Herlev and Gentofte Hospital, University of Copenhagen, Copenhagen, Denmark
| | - S Shareef
- University of Raparin, Ranya, Kurdistan Region, Iraq
| | - T Kakouli-Duarte
- Institute of Technology Carlow, Department of Science and Health, Kilkenny Road, Carlow, Ireland
| | - C Chao-Shern
- Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK
- Avellino Laboratories, Menlo Park, CA, 94025, USA
| | - V Harding
- Imperial College London, Division of Cancer, Department of Surgery and Cancer, Hammersmith Hospital Campus, Du Cane Road, London, W12 0NN, UK
| | - R S Kelly
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, 181 Longwood Avenue, Boston, MA, USA
| | - L Castellano
- Imperial College London, Division of Cancer, Department of Surgery and Cancer, Hammersmith Hospital Campus, Du Cane Road, London, W12 0NN, UK
- JMS Building, School of Life Sciences, University of Sussex, Falmer, Brighton, BN1 9QG, UK
| | - J Stebbing
- Imperial College London, Division of Cancer, Department of Surgery and Cancer, Hammersmith Hospital Campus, Du Cane Road, London, W12 0NN, UK
| | - J A Lasky-Su
- Channing Division of Network Medicine, Brigham and Women's Hospital and Harvard Medical School, 181 Longwood Avenue, Boston, MA, USA
| | - M A Nesbit
- Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK
| | - C B T Moore
- Biomedical Sciences Research Institute, University of Ulster, Coleraine, Northern Ireland, BT52 1SA, UK.
- Avellino Laboratories, Menlo Park, CA, 94025, USA.
| |
Collapse
|
47
|
Orr RJS, Zhao S, Klaveness D, Yabuki A, Ikeda K, Makoto WM, Shalchian-Tabrizi K. Enigmatic Diphyllatea eukaryotes: culturing and targeted PacBio RS amplicon sequencing reveals a higher order taxonomic diversity and global distribution. BMC Evol Biol 2018; 18:115. [PMID: 30021531 PMCID: PMC6052632 DOI: 10.1186/s12862-018-1224-z] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2018] [Accepted: 06/29/2018] [Indexed: 12/31/2022] Open
Abstract
BACKGROUND The class Diphyllatea belongs to a group of enigmatic unicellular eukaryotes that play a key role in reconstructing the morphological innovation and diversification of early eukaryotic evolution. Despite its evolutionary significance, very little is known about the phylogeny and species diversity of Diphyllatea. Only three species have described morphology, being taxonomically divided by flagella number, two or four, and cell size. Currently, one 18S rRNA Diphyllatea sequence is available, with environmental sequencing surveys reporting only a single partial sequence from a Diphyllatea-like organism. Accordingly, geographical distribution of Diphyllatea based on molecular data is limited, despite morphological data suggesting the class has a global distribution. We here present a first attempt to understand species distribution, diversity and higher order structure of Diphyllatea. RESULTS We cultured 11 new strains, characterised these morphologically and amplified their rRNA for a combined 18S-28S rRNA phylogeny. We sampled environmental DNA from multiple sites and designed new Diphyllatea-specific PCR primers for long-read PacBio RSII technology. Near full-length 18S rRNA sequences from environmental DNA, in addition to supplementary Diphyllatea sequence data mined from public databases, resolved the phylogeny into three deeply branching and distinct clades (Diphy I - III). Of these, the Diphy III clade is entirely novel, and in congruence with Diphy II, composed of species morphologically consistent with the earlier described Collodictyon triciliatum. The phylogenetic split between the Diphy I and Diphy II + III clades corresponds with a morphological division of Diphyllatea into bi- and quadriflagellate cell forms. CONCLUSIONS This altered flagella composition must have occurred early in the diversification of Diphyllatea and may represent one of the earliest known morphological transitions among eukaryotes. Further, the substantial increase in molecular data presented here confirms Diphyllatea has a global distribution, seemingly restricted to freshwater habitats. Altogether, the results reveal the advantage of combining a group-specific PCR approach and long-read high-throughput amplicon sequencing in surveying enigmatic eukaryote lineages. Lastly, our study shows the capacity of PacBio RS when targeting a protist class for increasing phylogenetic resolution.
Collapse
Affiliation(s)
- Russell J. S. Orr
- Section for Genetics and Evolutionary Biology (EVOGENE), Department of Biosciences, University of Oslo, Kristine Bonnevies hus, Blindernveien 31, 0371 Oslo, Norway
- Centre for Integrative Microbial Evolution (CIME), Section for Genetics and Evolutionary Biology (EVOGENE), Department of Biosciences, University of Oslo, Kristine Bonnevies hus, Blindernveien 31, 0371 Oslo, Norway
| | - Sen Zhao
- Department of Molecular Oncology, Institute of Cancer Research, Oslo University Hospital-Radiumhospitalet, Oslo, Norway
- Medical Faculty, Center for Cancer Biomedicine, University of Oslo University Hospital, Oslo, Norway
| | - Dag Klaveness
- Section for Aquatic Biology and Toxicology (AQUA), Department of Biosciences, University of Oslo, Oslo, Norway
| | - Akinori Yabuki
- Japan Agency for Marine-Earth Science and Technology (JAMSTEC), 2-15 Natsushima, Yokosuka, Kanagawa 237-0061 Japan
| | - Keiji Ikeda
- Faculty of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki, 305-8572 Japan
| | - Watanabe M. Makoto
- Faculty of Life and Environmental Sciences, University of Tsukuba, 1-1-1 Tennodai, Tsukuba, Ibaraki, 305-8572 Japan
| | - Kamran Shalchian-Tabrizi
- Section for Genetics and Evolutionary Biology (EVOGENE), Department of Biosciences, University of Oslo, Kristine Bonnevies hus, Blindernveien 31, 0371 Oslo, Norway
- Centre for Integrative Microbial Evolution (CIME), Section for Genetics and Evolutionary Biology (EVOGENE), Department of Biosciences, University of Oslo, Kristine Bonnevies hus, Blindernveien 31, 0371 Oslo, Norway
| |
Collapse
|
48
|
Nofziger C, Paulmichl M. Accurately genotyping CYP2D6: not for the faint of heart. Pharmacogenomics 2018; 19:999-1002. [PMID: 30020016 DOI: 10.2217/pgs-2018-0105] [Citation(s) in RCA: 26] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Affiliation(s)
| | - Markus Paulmichl
- Center for Health & Bioresources, Austrian Institute of Technology, Vienna, Austria.,NESMOS Department, University of Rome Sapienza, Rome, Italy
| |
Collapse
|
49
|
Abstract
The HLA region is the most polymorphic genes in the human genome and is associated with an increasing number of disease states. Historically, HLA typing methodology has been governed by phenotypic determination. This practice has evolved into the use of molecular methods such as real-time PCR, sequence-specific oligonucleotides, and sequencing-based methods. Numerous studies have identified HLA matching as a key determinate to improve patient outcomes from transplantation. Solid-organ transplants focus on HLA-DRB1 in renal organ allocation while hematopoietic cell transplants focus on HLA-A, -B, -C, -DRB1 matching. The role of HLA typing in the future will be driven by HLA expression, understanding of HLA haplotypes, and rapid HLA typing.
Collapse
Affiliation(s)
- Claire H Edgerly
- Department of Pathology and Laboratory Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA
| | - Eric T Weimer
- Department of Pathology and Laboratory Medicine, School of Medicine, University of North Carolina at Chapel Hill, Chapel Hill, NC, USA.
| |
Collapse
|
50
|
Hamada M, Ono Y, Asai K, Frith MC. Training alignment parameters for arbitrary sequencers with LAST-TRAIN. Bioinformatics 2017; 33:926-928. [PMID: 28039163 PMCID: PMC5351549 DOI: 10.1093/bioinformatics/btw742] [Citation(s) in RCA: 44] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/09/2016] [Accepted: 11/18/2016] [Indexed: 01/05/2023] Open
Abstract
Summary LAST-TRAIN improves sequence alignment accuracy by inferring substitution and gap scores that fit the frequencies of substitutions, insertions, and deletions in a given dataset. We have applied it to mapping DNA reads from IonTorrent and PacBio RS, and we show that it reduces reference bias for Oxford Nanopore reads. Availability and Implementation the source code is freely available at http://last.cbrc.jp/. Contact mhamada@waseda.jp or mcfrith@edu.k.u-tokyo.ac.jp. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Michiaki Hamada
- Department of Electrical Engineering and Bioscience, Faculty of Science and Engineering, Waseda University, 55N-06-10, 3-4-1, Okubo Shinjuku-ku, Tokyo 169-8555, Japan.,Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 135-0064, Japan
| | | | - Kiyoshi Asai
- Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 135-0064, Japan.,Graduate School of Frontier Sciences, University of Tokyo, Chiba 277-8562, Japan
| | - Martin C Frith
- Computational Bio Big-Data Open Innovation Laboratory (CBBD-OIL), National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 169-8555, Japan.,Artificial Intelligence Research Center, National Institute of Advanced Industrial Science and Technology (AIST), Tokyo 135-0064, Japan.,Graduate School of Frontier Sciences, University of Tokyo, Chiba 277-8562, Japan
| |
Collapse
|