1
|
Khazeeva G, Sablauskas K, van der Sanden B, Steyaert W, Kwint M, Rots D, Hinne M, van Gerven M, Yntema H, Vissers L, Gilissen C. DeNovoCNN: a deep learning approach to de novo variant calling in next generation sequencing data. Nucleic Acids Res 2022; 50:e97. [PMID: 35713566 PMCID: PMC9508836 DOI: 10.1093/nar/gkac511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2021] [Revised: 04/19/2022] [Accepted: 05/30/2022] [Indexed: 11/14/2022] Open
Abstract
De novo mutations (DNMs) are an important cause of genetic disorders. The accurate identification of DNMs from sequencing data is therefore fundamental to rare disease research and diagnostics. Unfortunately, identifying reliable DNMs remains a major challenge due to sequence errors, uneven coverage, and mapping artifacts. Here, we developed a deep convolutional neural network (CNN) DNM caller (DeNovoCNN), that encodes the alignment of sequence reads for a trio as 160\documentclass[12pt]{minimal}
\usepackage{amsmath}
\usepackage{wasysym}
\usepackage{amsfonts}
\usepackage{amssymb}
\usepackage{amsbsy}
\usepackage{upgreek}
\usepackage{mathrsfs}
\setlength{\oddsidemargin}{-69pt}
\begin{document}
}{}$ \times$\end{document}164 resolution images. DeNovoCNN was trained on DNMs of 5616 whole exome sequencing (WES) trios achieving total 96.74% recall and 96.55% precision on the test dataset. We find that DeNovoCNN has increased recall/sensitivity and precision compared to existing DNM calling approaches (GATK, DeNovoGear, DeepTrio, Samtools) based on the Genome in a Bottle reference dataset and independent WES and WGS trios. Validations of DNMs based on Sanger and PacBio HiFi sequencing confirm that DeNovoCNN outperforms existing methods. Most importantly, our results suggest that DeNovoCNN is likely robust against different exome sequencing and analyses approaches, thereby allowing the application on other datasets. DeNovoCNN is freely available as a Docker container and can be run on existing alignment (BAM/CRAM) and variant calling (VCF) files from WES and WGS without a need for variant recalling.
Collapse
Affiliation(s)
- Gelana Khazeeva
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| | - Karolis Sablauskas
- Institute of Clinical Medicine, Faculty of Medicine, Vilnius University, Vilnius, Lithuania
| | - Bart van der Sanden
- Department of Human Genetics, Donders Centre for Neuroscience, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| | - Wouter Steyaert
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| | - Michael Kwint
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| | - Dmitrijs Rots
- Department of Human Genetics, Donders Centre for Neuroscience, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| | - Max Hinne
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, The Netherlands
| | - Marcel van Gerven
- Donders Institute for Brain, Cognition and Behaviour, Radboud University Nijmegen, The Netherlands
| | - Helger Yntema
- Department of Human Genetics, Donders Centre for Neuroscience, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| | - Lisenka Vissers
- Department of Human Genetics, Donders Centre for Neuroscience, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| | - Christian Gilissen
- Department of Human Genetics, Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Geert Grooteplein 10, 6525 GA Nijmegen, The Netherlands
| |
Collapse
|
2
|
Bhat GR, Sethi I, Rah B, Kumar R, Afroze D. Innovative in Silico Approaches for Characterization of Genes and Proteins. Front Genet 2022; 13:865182. [PMID: 35664302 PMCID: PMC9159363 DOI: 10.3389/fgene.2022.865182] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/29/2022] [Accepted: 04/11/2022] [Indexed: 11/13/2022] Open
Abstract
Bioinformatics is an amalgamation of biology, mathematics and computer science. It is a science which gathers the information from biology in terms of molecules and applies the informatic techniques to the gathered information for understanding and organizing the data in a useful manner. With the help of bioinformatics, the experimental data generated is stored in several databases available online like nucleotide database, protein databases, GENBANK and others. The data stored in these databases is used as reference for experimental evaluation and validation. Till now several online tools have been developed to analyze the genomic, transcriptomic, proteomics, epigenomics and metabolomics data. Some of them include Human Splicing Finder (HSF), Exonic Splicing Enhancer Mutation taster, and others. A number of SNPs are observed in the non-coding, intronic regions and play a role in the regulation of genes, which may or may not directly impose an effect on the protein expression. Many mutations are thought to influence the splicing mechanism by affecting the existing splice sites or creating a new sites. To predict the effect of mutation (SNP) on splicing mechanism/signal, HSF was developed. Thus, the tool is helpful in predicting the effect of mutations on splicing signals and can provide data even for better understanding of the intronic mutations that can be further validated experimentally. Additionally, rapid advancement in proteomics have steered researchers to organize the study of protein structure, function, relationships, and dynamics in space and time. Thus the effective integration of all of these technological interventions will eventually lead to steering up of next-generation systems biology, which will provide valuable biological insights in the field of research, diagnostic, therapeutic and development of personalized medicine.
Collapse
Affiliation(s)
- Gh. Rasool Bhat
- Advanced Centre for Human Genetics, Sher-I- Kashmir Institute of Medical Sciences, Soura, India
| | - Itty Sethi
- Institute of Human Genetics, University of Jammu, Jammu, India
| | - Bilal Rah
- Advanced Centre for Human Genetics, Sher-I- Kashmir Institute of Medical Sciences, Soura, India
| | - Rakesh Kumar
- School of Biotechnology, Shri Mata Vaishno Devi University, Katra, India
| | - Dil Afroze
- Advanced Centre for Human Genetics, Sher-I- Kashmir Institute of Medical Sciences, Soura, India
- *Correspondence: Dil Afroze,
| |
Collapse
|
3
|
Corominas J, Smeekens SP, Nelen MR, Yntema HG, Kamsteeg EJ, Pfundt R, Gilissen C. Clinical exome sequencing - mistakes and caveats. Hum Mutat 2022; 43:1041-1055. [PMID: 35191116 PMCID: PMC9541396 DOI: 10.1002/humu.24360] [Citation(s) in RCA: 16] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/10/2021] [Revised: 01/11/2022] [Accepted: 02/18/2022] [Indexed: 11/30/2022]
Abstract
Massive parallel sequencing technology has become the predominant technique for genetic diagnostics and research. Many genetic laboratories have wrestled with the challenges of setting up genetic testing workflows based on a completely new technology. The learning curve we went through as a laboratory was accompanied by growing pains while we gained new knowledge and expertise. Here we discuss some important mistakes that have been made in our laboratory through 10 years of clinical exome sequencing but that have given us important new insights on how to adapt our working methods. We provide these examples and the lessons that we learned to help other laboratories avoid to make the same mistakes.
Collapse
Affiliation(s)
- Jordi Corominas
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Sanne P Smeekens
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Marcel R Nelen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Helger G Yntema
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Erik-Jan Kamsteeg
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Rolph Pfundt
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands.,Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, Nijmegen, The Netherlands
| | - Christian Gilissen
- Department of Human Genetics, Radboud University Medical Center, Nijmegen, The Netherlands.,Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Nijmegen, The Netherlands
| |
Collapse
|
4
|
Iyer GR, Kumar R, Poornima S, Kamireddy AP, Juturu KK, Bhatnagar L, Arora S, Suresh V, Utage PR, Bailur S, Pujar AN, Hasan Q. Utility of next-generation sequencing in genetic testing and counseling of disorders involving the musculoskeletal system—trends observed from a single genetic unit. J Orthop Surg Res 2022; 17:76. [PMID: 35123515 PMCID: PMC8818190 DOI: 10.1186/s13018-022-02969-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 04/16/2021] [Accepted: 01/25/2022] [Indexed: 11/10/2022] Open
Abstract
Abstract
Background
Disorders involving the musculoskeletal system are often identified with short stature and a range of orthopedic problems. The clinical and genetic heterogeneity of these diseases along with several characteristic overlaps makes definitive diagnosis difficult for clinicians. Hence, using molecular testing in addition to conventional tests becomes essential for appropriate diagnosis and management.
Methods
Comprehensive clinical examination, detailed pretest and posttest counseling, molecular diagnosis with next-generation sequencing (NGS), genotype–phenotype correlation and Sanger sequencing for targeted variant analysis.
Results
This manuscript reports a molecular spectrum of variants in 34 orthopedic cases referred to a single genetic unit attached to a tertiary care hospital. The diagnostic yield of NGS-based tests coupled with genetic counseling and segregation analysis was 79% which included 7 novel variants. In about 53% (i.e. 18/34 cases), molecular testing outcome was actionable since 8 of the 18 underwent prenatal diagnosis, as they were either in their early gestation or had planned a pregnancy subsequent to molecular testing, while ten cases were premaritally/prenatally counseled for the families to take informed decisions as they were in the reproductive age.
Conclusions
The report highlights the importance of NGS-based tests even in a low resource setting as it helps patients, families and healthcare providers in reducing the economic, social and emotional burden of these disorders.
Collapse
|
5
|
Musacchia F, Karali M, Torella A, Laurie S, Policastro V, Pizzo M, Beltran S, Casari G, Nigro V, Banfi S. VarGenius-HZD Allows Accurate Detection of Rare Homozygous or Hemizygous Deletions in Targeted Sequencing Leveraging Breadth of Coverage. Genes (Basel) 2021; 12:genes12121979. [PMID: 34946927 PMCID: PMC8701221 DOI: 10.3390/genes12121979] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2021] [Revised: 12/06/2021] [Accepted: 12/08/2021] [Indexed: 11/17/2022] Open
Abstract
Homozygous deletions (HDs) may be the cause of rare diseases and cancer, and their discovery in targeted sequencing is a challenging task. Different tools have been developed to disentangle HD discovery but a sensitive caller is still lacking. We present VarGenius-HZD, a sensitive and scalable algorithm that leverages breadth-of-coverage for the detection of rare homozygous and hemizygous single-exon deletions (HDs). To assess its effectiveness, we detected both real and synthetic rare HDs in fifty exomes from the 1000 Genomes Project obtaining higher sensitivity in comparison with state-of-the-art algorithms that each missed at least one event. We then applied our tool on targeted sequencing data from patients with Inherited Retinal Dystrophies and solved five cases that still lacked a genetic diagnosis. We provide VarGenius-HZD either stand-alone or integrated within our recently developed software, enabling the automated selection of samples using the internal database. Hence, it could be extremely useful for both diagnostic and research purposes.
Collapse
Affiliation(s)
- Francesco Musacchia
- Telethon Institute of Genetics and Medicine, 80078 Pozzuoli, Italy; (M.K.); (M.P.); (G.C.); (V.N.); (S.B.)
- Center for Human Technologies, Istituto Italiano di Tecnologia, 16163 Genova, Italy
- Correspondence:
| | - Marianthi Karali
- Telethon Institute of Genetics and Medicine, 80078 Pozzuoli, Italy; (M.K.); (M.P.); (G.C.); (V.N.); (S.B.)
- Eye Clinic, Multidisciplinary Department of Medical, Surgical and Dental Sciences, Università degli Studi della Campania ‘Luigi Vanvitelli’, 80138 Naples, Italy
| | - Annalaura Torella
- Medical Genetics, Department of Precision Medicine, Università degli Studi della Campania ‘Luigi Vanvitelli’, 80138 Naples, Italy;
| | - Steve Laurie
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Spain; (S.L.); (S.B.)
| | - Valeria Policastro
- Institute for Applied Mathematics “Mauro Picone” (IAC), National Research Council, 80131 Naples, Italy;
- Department of Environmental, Biological and Pharmaceutical Sciences and Technologies, Università degli Studi della Campania ‘Luigi Vanvitelli’, 81100 Caserta, Italy
| | - Mariateresa Pizzo
- Telethon Institute of Genetics and Medicine, 80078 Pozzuoli, Italy; (M.K.); (M.P.); (G.C.); (V.N.); (S.B.)
| | - Sergi Beltran
- CNAG-CRG, Centre for Genomic Regulation (CRG), The Barcelona Institute of Science and Technology, 08003 Barcelona, Spain; (S.L.); (S.B.)
- Universitat Pompeu Fabra (UPF), 08017 Barcelona, Spain
- Department of Genetics, Microbiology and Statistics, Universitat de Barcelona (UB), 08028 Barcelona, Spain
| | - Giorgio Casari
- Telethon Institute of Genetics and Medicine, 80078 Pozzuoli, Italy; (M.K.); (M.P.); (G.C.); (V.N.); (S.B.)
- Neurogenomics Unit, Center for Genomics, Bioinformatics and Biostatistics, San Raffaele Scientific Institute, 20132 Milan, Italy
| | - Vincenzo Nigro
- Telethon Institute of Genetics and Medicine, 80078 Pozzuoli, Italy; (M.K.); (M.P.); (G.C.); (V.N.); (S.B.)
- Medical Genetics, Department of Precision Medicine, Università degli Studi della Campania ‘Luigi Vanvitelli’, 80138 Naples, Italy;
| | - Sandro Banfi
- Telethon Institute of Genetics and Medicine, 80078 Pozzuoli, Italy; (M.K.); (M.P.); (G.C.); (V.N.); (S.B.)
- Medical Genetics, Department of Precision Medicine, Università degli Studi della Campania ‘Luigi Vanvitelli’, 80138 Naples, Italy;
| |
Collapse
|
6
|
Missense3D-DB web catalogue: an atom-based analysis and repository of 4M human protein-coding genetic variants. Hum Genet 2021; 140:805-812. [PMID: 33502607 PMCID: PMC8052235 DOI: 10.1007/s00439-020-02246-z] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 12/07/2020] [Indexed: 12/22/2022]
Abstract
The interpretation of human genetic variation is one of the greatest challenges of modern genetics. New approaches are urgently needed to prioritize variants, especially those that are rare or lack a definitive clinical interpretation. We examined 10,136,597 human missense genetic variants from GnomAD, ClinVar and UniProt. We were able to perform large-scale atom-based mapping and phenotype interpretation of 3,960,015 of these variants onto 18,874 experimental and 84,818 in house predicted three-dimensional coordinates of the human proteome. We demonstrate that 14% of amino acid substitutions from the GnomAD database that could be structurally analysed are predicted to affect protein structure (n = 568,548, of which 566,439 rare or extremely rare) and may, therefore, have a yet unknown disease-causing effect. The same is true for 19.0% (n = 6266) of variants of unknown clinical significance or conflicting interpretation reported in the ClinVar database. The results of the structural analysis are available in the dedicated web catalogue Missense3D-DB ( http://missense3d.bc.ic.ac.uk/ ). For each of the 4 M variants, the results of the structural analysis are presented in a friendly concise format that can be included in clinical genetic reports. A detailed report of the structural analysis is also available for the non-experts in structural biology. Population frequency and predictions from SIFT and PolyPhen are included for a more comprehensive variant interpretation. This is the first large-scale atom-based structural interpretation of human genetic variation and offers geneticists and the biomedical community a new approach to genetic variant interpretation.
Collapse
|
7
|
Yang J, Liu A, He I, Bai Y. Bioinformatics Analysis Revealed Novel 3'UTR Variants Associated with Intellectual Disability. Genes (Basel) 2020; 11:genes11090998. [PMID: 32858868 PMCID: PMC7563394 DOI: 10.3390/genes11090998] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2020] [Revised: 08/15/2020] [Accepted: 08/24/2020] [Indexed: 01/03/2023] Open
Abstract
MicroRNAs (or miRNAs) are short nucleotide sequences (~17–22 bp long) that play important roles in gene regulation through targeting genes in the 3′untranslated regions (UTRs). Variants located in genomic regions might have different biological consequences in changing gene expression. Exonic variants (e.g., coding variant and 3′UTR variant) are often causative of diseases due to their influence on gene product. Variants harbored in the 3′UTR region where miRNAs perform their targeting function could potentially alter the binding relationships for target pairs, which could relate to disease causation. We gathered miRNA–mRNA targeting pairs from published studies and then employed the database of microRNA Target Site single nucleotide variants (SNVs) (dbMTS) to discover novel SNVs within the selected pairs. We identified a total of 183 SNVs for the 114 pairs of accurate miRNA–mRNA targeting pairs selected. Detailed bioinformatics analysis of the three genes with identified variants that were exclusively located in the 3′UTR section indicated their association with intellectual disability (ID). Our result showed an exceptionally high expression of GPR88 in brain tissues based on GTEx gene expression data, while WNT7A expression data were relatively high in brain tissues when compared to other tissues. Motif analysis for the 3′UTR region of WNT7A showed that five identified variants were well-conserved across three species (human, mouse, and rat); the motif that contains the variant identified in GPR88 is significant at the level of the 3′UTR of the human genome. Studies of pathways, protein–protein interactions, and relations to diseases further suggest potential association with intellectual disability of our discovered SNVs. Our results demonstrated that 3′UTR variants could change target interactions of miRNA–mRNA pairs in the context of their association with ID. We plan to automate the methods through developing a bioinformatics pipeline for identifying novel 3′UTR SNVs harbored by miRNA-targeted genes in the future.
Collapse
Affiliation(s)
- Junmeng Yang
- Shanghai Starriver Bilingual School, Shanghai 201100, China;
| | - Anna Liu
- Appleby College, Oakville, ON L6L3V7, Canada;
| | - Isabella He
- Pittsford Mendon High School, 472 Mendon Road, Pittsford, NY 14534, USA;
| | - Yongsheng Bai
- Department of Biology, Eastern Michigan University, 441 Mark Jefferson Hall, Ypsilanti, MI 48197, USA
- Next-Gen Intelligent Science Training, Ann Arbor, MI 48105, USA
- Correspondence:
| |
Collapse
|
8
|
Abílio AP, Silva M, Kampango A, Narciso I, Gudo ES, das Neves LCB, Sidat M, Fafetine JM, de Almeida APG, Parreira R. A survey of RNA viruses in mosquitoes from Mozambique reveals novel genetic lineages of flaviviruses and phenuiviruses, as well as frequent flavivirus-like viral DNA forms in Mansonia. BMC Microbiol 2020; 20:225. [PMID: 32723369 PMCID: PMC7385898 DOI: 10.1186/s12866-020-01905-5] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2020] [Accepted: 07/14/2020] [Indexed: 11/12/2022] Open
Abstract
BACKGROUND Mosquito-borne diseases involving arboviruses represent expanding threats to sub-Saharan Africa imposing as considerable burden to human and veterinary public health. In Mozambique over one hundred species of potential arbovirus mosquito vectors have been identified, although their precise role in maintaining such viruses in circulation in the country remains to be elucidated. The aim of this study was to screen for the presence of flaviviruses, alphaviruses and bunyaviruses in mosquitoes from different regions of Mozambique. RESULTS Our survey analyzed 14,519 mosquitoes, and the results obtained revealed genetically distinct insect-specific flaviviruses, detected in multiple species of mosquitoes from different genera. In addition, smaller flavivirus-like NS5 sequences, frequently detected in Mansonia seemed to correspond to defective viral sequences, present as viral DNA forms. Furthermore, three lineages of putative members of the Phenuiviridae family were also detected, two of which apparently corresponding to novel viral genetic lineages. CONCLUSION This study reports for the first-time novel insect-specific flaviviruses and novel phenuiviruses, as well as frequent flavivirus-like viral DNA forms in several widely known vector species. This unique work represents recent investigation of virus screening conducted in mosquitoes from Mozambique and an important contribution to inform the establishment of a vector control program for arbovirus in the country and in the region.
Collapse
Affiliation(s)
- Ana Paula Abílio
- Instituto Nacional de Saúde (INS)-Ministry of Health (MISAU), Vila de Marracuene, Estrada Nacional N°1, Parcela N°3943, P.O. Box: 264, Maputo, Mozambique.
- Faculty of Medicine, Eduardo Mondlane University (UEM), Maputo, Mozambique.
| | - Manuel Silva
- Unidade de Microbiologia Médica, Instituto de Higiene e Medicina Tropical (IHMT)/Universidade Nova de Lisboa (NOVA), and Global Health and Tropical Medicine (GHTM) Research Centre, Lisbon, Portugal
| | - Ayubo Kampango
- Instituto Nacional de Saúde (INS)-Ministry of Health (MISAU), Vila de Marracuene, Estrada Nacional N°1, Parcela N°3943, P.O. Box: 264, Maputo, Mozambique
| | - Inácio Narciso
- Unidade de Parasitologia Médica, Instituto de Higiene e Medicina Tropical (IHMT)/Universidade Nova de Lisboa (NOVA), and Global Health and Tropical Medicine (GHTM) Research Centre, Lisbon, Portugal
| | - Eduardo Samo Gudo
- Instituto Nacional de Saúde (INS)-Ministry of Health (MISAU), Vila de Marracuene, Estrada Nacional N°1, Parcela N°3943, P.O. Box: 264, Maputo, Mozambique
| | | | - Mohsin Sidat
- Faculty of Medicine, Eduardo Mondlane University (UEM), Maputo, Mozambique
| | | | - António Paulo Gouveia de Almeida
- Unidade de Parasitologia Médica, Instituto de Higiene e Medicina Tropical (IHMT)/Universidade Nova de Lisboa (NOVA), and Global Health and Tropical Medicine (GHTM) Research Centre, Lisbon, Portugal
| | - Ricardo Parreira
- Unidade de Microbiologia Médica, Instituto de Higiene e Medicina Tropical (IHMT)/Universidade Nova de Lisboa (NOVA), and Global Health and Tropical Medicine (GHTM) Research Centre, Lisbon, Portugal
| |
Collapse
|
9
|
Alaimo JT, Glinton KE, Liu N, Xiao J, Yang Y, Reid Sutton V, Elsea SH. Integrated analysis of metabolomic profiling and exome data supplements sequence variant interpretation, classification, and diagnosis. Genet Med 2020; 22:1560-1566. [PMID: 32439973 PMCID: PMC7483344 DOI: 10.1038/s41436-020-0827-0] [Citation(s) in RCA: 27] [Impact Index Per Article: 6.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/09/2020] [Revised: 04/26/2020] [Accepted: 04/27/2020] [Indexed: 12/17/2022] Open
Abstract
PURPOSE A primary barrier to improving exome sequencing diagnostic rates is the interpretation of variants of uncertain clinical significance. We aimed to determine the contribution of integrated untargeted metabolomics in the analysis of exome sequencing data by retrospective analysis of patients evaluated by both exome sequencing and untargeted metabolomics within the same clinical laboratory. METHODS Exome sequencing and untargeted metabolomic data were collected and analyzed for 170 patients. Pathogenic variants, likely pathogenic variants, and variants of uncertain significance in genes associated with a biochemical phenotype were extracted. Metabolomic data were evaluated to determine if these variants resulted in biochemical abnormalities that could be used to support their interpretation using current American College of Genetics and Genomics (ACMG) guidelines. RESULTS Metabolomic data contributed to the interpretation of variants in 74 individuals (43.5%) over 73 different genes. The data allowed for the reclassification of 9 variants as likely benign, 15 variants as likely pathogenic, and 3 variants as pathogenic. Metabolomic data confirmed a clinical diagnosis in 21 cases, for a diagnostic rate of 12.3% in this population. CONCLUSION Untargeted metabolomics can serve as a useful adjunct to exome sequencing by providing valuable functional data that may not otherwise be clinically available, resulting in improved variant classification.
Collapse
Affiliation(s)
- Joseph T Alaimo
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.,Baylor Genetics, Houston, TX, USA.,Department of Pediatrics, University of Missouri-Kansas City School of Medicine, Kansas City, MO, USA
| | - Kevin E Glinton
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - Ning Liu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.,Baylor Genetics, Houston, TX, USA
| | - Jing Xiao
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.,Baylor Genetics, Houston, TX, USA
| | - Yaping Yang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA
| | - V Reid Sutton
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.,Baylor Genetics, Houston, TX, USA
| | - Sarah H Elsea
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA. .,Baylor Genetics, Houston, TX, USA.
| |
Collapse
|
10
|
Rasool N, Akhtar A, Hussain W. Insights into the inhibitory potential of selective phytochemicals against Mpro of 2019-nCoV: a computer-aided study. Struct Chem 2020; 31:1777-1783. [PMID: 32362735 PMCID: PMC7193139 DOI: 10.1007/s11224-020-01536-6] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/17/2020] [Accepted: 04/13/2020] [Indexed: 12/14/2022]
Abstract
At the end of December 2019, a novel strain of coronavirus, given the name of 2019-nCoV, emerged for exhibiting symptoms of severe acute respiratory syndrome. The virus is spreading rapidly in China and around the globe, affecting thousands of people leading to a pandemic. To control the mortality rate associated with the 2019-nCoV, prompt steps are needed. Until now there is no effective treatment or drug present to control its life-threatening effects in the humans. The scientist is struggling to find new inhibitors of this deadly virus. In this study, to identify the effective inhibitor candidates against the main protease (Mpro) of 2019-nCoV, computational approaches were adopted. Phytochemicals having immense medicinal properties as ligands were docked against the Mpro of 2019-nCoV to study their binding properties. ADMET and DFT analyses were also further carried out to analyze the potential of these phytochemicals as an effective inhibitor against Mpro of 2019-nCoV.
Collapse
Affiliation(s)
- Nouman Rasool
- 1Dr Panjwani Center for Molecular Medicine and Drug Research, International Center for Chemical and Biological Sciences, University of Karachi, Karachi, 75270 Pakistan.,Center for Professional Studies, Lahore, Pakistan
| | - Ammara Akhtar
- 2Department of Life Sciences, University of Management and Technology, Lahore, Pakistan
| | - Waqar Hussain
- 3National Center of Artificial Intelligence, Punjab University College of Information Technology, University of the Punjab, Lahore, Pakistan.,Center for Professional Studies, Lahore, Pakistan
| |
Collapse
|
11
|
Pereira R, Oliveira J, Sousa M. Bioinformatics and Computational Tools for Next-Generation Sequencing Analysis in Clinical Genetics. J Clin Med 2020; 9:E132. [PMID: 31947757 PMCID: PMC7019349 DOI: 10.3390/jcm9010132] [Citation(s) in RCA: 77] [Impact Index Per Article: 19.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2019] [Revised: 12/15/2019] [Accepted: 12/30/2019] [Indexed: 12/13/2022] Open
Abstract
Clinical genetics has an important role in the healthcare system to provide a definitive diagnosis for many rare syndromes. It also can have an influence over genetics prevention, disease prognosis and assisting the selection of the best options of care/treatment for patients. Next-generation sequencing (NGS) has transformed clinical genetics making possible to analyze hundreds of genes at an unprecedented speed and at a lower price when comparing to conventional Sanger sequencing. Despite the growing literature concerning NGS in a clinical setting, this review aims to fill the gap that exists among (bio)informaticians, molecular geneticists and clinicians, by presenting a general overview of the NGS technology and workflow. First, we will review the current NGS platforms, focusing on the two main platforms Illumina and Ion Torrent, and discussing the major strong points and weaknesses intrinsic to each platform. Next, the NGS analytical bioinformatic pipelines are dissected, giving some emphasis to the algorithms commonly used to generate process data and to analyze sequence variants. Finally, the main challenges around NGS bioinformatics are placed in perspective for future developments. Even with the huge achievements made in NGS technology and bioinformatics, further improvements in bioinformatic algorithms are still required to deal with complex and genetically heterogeneous disorders.
Collapse
Affiliation(s)
- Rute Pereira
- Laboratory of Cell Biology, Department of Microscopy, Institute of Biomedical Sciences Abel Salazar (ICBAS), University of Porto (UP), 4050-313 Porto, Portugal;
- Biology and Genetics of Reproduction Unit, Multidisciplinary Unit for Biomedical Research (UMIB), ICBAS-UP, 4050-313 Porto, Portugal;
| | - Jorge Oliveira
- Biology and Genetics of Reproduction Unit, Multidisciplinary Unit for Biomedical Research (UMIB), ICBAS-UP, 4050-313 Porto, Portugal;
- UnIGENe and CGPP–Centre for Predictive and Preventive Genetics-Institute for Molecular and Cell Biology (IBMC), i3S-Institute for Research and Innovation in Health-UP, 4200-135 Porto, Portugal
| | - Mário Sousa
- Laboratory of Cell Biology, Department of Microscopy, Institute of Biomedical Sciences Abel Salazar (ICBAS), University of Porto (UP), 4050-313 Porto, Portugal;
- Biology and Genetics of Reproduction Unit, Multidisciplinary Unit for Biomedical Research (UMIB), ICBAS-UP, 4050-313 Porto, Portugal;
| |
Collapse
|
12
|
Vidal S, Xiol C, Pascual-Alonso A, O'Callaghan M, Pineda M, Armstrong J. Genetic Landscape of Rett Syndrome Spectrum: Improvements and Challenges. Int J Mol Sci 2019; 20:ijms20163925. [PMID: 31409060 PMCID: PMC6719047 DOI: 10.3390/ijms20163925] [Citation(s) in RCA: 22] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2019] [Revised: 08/08/2019] [Accepted: 08/10/2019] [Indexed: 02/06/2023] Open
Abstract
Rett syndrome (RTT) is an early-onset neurodevelopmental disorder that primarily affects females, resulting in severe cognitive and physical disabilities, and is one of the most prevalent causes of intellectual disability in females. More than fifty years after the first publication on Rett syndrome, and almost two decades since the first report linking RTT to the MECP2 gene, the research community's effort is focused on obtaining a better understanding of the genetics and the complex biology of RTT and Rett-like phenotypes without MECP2 mutations. Herein, we review the current molecular genetic studies, which investigate the genetic causes of RTT or Rett-like phenotypes which overlap with other genetic disorders and document the swift evolution of the techniques and methodologies employed. This review also underlines the clinical and genetic heterogeneity of the Rett syndrome spectrum and provides an overview of the RTT-related genes described to date, many of which are involved in epigenetic gene regulation, neurotransmitter action or RNA transcription/translation. Finally, it discusses the importance of including both phenotypic and genetic diagnosis to provide proper genetic counselling from a patient's perspective and the appropriate treatment.
Collapse
Affiliation(s)
- Silvia Vidal
- Sant Joan de Déu Research Foundation, 08950 Barcelona, Spain
- Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, 08950 Barcelona, Spain
| | - Clara Xiol
- Sant Joan de Déu Research Foundation, 08950 Barcelona, Spain
- Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, 08950 Barcelona, Spain
| | - Ainhoa Pascual-Alonso
- Sant Joan de Déu Research Foundation, 08950 Barcelona, Spain
- Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, 08950 Barcelona, Spain
| | - M O'Callaghan
- Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, 08950 Barcelona, Spain
- Neurology Service, Hospital Sant Joan de Déu, 08950 Barcelona, Spain
- CIBER-ER (Biomedical Network Research Center for Rare Diseases), Institute of Health Carlos III (ISCIII), 28029 Madrid, Spain
| | - Mercè Pineda
- Sant Joan de Déu Research Foundation, 08950 Barcelona, Spain
| | - Judith Armstrong
- Institut de Recerca Pediàtrica Hospital Sant Joan de Déu, 08950 Barcelona, Spain.
- CIBER-ER (Biomedical Network Research Center for Rare Diseases), Institute of Health Carlos III (ISCIII), 28029 Madrid, Spain.
- Molecular and Genetics Medicine Section, Hospital Sant Joan de Déu, 08950 Barcelona, Spain.
| |
Collapse
|
13
|
Kumaran M, Subramanian U, Devarajan B. Performance assessment of variant calling pipelines using human whole exome sequencing and simulated data. BMC Bioinformatics 2019; 20:342. [PMID: 31208315 PMCID: PMC6580603 DOI: 10.1186/s12859-019-2928-9] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2018] [Accepted: 05/31/2019] [Indexed: 12/30/2022] Open
Abstract
Background Whole exome sequencing (WES) is a cost-effective method that identifies clinical variants but it demands accurate variant caller tools. Currently available tools have variable accuracy in predicting specific clinical variants. But it may be possible to find the best combination of aligner-variant caller tools for detecting accurate single nucleotide variants (SNVs) and small insertion and deletion (InDels) separately. Moreover, many important aspects of InDel detection are overlooked while comparing the performance of tools, particularly its base pair length. Results We assessed the performance of variant calling pipelines using the combinations of four variant callers and five aligners on human NA12878 and simulated exome data. We used high confidence variant calls from Genome in a Bottle (GiaB) consortium for validation, and GRCh37 and GRCh38 as the human reference genome. Based on the performance metrics, both BWA and Novoalign aligners performed better with DeepVariant and SAMtools callers for detecting SNVs, and with DeepVariant and GATK for InDels. Furthermore, we obtained similar results on human NA24385 and NA24631 exome data from GiaB. Conclusion In this study, DeepVariant with BWA and Novoalign performed best for detecting accurate SNVs and InDels. The accuracy of variant calling was improved by merging the top performing pipelines. The results of our study provide useful recommendations for analysis of WES data in clinical genomics. Electronic supplementary material The online version of this article (10.1186/s12859-019-2928-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Manojkumar Kumaran
- Department of Bioinformatics, Aravind Medical Research Foundation, Madurai, Tamil Nadu, 625020, India.,School of Chemical and Biotechnology, SASTRA (Deemed to be University), Thanjavur, Tamil Nadu, 613401, India
| | - Umadevi Subramanian
- Department of Bioinformatics, Aravind Medical Research Foundation, Madurai, Tamil Nadu, 625020, India
| | - Bharanidharan Devarajan
- Department of Bioinformatics, Aravind Medical Research Foundation, Madurai, Tamil Nadu, 625020, India.
| |
Collapse
|
14
|
Chen HL, Li HY, Wu JF, Wu SH, Chen HL, Yang YH, Hsu YH, Liou BY, Chang MH, Ni YH. Panel-Based Next-Generation Sequencing for the Diagnosis of Cholestatic Genetic Liver Diseases: Clinical Utility and Challenges. J Pediatr 2019; 205:153-159.e6. [PMID: 30366773 DOI: 10.1016/j.jpeds.2018.09.028] [Citation(s) in RCA: 30] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 06/22/2018] [Revised: 08/08/2018] [Accepted: 09/11/2018] [Indexed: 12/12/2022]
Abstract
OBJECTIVE To test the application of a target enrichment next-generation sequencing (NGS) jaundice panel in genetic diagnosis of pediatric liver diseases. STUDY DESIGN We developed a capture-based target enrichment NGS jaundice panel containing 42 known disease-causing genes associated with jaundice or cholestasis and 10 pathway-related genes. During 2015-2017, 102 pediatric patients with various forms of cholestasis or idiopathic liver diseases were tested, including patients with initial diagnosis of cholestasis in infancy, progressive familial intrahepatic cholestasis, syndromic cholestasis, Wilson disease, and others. RESULTS Of the 102 patients, 137 mutations/variants in 44 different genes were identified in 84 patients. The genetic disease diagnosis rate was 33 of 102 (32.4%). A total of 79 of 102 (77.5%) of patients had at least 1 heterozygous genetic variation. Those with progressive intrahepatic cholestasis or syndromic cholestasis in infancy had a diagnostic rate of 62.5%. Disease-causing mutations, including ATP8B1, ABCB11, ABCB4, ABCC2, TJP2, NR1H4 (FXR), JAG1, AKR1D1, CYP7B1, PKHD1, ATP7B, and SLC25A13, were identified. Nine patients had unpredicted genetic diagnosis with atypical phenotype or novel mutations in the investigational genes. We propose an NGS diagnosis classification categorizing patients into high (n = 24), moderate (n = 9), or weak (n = 25) levels of genotype-phenotype correlations to facilitate patient management. CONCLUSIONS This panel enabled high-throughput detection of genetic variants and disease diagnosis in patients with a long list of candidate causative genes. A NGS report with diagnosis classification may aid clinicians in data interpretation and patient management.
Collapse
Affiliation(s)
- Huey-Ling Chen
- Department of Pediatrics, National Taiwan University College of Medicine and Children's Hospital, Taipei, Taiwan
| | - Huei-Ying Li
- Medical Microbiome Center, National Taiwan University College of Medicine, Taipei, Taiwan
| | - Jia-Feng Wu
- Department of Pediatrics, National Taiwan University College of Medicine and Children's Hospital, Taipei, Taiwan
| | - Shang-Hsin Wu
- Graduate Institution of Clinical Medicine, National Taiwan University College of Medicine, Taipei, Taiwan
| | - Hui-Ling Chen
- Hepatitis Research Center, National Taiwan University Hospital, Taipei, Taiwan
| | - Yu-Hsuan Yang
- Medical Microbiome Center, National Taiwan University College of Medicine, Taipei, Taiwan
| | - Yu-Hua Hsu
- Department of Medical Genetics, National Taiwan University Hospital, Taipei, Taiwan
| | - Bang-Yu Liou
- Department of Pediatrics, National Taiwan University College of Medicine and Children's Hospital, Taipei, Taiwan
| | - Mei-Hwei Chang
- Department of Pediatrics, National Taiwan University College of Medicine and Children's Hospital, Taipei, Taiwan
| | - Yen-Hsuan Ni
- Department of Pediatrics, National Taiwan University College of Medicine and Children's Hospital, Taipei, Taiwan; Medical Microbiome Center, National Taiwan University College of Medicine, Taipei, Taiwan; Hepatitis Research Center, National Taiwan University Hospital, Taipei, Taiwan.
| |
Collapse
|
15
|
Townend GS, Ehrhart F, van Kranen HJ, Wilkinson M, Jacobsen A, Roos M, Willighagen EL, van Enckevort D, Evelo CT, Curfs LMG. MECP2 variation in Rett syndrome-An overview of current coverage of genetic and phenotype data within existing databases. Hum Mutat 2018; 39:914-924. [PMID: 29704307 PMCID: PMC6033003 DOI: 10.1002/humu.23542] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/22/2017] [Revised: 04/18/2018] [Accepted: 04/23/2018] [Indexed: 12/30/2022]
Abstract
Rett syndrome (RTT) is a monogenic rare disorder that causes severe neurological problems. In most cases, it results from a loss-of-function mutation in the gene encoding methyl-CPG-binding protein 2 (MECP2). Currently, about 900 unique MECP2 variations (benign and pathogenic) have been identified and it is suspected that the different mutations contribute to different levels of disease severity. For researchers and clinicians, it is important that genotype-phenotype information is available to identify disease-causing mutations for diagnosis, to aid in clinical management of the disorder, and to provide counseling for parents. In this study, 13 genotype-phenotype databases were surveyed for their general functionality and availability of RTT-specific MECP2 variation data. For each database, we investigated findability and interoperability alongside practical user functionality, and type and amount of genetic and phenotype data. The main conclusions are that, as well as being challenging to find these databases and specific MECP2 variants held within, interoperability is as yet poorly developed and requires effort to search across databases. Nevertheless, we found several thousand online database entries for MECP2 variations and their associated phenotypes, diagnosis, or predicted variant effects, which is a good starting point for researchers and clinicians who want to provide, annotate, and use the data.
Collapse
Affiliation(s)
- Gillian S Townend
- Rett Expertise Centre Netherlands - GKC, Maastricht University Medical Center, Maastricht, The Netherlands
| | - Friederike Ehrhart
- Rett Expertise Centre Netherlands - GKC, Maastricht University Medical Center, Maastricht, The Netherlands.,Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
| | - Henk J van Kranen
- Rett Expertise Centre Netherlands - GKC, Maastricht University Medical Center, Maastricht, The Netherlands.,Institute for Public Health Genomics, Maastricht University, Maastricht, The Netherlands
| | - Mark Wilkinson
- Center for Plant Biotechnology and Genomics, Universidad Politécnica de Madrid, Madrid, Spain
| | - Annika Jacobsen
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Marco Roos
- Department of Human Genetics, Leiden University Medical Center, Leiden, The Netherlands
| | - Egon L Willighagen
- Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
| | - David van Enckevort
- Department of Genetics, University Medical Center Groningen, University of Groningen, Groningen, The Netherlands
| | - Chris T Evelo
- Rett Expertise Centre Netherlands - GKC, Maastricht University Medical Center, Maastricht, The Netherlands.,Department of Bioinformatics - BiGCaT, NUTRIM, Maastricht University, Maastricht, The Netherlands
| | - Leopold M G Curfs
- Rett Expertise Centre Netherlands - GKC, Maastricht University Medical Center, Maastricht, The Netherlands
| |
Collapse
|
16
|
Retshabile G, Mlotshwa BC, Williams L, Mwesigwa S, Mboowa G, Huang Z, Rustagi N, Swaminathan S, Katagirya E, Kyobe S, Wayengera M, Kisitu GP, Kateete DP, Wampande EM, Maplanka K, Kasvosve I, Pettitt ED, Matshaba M, Nsangi B, Marape M, Tsimako-Johnstone M, Brown CW, Yu F, Kekitiinwa A, Joloba M, Mpoloka SW, Mardon G, Anabwani G, Hanchard NA. Whole-Exome Sequencing Reveals Uncaptured Variation and Distinct Ancestry in the Southern African Population of Botswana. Am J Hum Genet 2018; 102:731-743. [PMID: 29706352 DOI: 10.1016/j.ajhg.2018.03.010] [Citation(s) in RCA: 32] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2017] [Accepted: 02/26/2018] [Indexed: 01/08/2023] Open
Abstract
Large-scale, population-based genomic studies have provided a context for modern medical genetics. Among such studies, however, African populations have remained relatively underrepresented. The breadth of genetic diversity across the African continent argues for an exploration of local genomic context to facilitate burgeoning disease mapping studies in Africa. We sought to characterize genetic variation and to assess population substructure within a cohort of HIV-positive children from Botswana-a Southern African country that is regionally underrepresented in genomic databases. Using whole-exome sequencing data from 164 Batswana and comparisons with 150 similarly sequenced HIV-positive Ugandan children, we found that 13%-25% of variation observed among Batswana was not captured by public databases. Uncaptured variants were significantly enriched (p = 2.2 × 10-16) for coding variants with minor allele frequencies between 1% and 5% and included predicted-damaging non-synonymous variants. Among variants found in public databases, corresponding allele frequencies varied widely, with Botswana having significantly higher allele frequencies among rare (<1%) pathogenic and damaging variants. Batswana clustered with other Southern African populations, but distinctly from 1000 Genomes African populations, and had limited evidence for admixture with extra-continental ancestries. We also observed a surprising lack of genetic substructure in Botswana, despite multiple tribal ethnicities and language groups, alongside a higher degree of relatedness than purported founder populations from the 1000 Genomes project. Our observations reveal a complex, but distinct, ancestral history and genomic architecture among Batswana and suggest that disease mapping within similar Southern African populations will require a deeper repository of genetic variation and allelic dependencies than presently exists.
Collapse
Affiliation(s)
- Gaone Retshabile
- Department of Biological Sciences, University of Botswana, Gaborone, Botswana
| | - Busisiwe C Mlotshwa
- Department of Biological Sciences, University of Botswana, Gaborone, Botswana
| | - Lesedi Williams
- Department of Biological Sciences, University of Botswana, Gaborone, Botswana
| | - Savannah Mwesigwa
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Gerald Mboowa
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda; Department of Immunology and Molecular Biology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Zhuoyi Huang
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Navin Rustagi
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Shanker Swaminathan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; USDA/ARS/Children's Nutrition Research Center, Baylor College of Medicine, Houston, TX 77030, USA
| | - Eric Katagirya
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Samuel Kyobe
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Misaki Wayengera
- Department of Immunology and Molecular Biology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Grace P Kisitu
- Baylor College of Medicine Children's Foundation, Kampala, Uganda
| | - David P Kateete
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda; Department of Immunology and Molecular Biology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Eddie M Wampande
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda; Department of Bio-molecular Resources, College of Veterinary Medicine, Makerere University, Kampala, Uganda
| | - Koketso Maplanka
- Department of Biological Sciences, University of Botswana, Gaborone, Botswana
| | - Ishmael Kasvosve
- Department of Medical Laboratory Sciences, University of Botswana, Gaborone, Botswana
| | - Edward D Pettitt
- Botswana-Baylor Children's Clinical Centre of Excellence, Gaborone, Botswana
| | - Mogomotsi Matshaba
- Botswana-Baylor Children's Clinical Centre of Excellence, Gaborone, Botswana; Pediatric Retrovirology, Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Betty Nsangi
- Baylor College of Medicine Children's Foundation, Kampala, Uganda
| | - Marape Marape
- Botswana-Baylor Children's Clinical Centre of Excellence, Gaborone, Botswana
| | | | - Chester W Brown
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; University of Tennessee Health Science Center, Memphis, TN 38105, USA
| | - Fuli Yu
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX 77030, USA; Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Adeodata Kekitiinwa
- Baylor College of Medicine Children's Foundation, Kampala, Uganda; Pediatric Retrovirology, Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Moses Joloba
- Department of Medical Microbiology, College of Health Sciences, Makerere University, Kampala, Uganda
| | - Sununguko W Mpoloka
- Department of Biological Sciences, University of Botswana, Gaborone, Botswana
| | - Graeme Mardon
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; Department of Pathology and Immunology, Baylor College of Medicine, Houston, TX 77030, USA
| | - Gabriel Anabwani
- Botswana-Baylor Children's Clinical Centre of Excellence, Gaborone, Botswana; Pediatric Retrovirology, Department of Pediatrics, Baylor College of Medicine, Houston, TX 77030, USA
| | - Neil A Hanchard
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX 77030, USA; USDA/ARS/Children's Nutrition Research Center, Baylor College of Medicine, Houston, TX 77030, USA.
| |
Collapse
|
17
|
Yauy K, Baux D, Pegeot H, Van Goethem C, Mathieu C, Guignard T, Juntas Morales R, Lacourt D, Krahn M, Lehtokari VL, Bonne G, Tuffery-Giraud S, Koenig M, Cossée M. MoBiDiC Prioritization Algorithm, a Free, Accessible, and Efficient Pipeline for Single-Nucleotide Variant Annotation and Prioritization for Next-Generation Sequencing Routine Molecular Diagnosis. J Mol Diagn 2018; 20:465-473. [PMID: 29689380 DOI: 10.1016/j.jmoldx.2018.03.009] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/10/2017] [Revised: 02/16/2018] [Accepted: 03/09/2018] [Indexed: 12/22/2022] Open
Abstract
Interpretation of next-generation sequencing constitutes the main limitation of molecular diagnostics. In diagnosing myopathies and muscular dystrophies, another issue is efficiency in predicting the pathogenicity of variants identified in large genes, especially TTN; current in silico prediction tools show limitations in predicting and ranking the numerous variants of such genes. We propose a variant-prioritization tool, the MoBiDiCprioritization algorithm (MPA). MPA is based on curated interpretation of data on previously reported variants, biological assumptions, and splice and missense predictors, and is used to prioritize all types of single-nucleotide variants. MPA was validated by comparing its sensitivity and specificity to those of dbNSFP database prediction tools, using a data set composed of DYSF, DMD, LMNA, NEB, and TTN variants extracted from expert-reviewed and ExAC databases. MPA obtained the best annotation rates for missense and splice variants. As MPA aggregates the results from several predictors, individual predictor errors are counterweighted, improving the sensitivity and specificity of missense and splice variant predictions. We propose a sequential use of MPA, beginning with the selection of variants with higher scores and followed by, in the absence of candidate pathologic variants, consideration of variants with lower scores. We provide scripts and documentation for free academic use and a validated annotation pipeline scaled for panel and exome sequencing to prioritize single-nucleotide variants from a VCF file.
Collapse
Affiliation(s)
- Kevin Yauy
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France.
| | - David Baux
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Henri Pegeot
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Charles Van Goethem
- Laboratoire de Biopathologie Cellulaire et Tissulaire des Tumeurs, Hôpital Arnaud de Villeneuve, CHU Montpellier, Montpellier, France
| | - Charly Mathieu
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Thomas Guignard
- Plateforme Recherche de Microremaniements Chromosomiques-Centre de Référence Anomalies du Développement et Syndromes Malformatifs, Département de Génétique Médicale, Maladies Rares et Médecine Personnalisée, CHU Montpellier, Montpellier, France; Faculté de Médecine Montpellier-Nîmes, Université de Montpellier, Montpellier, France
| | | | - Delphine Lacourt
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France
| | - Martin Krahn
- Unité de Génétique Médicale et Génomique Fonctionnelle INSERM UMRS910, Université d'Aix Marseille, Marseille, France; Département de Génétique Médicale, Hôpital Timone Enfants, Assistance Publique Hôpitaux de Marseille, Marseille, France
| | - Vilma-Lotta Lehtokari
- The Folkhalsan Institute of Genetics and the Department of Medical Genetics, Haartman Institute, University of Helsinki, Helsinki, Finland
| | - Gisele Bonne
- Unité INSERM U974-Thérapie des Maladies du Muscle Striée, Center of Research in Myology, Institut de Myologie, Université Pierre et Marie Curie, Sorbonne Universités, Paris, France
| | - Sylvie Tuffery-Giraud
- Laboratoire de Génétique des Maladies Rares EA7402, Université de Montpellier, Montpellier, France
| | - Michel Koenig
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France; Laboratoire de Génétique des Maladies Rares EA7402, Université de Montpellier, Montpellier, France
| | - Mireille Cossée
- Laboratoire de Génétique Moléculaire, CHU Montpellier, Montpellier, France; Laboratoire de Génétique des Maladies Rares EA7402, Université de Montpellier, Montpellier, France
| |
Collapse
|
18
|
van den Akker J, Mishne G, Zimmer AD, Zhou AY. A machine learning model to determine the accuracy of variant calls in capture-based next generation sequencing. BMC Genomics 2018; 19:263. [PMID: 29665779 PMCID: PMC5904977 DOI: 10.1186/s12864-018-4659-0] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2017] [Accepted: 04/10/2018] [Indexed: 12/30/2022] Open
Abstract
BACKGROUND Next generation sequencing (NGS) has become a common technology for clinical genetic tests. The quality of NGS calls varies widely and is influenced by features like reference sequence characteristics, read depth, and mapping accuracy. With recent advances in NGS technology and software tools, the majority of variants called using NGS alone are in fact accurate and reliable. However, a small subset of difficult-to-call variants that still do require orthogonal confirmation exist. For this reason, many clinical laboratories confirm NGS results using orthogonal technologies such as Sanger sequencing. Here, we report the development of a deterministic machine-learning-based model to differentiate between these two types of variant calls: those that do not require confirmation using an orthogonal technology (high confidence), and those that require additional quality testing (low confidence). This approach allows reliable NGS-based calling in a clinical setting by identifying the few important variant calls that require orthogonal confirmation. RESULTS We developed and tested the model using a set of 7179 variants identified by a targeted NGS panel and re-tested by Sanger sequencing. The model incorporated several signals of sequence characteristics and call quality to determine if a variant was identified at high or low confidence. The model was tuned to eliminate false positives, defined as variants that were called by NGS but not confirmed by Sanger sequencing. The model achieved very high accuracy: 99.4% (95% confidence interval: +/- 0.03%). It categorized 92.2% (6622/7179) of the variants as high confidence, and 100% of these were confirmed to be present by Sanger sequencing. Among the variants that were categorized as low confidence, defined as NGS calls of low quality that are likely to be artifacts, 92.1% (513/557) were found to be not present by Sanger sequencing. CONCLUSIONS This work shows that NGS data contains sufficient characteristics for a machine-learning-based model to differentiate low from high confidence variants. Additionally, it reveals the importance of incorporating site-specific features as well as variant call features in such a model.
Collapse
Affiliation(s)
| | - Gilad Mishne
- Color Genomics, 831 Mitten Road, Burlingame, CA, 94010, USA
| | | | - Alicia Y Zhou
- Color Genomics, 831 Mitten Road, Burlingame, CA, 94010, USA.
| |
Collapse
|
19
|
Szot JO, Cuny H, Blue GM, Humphreys DT, Ip E, Harrison K, Sholler GF, Giannoulatou E, Leo P, Duncan EL, Sparrow DB, Ho JWK, Graham RM, Pachter N, Chapman G, Winlaw DS, Dunwoodie SL. A Screening Approach to Identify Clinically Actionable Variants Causing Congenital Heart Disease in Exome Data. CIRCULATION. GENOMIC AND PRECISION MEDICINE 2018; 11:e001978. [PMID: 29555671 DOI: 10.1161/circgen.117.001978] [Citation(s) in RCA: 48] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/05/2017] [Accepted: 01/18/2018] [Indexed: 01/19/2023]
Abstract
BACKGROUND Congenital heart disease (CHD)-structural abnormalities of the heart that arise during embryonic development-is the most common inborn malformation, affecting ≤1% of the population. However, currently, only a minority of cases can be explained by genetic abnormalities. The goal of this study was to identify disease-causal genetic variants in 30 families affected by CHD. METHODS Whole-exome sequencing was performed with the DNA of multiple family members. We utilized a 2-tiered whole-exome variant screening and interpretation procedure. First, we manually curated a high-confidence list of 90 genes known to cause CHD in humans, identified predicted damaging variants in genes on this list, and rated their pathogenicity using American College of Medical Genetics and Genomics-Association for Molecular Pathology guidelines. RESULTS In 3 families (10%), we found pathogenic variants in known CHD genes TBX5, TFAP2B, and PTPN11, explaining the cardiac lesions. Second, exomes were comprehensively analyzed to identify additional predicted damaging variants that segregate with disease in CHD candidate genes. In 10 additional families (33%), likely disease-causal variants were uncovered in PBX1, CNOT1, ZFP36L2, TEK, USP34, UPF2, KDM5A, KMT2C, TIE1, TEAD2, and FLT4. CONCLUSIONS The pathogenesis of CHD could be explained using our high-confidence CHD gene list for variant filtering in a subset of cases. Furthermore, our unbiased screening procedure of family exomes implicates additional genes and variants in the pathogenesis of CHD, which suggest themselves for functional validation. This 2-tiered approach provides a means of (1) identifying clinically actionable variants and (2) identifying additional disease-causal genes, both of which are essential for improving the molecular diagnosis of CHD.
Collapse
Affiliation(s)
- Justin O Szot
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Hartmut Cuny
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Gillian M Blue
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - David T Humphreys
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Eddie Ip
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Katrina Harrison
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Gary F Sholler
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Eleni Giannoulatou
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Paul Leo
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Emma L Duncan
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Duncan B Sparrow
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Joshua W K Ho
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Robert M Graham
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Nicholas Pachter
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Gavin Chapman
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - David S Winlaw
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.)
| | - Sally L Dunwoodie
- From the Victor Chang Cardiac Research Institute, Sydney, New South Wales, Australia (J.O.S., H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.); Faculty of Science (J.O.S., S.L.D.) and Faculty of Medicine (H.C., D.T.H., E.I., E.G., D.B.S., J.W.K.H., R.M.G., G.C., S.L.D.), University of New South Wales, Sydney, New South Wales, Australia, Sydney, New South Wales, Australia; Children's Hospital at Westmead, Heart Centre for Children (G.M.B., G.F.S., D.S.W.), Sydney, New South Wales, Australia; Sydney Medical School, University of Sydney, New South Wales, Australia (G.M.B., G.F.S., D.S.W.); Genetic Services of Western Australia, Perth (K.H., N.P.); Sydney Children's Hospitals Network, New South Wales, Australia (G.F.S.); Institute of Health and Biomedical Innovation, Queensland University of Technology (P.L., E.L.D.); Department of Endocrinology and Diabetes, Royal Brisbane and Women's Hospital, Brisbane, Queensland, Australia (E.L.D.); University of Queensland, Brisbane (E.L.D.); and School of Paediatrics and Child Health, University of Western Australia, Perth, Western Australia, Australia (N.P.).
| |
Collapse
|
20
|
Abstract
PURPOSE OF REVIEW This article reviews the current molecular genetic studies, which investigate the genetic causes of Rett syndrome or Rett-like phenotypes without a MECP2 mutation. RECENT FINDINGS As next generation sequencing becomes broadly available, especially whole exome sequencing is used in clinical diagnosis of the genetic causes of a wide spectrum of intellectual disability, autism, and encephalopathies. Patients who were diagnosed with Rett syndrome or Rett-like syndrome because of their phenotype but were negative for mutations in the MECP2, CDKL5 or FOXG1 genes were subjected to whole exome sequencing and the results of the last few years revealed yet 69 different genes. Many of these genes are involved in epigenetic gene regulation, chromatin shaping, neurotransmitter action or RNA transcription/translation. Genetic data also allows to investigate the individual genetic background of an individual patient, which can modify the severity of a genetic disorder. SUMMARY We conclude that the Rett syndrome phenotype has a much broader underlying genetic cause and the typical phenotype overlap with other genetic disorders. For proper genetic counselling, patient perspective and treatment it is important to include both phenotype and genetic information.
Collapse
|
21
|
Soens ZT, Branch J, Wu S, Yuan Z, Li Y, Li H, Wang K, Xu M, Rajan L, Motta FL, Simões RT, Lopez-Solache I, Ajlan R, Birch DG, Zhao P, Porto FB, Sallum J, Koenekoop RK, Sui R, Chen R. Leveraging splice-affecting variant predictors and a minigene validation system to identify Mendelian disease-causing variants among exon-captured variants of uncertain significance. Hum Mutat 2017; 38:1521-1533. [PMID: 28714225 DOI: 10.1002/humu.23294] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2017] [Revised: 06/20/2017] [Accepted: 07/11/2017] [Indexed: 12/11/2022]
Abstract
The genetic heterogeneity of Mendelian disorders results in a significant proportion of patients that are unable to be assigned a confident molecular diagnosis after conventional exon sequencing and variant interpretation. Here, we evaluated how many patients with an inherited retinal disease (IRD) have variants of uncertain significance (VUS) that are disrupting splicing in a known IRD gene by means other than affecting the canonical dinucleotide splice site. Three in silico splice-affecting variant predictors were leveraged to annotate and prioritize variants for splicing functional validation. An in vitro minigene system was used to assay each variant's effect on splicing. Starting with 745 IRD patients lacking a confident molecular diagnosis, we validated 23 VUS as splicing variants that likely explain disease in 26 patients. Using our results, we optimized in silico score cutoffs to guide future variant interpretation. Variants that alter base pairs other than the canonical GT-AG dinucleotide are often not considered for their potential effect on RNA splicing but in silico tools and a minigene system can be utilized for the prioritization and validation of such splice-disrupting variants. These variants can be overlooked causes of human disease but can be identified using conventional exon sequencing with proper interpretation guidelines.
Collapse
Affiliation(s)
- Zachry T Soens
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Justin Branch
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Shijing Wu
- Department of Ophthalmology, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
| | - Zhisheng Yuan
- Department of Ophthalmology, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
| | - Yumei Li
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Hui Li
- Department of Ophthalmology, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
| | - Keqing Wang
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Mingchu Xu
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Lavan Rajan
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas
| | - Fabiana L Motta
- Department of Ophthalmology and Visual Sciences, Paulista School of Medicine, Federal University of São Paulo, São Paulo, Brazil
| | - Renata T Simões
- Department of Retina and Vitreous, Ophthalmologic Center of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.,Instituto de Ensino e Pesquisa da Santa Casa de Belo Horizonte Hospital - IEP/SCBH, Belo Horizonte, Minas Gerais, Brazil
| | - Irma Lopez-Solache
- McGill Ocular Genetics Laboratory and Centre, Department of Paediatric Surgery, Human Genetics, and Ophthalmology, McGill University Health Centre, Montreal, Quebec, Canada
| | - Radwan Ajlan
- McGill Ocular Genetics Laboratory and Centre, Department of Paediatric Surgery, Human Genetics, and Ophthalmology, McGill University Health Centre, Montreal, Quebec, Canada
| | - David G Birch
- Retina Foundation of the Southwest and Department of Ophthalmology, University of Texas Southwestern Medical Center, Dallas, Texas
| | - Peiquan Zhao
- Department of Ophthalmology, Xin Hua Hospital affiliated to Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | - Fernanda B Porto
- Department of Retina and Vitreous, Ophthalmologic Center of Minas Gerais, Belo Horizonte, Minas Gerais, Brazil.,Instituto de Ensino e Pesquisa da Santa Casa de Belo Horizonte Hospital - IEP/SCBH, Belo Horizonte, Minas Gerais, Brazil
| | - Juliana Sallum
- Department of Ophthalmology and Visual Sciences, Paulista School of Medicine, Federal University of São Paulo, São Paulo, Brazil
| | - Robert K Koenekoop
- McGill Ocular Genetics Laboratory and Centre, Department of Paediatric Surgery, Human Genetics, and Ophthalmology, McGill University Health Centre, Montreal, Quebec, Canada
| | - Ruifang Sui
- Department of Ophthalmology, Peking Union Medical College Hospital, Peking Union Medical College, Chinese Academy of Medical Sciences, Beijing, China
| | - Rui Chen
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, Texas.,Human Genome Sequencing Center, Baylor College of Medicine, Houston, Texas.,Department of Biochemistry and Molecular Biology, Baylor College of Medicine, Houston, Texas.,Department of Structural and Computational Biology & Molecular Biophysics, Baylor College of Medicine, Houston, Texas
| |
Collapse
|
22
|
Chakravorty S, Hegde M. Gene and Variant Annotation for Mendelian Disorders in the Era of Advanced Sequencing Technologies. Annu Rev Genomics Hum Genet 2017; 18:229-256. [PMID: 28415856 DOI: 10.1146/annurev-genom-083115-022545] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Comprehensive annotations of genetic and noncoding regions and corresponding accurate variant classification for Mendelian diseases are the next big challenge in the new genomic era of personalized medicine. Progress in the development of faster and more accurate pipelines for genome annotation and variant classification will lead to the discovery of more novel disease associations and candidate therapeutic targets. This ultimately will facilitate better patient recruitment in clinical trials. In this review, we describe the trends in research at the intersection of basic and clinical genomics that aims to increase understanding of overall genomic complexity, complex inheritance patterns of disease, and patient-phenotype-specific genomic associations. We describe the emerging field of translational functional genomics, which integrates other functional "-omics" approaches that support next-generation sequencing genomic data in order to facilitate personalized diagnostics, disease management, biomarker discovery, and medicine. We also discuss the utility of this integrated approach for diagnostic clinics and medical databases and its role in the future of personalized medicine.
Collapse
Affiliation(s)
- Samya Chakravorty
- Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia 30322;
| | - Madhuri Hegde
- Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia 30322;
| |
Collapse
|
23
|
Hegde M, Santani A, Mao R, Ferreira-Gonzalez A, Weck KE, Voelkerding KV. Development and Validation of Clinical Whole-Exome and Whole-Genome Sequencing for Detection of Germline Variants in Inherited Disease. Arch Pathol Lab Med 2017; 141:798-805. [DOI: 10.5858/arpa.2016-0622-ra] [Citation(s) in RCA: 38] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Abstract
Context.—
With the decrease in the cost of sequencing, the clinical testing paradigm has shifted from single gene to gene panel and now whole-exome and whole-genome sequencing. Clinical laboratories are rapidly implementing next-generation sequencing–based whole-exome and whole-genome sequencing. Because a large number of targets are covered by whole-exome and whole-genome sequencing, it is critical that a laboratory perform appropriate validation studies, develop a quality assurance and quality control program, and participate in proficiency testing.
Objective.—
To provide recommendations for whole-exome and whole-genome sequencing assay design, validation, and implementation for the detection of germline variants associated in inherited disorders.
Data Sources.—
An example of trio sequencing, filtration and annotation of variants, and phenotypic consideration to arrive at clinical diagnosis is discussed.
Conclusions.—
It is critical that clinical laboratories planning to implement whole-exome and whole-genome sequencing design and validate the assay to specifications and ensure adequate performance prior to implementation. Test design specifications, including variant filtering and annotation, phenotypic consideration, guidance on consenting options, and reporting of incidental findings, are provided. These are important steps a laboratory must take to validate and implement whole-exome and whole-genome sequencing in a clinical setting for germline variants in inherited disorders.
Collapse
Affiliation(s)
| | | | | | | | | | - Karl V. Voelkerding
- From the Department of Human Genetics, Emory University School of Medicine, Atlanta, Georgia (Dr Hegde); the Department of Clinical Pathology and Laboratory Medicine, Perelman School of Medicine, University of Pennsylvania, Philadelphia (Dr Santani); the Division of Genomic Diagnostics, Children's Hospital of Philadelphia, Philadelphia, Pennsylvania (Dr Santani); the Department of Pathology, ARUP
| |
Collapse
|
24
|
He KY, Ge D, He MM. Big Data Analytics for Genomic Medicine. Int J Mol Sci 2017; 18:ijms18020412. [PMID: 28212287 PMCID: PMC5343946 DOI: 10.3390/ijms18020412] [Citation(s) in RCA: 104] [Impact Index Per Article: 14.9] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/24/2016] [Revised: 02/08/2017] [Accepted: 02/09/2017] [Indexed: 12/25/2022] Open
Abstract
Genomic medicine attempts to build individualized strategies for diagnostic or therapeutic decision-making by utilizing patients’ genomic information. Big Data analytics uncovers hidden patterns, unknown correlations, and other insights through examining large-scale various data sets. While integration and manipulation of diverse genomic data and comprehensive electronic health records (EHRs) on a Big Data infrastructure exhibit challenges, they also provide a feasible opportunity to develop an efficient and effective approach to identify clinically actionable genetic variants for individualized diagnosis and therapy. In this paper, we review the challenges of manipulating large-scale next-generation sequencing (NGS) data and diverse clinical data derived from the EHRs for genomic medicine. We introduce possible solutions for different challenges in manipulating, managing, and analyzing genomic and clinical data to implement genomic medicine. Additionally, we also present a practical Big Data toolset for identifying clinically actionable genetic variants using high-throughput NGS data and EHRs.
Collapse
Affiliation(s)
- Karen Y He
- Department of Epidemiology and Biostatistics, Case Western Reserve University, Cleveland, OH 44106, USA.
| | | | - Max M He
- BioSciKin Co., Ltd., Nanjing 210042, China.
- Computation and Informatics in Biology and Medicine, University of Wisconsin-Madison, Madison, WI 53706, USA.
| |
Collapse
|
25
|
Fowler A, Mahamdallie S, Ruark E, Seal S, Ramsay E, Clarke M, Uddin I, Wylie H, Strydom A, Lunter G, Rahman N. Accurate clinical detection of exon copy number variants in a targeted NGS panel using DECoN. Wellcome Open Res 2016; 1:20. [PMID: 28459104 PMCID: PMC5409526 DOI: 10.12688/wellcomeopenres.10069.1] [Citation(s) in RCA: 66] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Background: Targeted next generation sequencing (NGS) panels are increasingly being used in clinical genomics to increase capacity, throughput and affordability of gene testing. Identifying whole exon deletions or duplications (termed exon copy number variants, ‘exon CNVs’) in exon-targeted NGS panels has proved challenging, particularly for single exon CNVs. Methods: We developed a tool for the
Detection of
Exon
Copy
Number variants (DECoN), which is optimised for analysis of exon-targeted NGS panels in the clinical setting. We evaluated DECoN performance using 96 samples with independently validated exon CNV data. We performed simulations to evaluate DECoN detection performance of single exon CNVs and to evaluate performance using different coverage levels and sample numbers. Finally, we implemented DECoN in a clinical laboratory that tests
BRCA1 and
BRCA2 with the TruSight Cancer Panel (TSCP). We used DECoN to analyse 1,919 samples, validating exon CNV detections by multiplex ligation-dependent probe amplification (MLPA). Results: In the evaluation set, DECoN achieved 100% sensitivity and 99% specificity for BRCA exon CNVs, including identification of 8 single exon CNVs. DECoN also identified 14/15 exon CNVs in 8 other genes. Simulations of all possible BRCA single exon CNVs gave a mean sensitivity of 98% for deletions and 95% for duplications. DECoN performance remained excellent with different levels of coverage and sample numbers; sensitivity and specificity was >98% with the typical NGS run parameters. In the clinical pipeline, DECoN automatically analyses pools of 48 samples at a time, taking 24 minutes per pool, on average. DECoN detected 24 BRCA exon CNVs, of which 23 were confirmed by MLPA, giving a false discovery rate of 4%. Specificity was 99.7%. Conclusions: DECoN is a fast, accurate, exon CNV detection tool readily implementable in research and clinical NGS pipelines. It has high sensitivity and specificity and acceptable false discovery rate. DECoN is freely available at
www.icr.ac.uk/decon.
Collapse
Affiliation(s)
- Anna Fowler
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Shazia Mahamdallie
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Elise Ruark
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Sheila Seal
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Emma Ramsay
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Matthew Clarke
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK
| | - Imran Uddin
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Harriet Wylie
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Ann Strydom
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK
| | - Gerton Lunter
- Wellcome Trust Centre for Human Genetics, University of Oxford, Oxford, UK
| | - Nazneen Rahman
- Division of Genetics and Epidemiology, Institute of Cancer Research, London, UK.,TGLclinical, Institute of Cancer Research, London, UK.,Cancer Genetics Unit, The Royal Marsden NHS Foundation Trust, Sutton, UK
| |
Collapse
|
26
|
Horak P, Fröhling S, Glimm H. Integrating next-generation sequencing into clinical oncology: strategies, promises and pitfalls. ESMO Open 2016; 1:e000094. [PMID: 27933214 PMCID: PMC5133384 DOI: 10.1136/esmoopen-2016-000094] [Citation(s) in RCA: 109] [Impact Index Per Article: 13.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2016] [Revised: 10/06/2016] [Accepted: 10/17/2016] [Indexed: 12/24/2022] Open
Abstract
We live in an era of genomic medicine. The past five years brought about many significant achievements in the field of cancer genetics, driven by rapidly evolving technologies and plummeting costs of next-generation sequencing (NGS). The official completion of the Cancer Genome Project in 2014 led many to envision the clinical implementation of cancer genomic data as the next logical step in cancer therapy. Stemming from this vision, the term 'precision oncology' was coined to illustrate the novelty of this individualised approach. The basic assumption of precision oncology is that molecular markers detected by NGS will predict response to targeted therapies independently from tumour histology. However, along with a ubiquitous availability of NGS, the complexity and heterogeneity at the individual patient level had to be acknowledged. Not only does the latter present challenges to clinical decision-making based on sequencing data, it is also an obstacle to the rational design of clinical trials. Novel tissue-agnostic trial designs were quickly developed to overcome these challenges. Results from some of these trials have recently demonstrated the feasibility and efficacy of this approach. On the other hand, there is an increasing amount of whole-exome and whole-genome NGS data which allows us to assess ever smaller differences between individual patients with cancer. In this review, we highlight different tumour sequencing strategies currently used for precision oncology, describe their individual strengths and weaknesses, and emphasise their feasibility in different clinical settings. Further, we evaluate the possibility of NGS implementation in current and future clinical trials, and point to the significance of NGS for translational research.
Collapse
Affiliation(s)
- Peter Horak
- Department of Translational Oncology , National Center for Tumor Diseases Heidelberg, German Cancer Research Center (DKFZ) , Heidelberg , Germany
| | - Stefan Fröhling
- Department of Translational Oncology , National Center for Tumor Diseases Heidelberg, German Cancer Research Center (DKFZ) , Heidelberg , Germany
| | - Hanno Glimm
- Department of Translational Oncology , National Center for Tumor Diseases Heidelberg, German Cancer Research Center (DKFZ) , Heidelberg , Germany
| |
Collapse
|
27
|
OpEx - a validated, automated pipeline optimised for clinical exome sequence analysis. Sci Rep 2016; 6:31029. [PMID: 27485037 PMCID: PMC4971567 DOI: 10.1038/srep31029] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/16/2016] [Accepted: 07/12/2016] [Indexed: 11/09/2022] Open
Abstract
We present an easy-to-use, open-source Optimised Exome analysis tool, OpEx (http://icr.ac.uk/opex) that accurately detects small-scale variation, including indels, to clinical standards. We evaluated OpEx performance with an experimentally validated dataset (the ICR142 NGS validation series), a large 1000 exome dataset (the ICR1000 UK exome series), and a clinical proband-parent trio dataset. The performance of OpEx for high-quality base substitutions and short indels in both small and large datasets is excellent, with overall sensitivity of 95%, specificity of 97% and low false detection rate (FDR) of 3%. Depending on the individual performance requirements the OpEx output allows one to optimise the inevitable trade-offs between sensitivity and specificity. For example, in the clinical setting one could permit a higher FDR and lower specificity to maximise sensitivity. In contexts where experimental validation is not possible, minimising the FDR and improving specificity may be a preferable trade-off for slightly lower sensitivity. OpEx is simple to install and use; the whole pipeline is run from a single command. OpEx is therefore well suited to the increasing research and clinical laboratories undertaking exome sequencing, particularly those without in-house dedicated bioinformatics expertise.
Collapse
|