1
|
Kobren SN, Moldovan MA, Reimers R, Traviglia D, Li X, Barnum D, Veit A, Corona RI, Carvalho Neto GDV, Willett J, Berselli M, Ronchetti W, Nelson SF, Martinez-Agosto JA, Sherwood R, Krier J, Kohane IS, Undiagnosed Diseases Network, Sunyaev SR. Joint, multifaceted genomic analysis enables diagnosis of diverse, ultra-rare monogenic presentations. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2024:2024.02.13.580158. [PMID: 38405764 PMCID: PMC10888768 DOI: 10.1101/2024.02.13.580158] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 02/27/2024]
Abstract
Genomics for rare disease diagnosis has advanced at a rapid pace due to our ability to perform "N-of-1" analyses on individual patients with ultra-rare diseases. The increasing sizes of ultra-rare disease cohorts internationally newly enables cohort-wide analyses for new discoveries, but well-calibrated statistical genetics approaches for jointly analyzing these patients are still under development.1,2 The Undiagnosed Diseases Network (UDN) brings multiple clinical, research and experimental centers under the same umbrella across the United States to facilitate and scale N-of-1 analyses. Here, we present the first joint analysis of whole genome sequencing data of UDN patients across the network. We introduce new, well-calibrated statistical methods for prioritizing disease genes with de novo recurrence and compound heterozygosity. We also detect pathways enriched with candidate and known diagnostic genes. Our computational analysis, coupled with a systematic clinical review, recapitulated known diagnoses and revealed new disease associations. We further release a software package, RaMeDiES, enabling automated cross-analysis of deidentified sequenced cohorts for new diagnostic and research discoveries. Gene-level findings and variant-level information across the cohort are available in a public-facing browser (https://dbmi-bgm.github.io/udn-browser/). These results show that N-of-1 efforts should be supplemented by a joint genomic analysis across cohorts.
Collapse
Affiliation(s)
| | | | | | - Daniel Traviglia
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Xinyun Li
- Program in Computational Biology and Bioinformatics, Yale University, New Haven, CT
| | | | - Alexander Veit
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Rosario I. Corona
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - George de V. Carvalho Neto
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - Julian Willett
- Department of Pathology and Laboratory Medicine, NewYork-Presbyterian Weill Cornell Medical Center, New York, NY
| | - Michele Berselli
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - William Ronchetti
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | - Stanley F. Nelson
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - Julian A. Martinez-Agosto
- Department of Human Genetics, David Geffen School of Medicine, University of California, Los Angeles, CA
| | - Richard Sherwood
- Division of Genetics, Department of Medicine, Brigham and Women’s Hospital and Harvard Medical School, Boston, MA
| | - Joel Krier
- Department of Genetics, Atrius Health, Boston, MA
| | - Isaac S. Kohane
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| | | | - Shamil R. Sunyaev
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA
| |
Collapse
|
2
|
Tinker RJ, Bastarache L, Ezell K, Kobren SN, Esteves C, Rosenfeld JA, Macnamara EF, Hamid R, Cogan JD, Rinker D, Mukharjee S, Glass I, Dipple K, Phillips JA, Undiagnosed Diseases Network. The contribution of mosaicism to genetic diseases and de novo pathogenic variants. Am J Med Genet A 2023; 191:2482-2492. [PMID: 37246601 PMCID: PMC11167532 DOI: 10.1002/ajmg.a.63309] [Citation(s) in RCA: 7] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/01/2023] [Revised: 03/29/2023] [Accepted: 05/03/2023] [Indexed: 05/30/2023]
Abstract
The contribution of mosaicism to diagnosed genetic disease and presumed de novo variants (DNV) is under investigated. We determined the contribution of mosaic genetic disease (MGD) and diagnosed parental mosaicism (PM) in parents of offspring with reported DNV (in the same variant) in the (1) Undiagnosed Diseases Network (UDN) (N = 1946) and (2) in 12,472 individuals electronic health records (EHR) who underwent genetic testing at an academic medical center. In the UDN, we found 4.51% of diagnosed probands had MGD, and 2.86% of parents of those with DNV exhibited PM. In the EHR, we found 6.03% and 2.99% and (of diagnosed probands) had MGD detected on chromosomal microarray and exome/genome sequencing, respectively. We found 2.34% (of those with a presumed pathogenic DNV) had a parent with PM for the variant. We detected mosaicism (regardless of pathogenicity) in 4.49% of genetic tests performed. We found a broad phenotypic spectrum of MGD with previously unknown phenotypic phenomena. MGD is highly heterogeneous and provides a significant contribution to genetic diseases. Further work is required to improve the diagnosis of MGD and investigate how PM contributes to DNV risk.
Collapse
Affiliation(s)
- Rory J. Tinker
- Division of Medical Genetics and Genomic Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Lisa Bastarache
- Department of Biomedical Informatics, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Kimberly Ezell
- Division of Medical Genetics and Genomic Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | | | - Cecilia Esteves
- Department of Biomedical Informatics, Harvard Medical School, Boston, Massachusetts, USA
| | - Jill A. Rosenfeld
- Department of Molecular & Human Genetics, Baylor College of Medicine, Houston, Texas, USA
| | - Ellen F. Macnamara
- Undiagnosed Diseases Program, Common Fund, National Human Genome Research Institute, National Institutes of Health, Bethesda, Maryland, USA
| | - Rizwan Hamid
- Division of Medical Genetics and Genomic Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - Joy D. Cogan
- Division of Medical Genetics and Genomic Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | - David Rinker
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, USA
| | - Souhrid Mukharjee
- Department of Biological Sciences, Vanderbilt University, Nashville, Tennessee, USA
| | - Ian Glass
- Division of Genetic Medicine, Department of Pediatrics, University of Washington, Seattle, Washington, USA
| | - Katrina Dipple
- Division of Genetic Medicine, Department of Pediatrics, University of Washington, Seattle, Washington, USA
| | - John A. Phillips
- Division of Medical Genetics and Genomic Medicine, Vanderbilt University Medical Center, Nashville, Tennessee, USA
| | | |
Collapse
|
3
|
Bouzinier MA, Etin D, Trifonov SI, Evdokimova VN, Ulitin V, Shen J, Kokorev A, Ghazani AA, Chekaluk Y, Albertyn Z, Giersch A, Morton CC, Abraamyan F, Bendapudi PK, Sunyaev S, Undiagnosed Diseases Network, Brigham Genomic Medicine, SEQuencing A Baby For An Optimal Outcome, Quantori, Krier JB. AnFiSA: An open-source computational platform for the analysis of sequencing data for rare genetic disease. J Biomed Inform 2022; 133:104174. [PMID: 35998814 DOI: 10.1016/j.jbi.2022.104174] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 07/23/2022] [Accepted: 08/15/2022] [Indexed: 11/28/2022]
Abstract
Despite genomic sequencing rapidly transforming from being a bench-side tool to a routine procedure in a hospital, there is a noticeable lack of genomic analysis software that supports both clinical and research workflows as well as crowdsourcing. Furthermore, most existing software packages are not forward-compatible in regards to supporting ever-changing diagnostic rules adopted by the genetics community. Regular updates of genomics databases pose challenges for reproducible and traceable automated genetic diagnostics tools. Lastly, most of the software tools score low on explainability amongst clinicians. We have created a fully open-source variant curation tool, AnFiSA, with the intention to invite and accept contributions from clinicians, researchers, and professional software developers. The design of AnFiSA addresses the aforementioned issues via the following architectural principles: using a multidimensional database management system (DBMS) for genomic data to address reproducibility, curated decision trees adaptable to changing clinical rules, and a crowdsourcing-friendly interface to address difficult-to-diagnose cases. We discuss how we have chosen our technology stack and describe the design and implementation of the software. Finally, we show in detail how selected workflows can be implemented using the current version of AnFiSA by a medical geneticist.
Collapse
Affiliation(s)
- M A Bouzinier
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
| | - D Etin
- Forome Association, Boston, MA, USA; Oracle Corporation, USA.
| | | | - V N Evdokimova
- Forome Association, Boston, MA, USA; SBCS Scientific Biomedical Consulting Services, London, UK
| | - V Ulitin
- Forome Association, Boston, MA, USA
| | - J Shen
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - A Kokorev
- ITMO University, St. Petersburg, Russian Federation
| | - A A Ghazani
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Harvard Medical School, Boston, MA, USA; Brigham Genomic Medicine, Brigham and Women's Hospital, Boston, MA, USA
| | - Y Chekaluk
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - Z Albertyn
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - A Giersch
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - C C Morton
- Department of Pathology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Department of Obstetrics and Gynecology, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA; Manchester Centre for Audiology and Deafness (ManCAD), School of Health Sciences, University of Manchester, UK
| | - F Abraamyan
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA
| | - P K Bendapudi
- Division of Hemostasis and Thrombosis, Beth Israel Deaconess Medical Center, Boston, MA, USA; Division of Hematology and Blood Transfusion Service, Massachusetts General Hospital, Boston, MA, USA
| | - S Sunyaev
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA; Harvard Medical School, Boston, MA, USA
| | | | | | | | | | - J B Krier
- Division of Genetics, Department of Medicine, Brigham and Women's Hospital, Harvard Medical School, Boston, MA, USA.
| |
Collapse
|
4
|
Cope H, Barseghyan H, Bhattacharya S, Fu Y, Hoppman N, Marcou C, Walley N, Rehder C, Deak K, Alkelai A, Vilain E, Shashi V. Detection of a mosaic CDKL5 deletion and inversion by optical genome mapping ends an exhaustive diagnostic odyssey. Mol Genet Genomic Med 2021; 9:e1665. [PMID: 33955715 PMCID: PMC8372083 DOI: 10.1002/mgg3.1665] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/26/2020] [Revised: 02/23/2021] [Accepted: 03/01/2021] [Indexed: 12/17/2022] Open
Abstract
BACKGROUND Currently available structural variant (SV) detection methods do not span the complete spectrum of disease-causing SVs. Optical genome mapping (OGM), an emerging technology with the potential to resolve diagnostic dilemmas, was performed to investigate clinically-relevant SVs in a 4-year-old male with an epileptic encephalopathy of undiagnosed molecular origin. METHODS OGM was utilized to image long, megabase-size DNA molecules, fluorescently labeled at specific sequence motifs throughout the genome with high sensitivity for detection of SVs greater than 500 bp in size. OGM results were confirmed in a CLIA-certified laboratory via mate-pair sequencing. RESULTS OGM identified a mosaic, de novo 90 kb deletion and inversion on the X chromosome disrupting the CDKL5 gene. Detection of the mosaic deletion, which had been previously undetected by chromosomal microarray, an infantile epilepsy panel including exon-level microarray for CDKL5, exome sequencing as well as genome sequencing, resulted in a diagnosis of X-linked dominant early infantile epileptic encephalopathy-2. CONCLUSION OGM affords an effective technology for the detection of SVs, especially those that are mosaic, since these remain difficult to detect with current NGS technologies and with conventional chromosomal microarrays. Further research in undiagnosed populations with OGM is warranted.
Collapse
Affiliation(s)
- Heidi Cope
- Division of Medical GeneticsDepartment of PediatricsDuke University Medical CenterDurhamNCUSA
| | - Hayk Barseghyan
- Center for Genetic Medicine ResearchChildren’s National HospitalWashingtonDCUSA
- Department of genomics and Precision MedicineSchool of Medicine and Health SciencesGeorge Washington UniversityWashingtonDCUSA
- Bionano Genomics IncSan DiegoCAUSA
| | | | - Yulong Fu
- Center for Genetic Medicine ResearchChildren’s National HospitalWashingtonDCUSA
| | - Nicole Hoppman
- Division of Laboratory Genetics and GenomicsDepartment of Laboratory Medicine and PathologyMayo ClinicRochesterMNUSA
| | - Cherisse Marcou
- Division of Laboratory Genetics and GenomicsDepartment of Laboratory Medicine and PathologyMayo ClinicRochesterMNUSA
| | - Nicole Walley
- Division of Medical GeneticsDepartment of PediatricsDuke University Medical CenterDurhamNCUSA
| | - Catherine Rehder
- Department of PathologyDuke University Medical CenterDurhamNCUSA
| | - Kristen Deak
- Department of PathologyDuke University Medical CenterDurhamNCUSA
| | - Anna Alkelai
- Institute for Genomic MedicineColumbia University Medical CenterNew YorkNYUSA
| | - Eric Vilain
- Center for Genetic Medicine ResearchChildren’s National HospitalWashingtonDCUSA
- Department of genomics and Precision MedicineSchool of Medicine and Health SciencesGeorge Washington UniversityWashingtonDCUSA
| | - Vandana Shashi
- Division of Medical GeneticsDepartment of PediatricsDuke University Medical CenterDurhamNCUSA
| |
Collapse
|
5
|
Kanzi AM, San JE, Chimukangara B, Wilkinson E, Fish M, Ramsuran V, de Oliveira T. Next Generation Sequencing and Bioinformatics Analysis of Family Genetic Inheritance. Front Genet 2020; 11:544162. [PMID: 33193618 PMCID: PMC7649788 DOI: 10.3389/fgene.2020.544162] [Citation(s) in RCA: 33] [Impact Index Per Article: 6.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/20/2020] [Accepted: 09/21/2020] [Indexed: 12/29/2022] Open
Abstract
Mendelian and complex genetic trait diseases continue to burden and affect society both socially and economically. The lack of effective tests has hampered diagnosis thus, the affected lack proper prognosis. Mendelian diseases are caused by genetic mutations in a singular gene while complex trait diseases are caused by the accumulation of mutations in either linked or unlinked genomic regions. Significant advances have been made in identifying novel diseases associated mutations especially with the introduction of next generation and third generation sequencing. Regardless, some diseases are still without diagnosis as most tests rely on SNP genotyping panels developed from population based genetic analyses. Analysis of family genetic inheritance using whole genomes, whole exomes or a panel of genes has been shown to be effective in identifying disease-causing mutations. In this review, we discuss next generation and third generation sequencing platforms, bioinformatic tools and genetic resources commonly used to analyze family based genomic data with a focus on identifying inherited or novel disease-causing mutations. Additionally, we also highlight the analytical, ethical and regulatory challenges associated with analyzing personal genomes which constitute the data used for family genetic inheritance.
Collapse
Affiliation(s)
- Aquillah M. Kanzi
- Kwazulu-Natal Research and Innovation Sequencing Platform (KRISP), School of Laboratory Medicine and Medical Sciences, College of Health Sciences, University of KwaZulu-Natal, Durban, South Africa
| | | | | | | | | | | | | |
Collapse
|