1
|
Zhuang G, Zhang X, Du W, Xu L, Ma J, Luo H, Tang H, Wang W, Wang P, Li M, Yang X, Wu D, Fang S. A benchmarking framework for the accurate and cost-effective detection of clinically-relevant structural variants for cancer target identification and diagnosis. J Transl Med 2024; 22:65. [PMID: 38229122 DOI: 10.1186/s12967-024-04865-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2023] [Accepted: 01/06/2024] [Indexed: 01/18/2024] Open
Abstract
BACKGROUND Accurate clinical structural variant (SV) calling is essential for cancer target identification and diagnosis but has been historically challenging due to the lack of ground truth for clinical specimens. Meanwhile, reduced clinical-testing cost is the key to the widespread clinical utility. METHODS We analyzed massive data from tumor samples of 476 patients and developed a computational framework for accurate and cost-effective detection of clinically-relevant SVs. In addition, standard materials and classical experiments including immunohistochemistry and/or fluorescence in situ hybridization were used to validate the developed computational framework. RESULTS We systematically evaluated the common algorithms for SV detection and established an expert-reviewed SV call set of 1,303 tumor-specific SVs with high-evidence levels. Moreover, we developed a random-forest-based decision model to improve the true positive of SVs. To independently validate the tailored 'two-step' strategy, we utilized standard materials and classical experiments. The accuracy of the model was over 90% (92-99.78%) for all types of data. CONCLUSION Our study provides a valuable resource and an actionable guide to improve cancer-specific SV detection accuracy and clinical applicability.
Collapse
Affiliation(s)
- Guiwu Zhuang
- Department of Gastrointestinal Surgery, The Eighth Affiliated Hospital, Sun Yat-Sen University, Shenzhen, China
| | - Xiaotao Zhang
- Department of Radiotherapy, Qingdao Central Hospital, University of Health and Rehabilitation Sciences, Qingdao, China
| | - Wenjing Du
- Department of Radiotherapy, Shanxi Province Cancer Hospital/Shanxi Hospital Affiliated to Cancer Hospital, Chinese Academy of Medical Sciences/Cancer Hospital Affiliated to Shanxi Medical University, Taiyuan, China
| | - Libin Xu
- Department of Orthopedic Surgery, National Cancer Center/National Clinical Research Center for Cancer/Cancer Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, China
| | - Jiyong Ma
- Department of Respiration, Nanjing First Hospital, Nanjing Medical University, Nanjing, China
| | - Haitao Luo
- Shenzhen Engineering Center for Translational Medicine of Precision Cancer Immunodiagnosis and Therapy, YuceBio Technology Co., Ltd., Shenzhen, China
| | - Hongzhen Tang
- Shenzhen Engineering Center for Translational Medicine of Precision Cancer Immunodiagnosis and Therapy, YuceBio Technology Co., Ltd., Shenzhen, China
| | - Wei Wang
- Shenzhen Engineering Center for Translational Medicine of Precision Cancer Immunodiagnosis and Therapy, YuceBio Technology Co., Ltd., Shenzhen, China
| | - Peng Wang
- Shenzhen Engineering Center for Translational Medicine of Precision Cancer Immunodiagnosis and Therapy, YuceBio Technology Co., Ltd., Shenzhen, China
| | - Miao Li
- Shenzhen Engineering Center for Translational Medicine of Precision Cancer Immunodiagnosis and Therapy, YuceBio Technology Co., Ltd., Shenzhen, China
| | - Xu Yang
- Shenzhen Engineering Center for Translational Medicine of Precision Cancer Immunodiagnosis and Therapy, YuceBio Technology Co., Ltd., Shenzhen, China
| | - Dongfang Wu
- Shenzhen Engineering Center for Translational Medicine of Precision Cancer Immunodiagnosis and Therapy, YuceBio Technology Co., Ltd., Shenzhen, China
| | - Shencun Fang
- Department of Respiratory Medicine, Nanjing Chest Hospital, The Affiliated Brain Hospital of Nanjing Medical University, Nanjing, China.
| |
Collapse
|
2
|
Pagnamenta AT, Camps C, Giacopuzzi E, Taylor JM, Hashim M, Calpena E, Kaisaki PJ, Hashimoto A, Yu J, Sanders E, Schwessinger R, Hughes JR, Lunter G, Dreau H, Ferla M, Lange L, Kesim Y, Ragoussis V, Vavoulis DV, Allroggen H, Ansorge O, Babbs C, Banka S, Baños-Piñero B, Beeson D, Ben-Ami T, Bennett DL, Bento C, Blair E, Brasch-Andersen C, Bull KR, Cario H, Cilliers D, Conti V, Davies EG, Dhalla F, Dacal BD, Dong Y, Dunford JE, Guerrini R, Harris AL, Hartley J, Hollander G, Javaid K, Kane M, Kelly D, Kelly D, Knight SJL, Kreins AY, Kvikstad EM, Langman CB, Lester T, Lines KE, Lord SR, Lu X, Mansour S, Manzur A, Maroofian R, Marsden B, Mason J, McGowan SJ, Mei D, Mlcochova H, Murakami Y, Németh AH, Okoli S, Ormondroyd E, Ousager LB, Palace J, Patel SY, Pentony MM, Pugh C, Rad A, Ramesh A, Riva SG, Roberts I, Roy N, Salminen O, Schilling KD, Scott C, Sen A, Smith C, Stevenson M, Thakker RV, Twigg SRF, Uhlig HH, van Wijk R, Vona B, Wall S, Wang J, Watkins H, Zak J, Schuh AH, Kini U, Wilkie AOM, Popitsch N, Taylor JC. Structural and non-coding variants increase the diagnostic yield of clinical whole genome sequencing for rare diseases. Genome Med 2023; 15:94. [PMID: 37946251 PMCID: PMC10636885 DOI: 10.1186/s13073-023-01240-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/14/2022] [Accepted: 09/27/2023] [Indexed: 11/12/2023] Open
Abstract
BACKGROUND Whole genome sequencing is increasingly being used for the diagnosis of patients with rare diseases. However, the diagnostic yields of many studies, particularly those conducted in a healthcare setting, are often disappointingly low, at 25-30%. This is in part because although entire genomes are sequenced, analysis is often confined to in silico gene panels or coding regions of the genome. METHODS We undertook WGS on a cohort of 122 unrelated rare disease patients and their relatives (300 genomes) who had been pre-screened by gene panels or arrays. Patients were recruited from a broad spectrum of clinical specialties. We applied a bioinformatics pipeline that would allow comprehensive analysis of all variant types. We combined established bioinformatics tools for phenotypic and genomic analysis with our novel algorithms (SVRare, ALTSPLICE and GREEN-DB) to detect and annotate structural, splice site and non-coding variants. RESULTS Our diagnostic yield was 43/122 cases (35%), although 47/122 cases (39%) were considered solved when considering novel candidate genes with supporting functional data into account. Structural, splice site and deep intronic variants contributed to 20/47 (43%) of our solved cases. Five genes that are novel, or were novel at the time of discovery, were identified, whilst a further three genes are putative novel disease genes with evidence of causality. We identified variants of uncertain significance in a further fourteen candidate genes. The phenotypic spectrum associated with RMND1 was expanded to include polymicrogyria. Two patients with secondary findings in FBN1 and KCNQ1 were confirmed to have previously unidentified Marfan and long QT syndromes, respectively, and were referred for further clinical interventions. Clinical diagnoses were changed in six patients and treatment adjustments made for eight individuals, which for five patients was considered life-saving. CONCLUSIONS Genome sequencing is increasingly being considered as a first-line genetic test in routine clinical settings and can make a substantial contribution to rapidly identifying a causal aetiology for many patients, shortening their diagnostic odyssey. We have demonstrated that structural, splice site and intronic variants make a significant contribution to diagnostic yield and that comprehensive analysis of the entire genome is essential to maximise the value of clinical genome sequencing.
Collapse
Affiliation(s)
- Alistair T Pagnamenta
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Carme Camps
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Edoardo Giacopuzzi
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Human Technopole, Viale Rita Levi Montalcini 1, 20157, Milan, Italy
| | - John M Taylor
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Mona Hashim
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Eduardo Calpena
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Pamela J Kaisaki
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Akiko Hashimoto
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Jing Yu
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Edward Sanders
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Ron Schwessinger
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Jim R Hughes
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Gerton Lunter
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- University Medical Center Groningen, Groningen University, PO Box 72, 9700 AB, Groningen, The Netherlands
| | - Helene Dreau
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Matteo Ferla
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Lukas Lange
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Yesim Kesim
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Vassilis Ragoussis
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Dimitrios V Vavoulis
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Holger Allroggen
- Neurosciences Department, UHCW NHS Trust, Clifford Bridge Road, Coventry, CV2 2DX, UK
| | - Olaf Ansorge
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Christian Babbs
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Siddharth Banka
- Division of Evolution, Infection and Genomics, School of Biological Sciences, Faculty of Biology, Medicine and Health, University of Manchester, Manchester, UK
- Manchester Centre for Genomic Medicine, Saint Mary's Hospital, Oxford Road, Manchester, M13 9WL, UK
| | - Benito Baños-Piñero
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - David Beeson
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Tal Ben-Ami
- Pediatric Hematology-Oncology Unit, Kaplan Medical Center, Rehovot, Israel
| | - David L Bennett
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Celeste Bento
- Hematology Department, Hospitais da Universidade de Coimbra, Coimbra, Portugal
| | - Edward Blair
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Charlotte Brasch-Andersen
- Department of Clinical Genetics, Odense University Hospital and Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Katherine R Bull
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
| | - Holger Cario
- Department of Pediatrics and Adolescent Medicine, University Medical Center, Eythstrasse 24, 89075, Ulm, Germany
| | - Deirdre Cilliers
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Valerio Conti
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - E Graham Davies
- Department of Immunology, Great Ormond Street Hospital for Children NHS Trust and UCL Great Ormond Street Institute of Child Health, Zayed Centre for Research, 2Nd Floor, 20C Guilford Street, London, WC1N 1DZ, UK
| | - Fatima Dhalla
- Department of Paediatrics, Institute of Developmental and Regenerative Medicine, IMS-Tetsuya Nakamura Building, Old Road Campus, Roosevelt Drive, Oxford, OX3 7TY, UK
| | - Beatriz Diez Dacal
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Yin Dong
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - James E Dunford
- Oxford NIHR Musculoskeletal BRC and Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Old Road, Oxford, OX3 7HE, UK
| | - Renzo Guerrini
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - Adrian L Harris
- Department of Oncology, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
| | - Jane Hartley
- Liver Unit, Birmingham Women's & Children's Hospital and University of Birmingham, Steelhouse Lane, Birmingham, B4 6NH, UK
| | - Georg Hollander
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Kassim Javaid
- Oxford NIHR Musculoskeletal BRC and Nuffield Department of Orthopaedics, Rheumatology and Musculoskeletal Sciences, Nuffield Orthopaedic Centre, Old Road, Oxford, OX3 7HE, UK
| | - Maureen Kane
- Department of Pharmaceutical Sciences, School of Pharmacy, University of Maryland, Pharmacy Hall North, Room 731, 20 N. Pine Street, Baltimore, MD, 21201, USA
| | - Deirdre Kelly
- Liver Unit, Birmingham Women's & Children's Hospital and University of Birmingham, Steelhouse Lane, Birmingham, B4 6NH, UK
| | - Dominic Kelly
- Children's Hospital, OUH NHS Foundation Trust, NIHR Oxford BRC, Headley Way, Oxford, OX3 9DU, UK
| | - Samantha J L Knight
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Alexandra Y Kreins
- Department of Immunology, Great Ormond Street Hospital for Children NHS Trust and UCL Great Ormond Street Institute of Child Health, Zayed Centre for Research, 2Nd Floor, 20C Guilford Street, London, WC1N 1DZ, UK
| | - Erika M Kvikstad
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Craig B Langman
- Feinberg School of Medicine, Northwestern University, 211 E Chicago Avenue, Chicago, IL, MS37, USA
| | - Tracy Lester
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Kate E Lines
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Simon R Lord
- Early Phase Clinical Trials Unit, Department of Oncology, University of Oxford, Cancer and Haematology Centre, Level 2 Administration Area, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Xin Lu
- Nuffield Department of Clinical Medicine, Ludwig Institute for Cancer Research, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
| | - Sahar Mansour
- St George's University Hospitals NHS Foundation Trust, Blackshore Road, Tooting, London, SW17 0QT, UK
| | - Adnan Manzur
- MRC Centre for Neuromuscular Diseases, National Hospital for Neurology and Neurosurgery, Queen Square, London, WC1N 3BG, UK
| | - Reza Maroofian
- Department of Neuromuscular Diseases, UCL Queen Square Institute of Neurology and The National Hospital for Neurology and Neurosurgery, London, WC1N 3BG, UK
| | - Brian Marsden
- Nuffield Department of Medicine, Kennedy Institute, University of Oxford, Oxford, OX3 7BN, UK
| | - Joanne Mason
- Yourgene Health Headquarters, Skelton House, Lloyd Street North, Manchester Science Park, Manchester, M15 6SH, UK
| | - Simon J McGowan
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Davide Mei
- Neuroscience Department, Meyer Children's Hospital IRCCS, Viale Pieraccini 24, 50139, Florence, Italy
| | - Hana Mlcochova
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Yoshiko Murakami
- Research Institute for Microbial Diseases, Osaka University, 3-1 Yamadaoka, Suita, Osaka, 565-0871, Japan
| | - Andrea H Németh
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Steven Okoli
- Imperial College NHS Trust, Department of Haematology, Hammersmith Hospital, Du Cane Road, London, W12 0HS, UK
| | - Elizabeth Ormondroyd
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Level 6 West Wing, Oxford, OX3 9DU, JR, UK
| | - Lilian Bomme Ousager
- Department of Clinical Genetics, Odense University Hospital and Department of Clinical Research, University of Southern Denmark, Odense, Denmark
| | - Jacqueline Palace
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Smita Y Patel
- Clinical Immunology, John Radcliffe Hospital, Level 4A, Oxford, OX3 9DU, UK
| | - Melissa M Pentony
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
| | - Chris Pugh
- Nuffield Department of Medicine, University of Oxford, Oxford, OX3 7BN, UK
| | - Aboulfazl Rad
- Department of Otolaryngology-Head & Neck Surgery, Tübingen Hearing Research Centre, Eberhard Karls University, Elfriede-Aulhorn-Str. 5, 72076, Tübingen, Germany
| | - Archana Ramesh
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Simone G Riva
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Irene Roberts
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Noémi Roy
- Department of Haematology, Oxford University Hospitals NHS Foundation Trust, Level 4, Haematology, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Outi Salminen
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Kyleen D Schilling
- Ann & Robert H. Lurie Children's Hospital of Chicago, 225 E Chicago Avenue, Chicago, IL, 60611, USA
| | - Caroline Scott
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Arjune Sen
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Conrad Smith
- Oxford Genetics Laboratories, Oxford University Hospitals NHS Foundation Trust, Churchill Hospital, Old Road, Oxford, OX3 7LE, UK
| | - Mark Stevenson
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Rajesh V Thakker
- University of Oxford, Academic Endocrine Unit, OCDEM, Churchill Hospital, Oxford, OX3 7LJ, UK
| | - Stephen R F Twigg
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Holm H Uhlig
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Paediatrics, University of Oxford, Level 2, Children's Hospital, John Radcliffe Hospital, Oxford, OX3 9DU, UK
- Translational Gastroenterology Unit, John Radcliffe Hospital, Oxford, OX3 9DU, UK
| | - Richard van Wijk
- UMC Utrecht, Heidelberglaan 100, 3584 CX, Utrecht, The Netherlands
| | - Barbara Vona
- Department of Otolaryngology-Head & Neck Surgery, Tübingen Hearing Research Centre, Eberhard Karls University, Elfriede-Aulhorn-Str. 5, 72076, Tübingen, Germany
- Institute of Human Genetics, University Medical Center Göttingen, Heinrich-Düker-Weg 12, 37073, Göttingen, Germany
- Institute for Auditory Neuroscience and InnerEarLab, University Medical Center Göttingen, Robert-Koch-Str. 40, 37075, Göttingen, Germany
| | - Steven Wall
- Oxford Craniofacial Unit, John Radcliffe Hospital, Level LG1, West Wing, Oxford, OX3 9DU, UK
| | - Jing Wang
- Nuffield Department of Clinical Neurosciences, University of Oxford, Oxford, OX3 9DU, UK
| | - Hugh Watkins
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- University of Oxford, Level 6 West Wing, Oxford, OX3 9DU, JR, UK
| | - Jaroslav Zak
- Nuffield Department of Clinical Medicine, Ludwig Institute for Cancer Research, University of Oxford, Old Road Campus Research Building, Oxford, OX3 7DQ, UK
- Department of Immunology and Microbiology, The Scripps Research Institute, 10550 North Torrey Pines Road, La Jolla, CA, 92037, USA
| | - Anna H Schuh
- Department of Oncology, Oxford Molecular Diagnostics Centre, University of Oxford, Level 4, John Radcliffe Hospital, Headley Way, Oxford, OX3 9DU, UK
| | - Usha Kini
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Oxford Centre for Genomic Medicine, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 7LE, UK
| | - Andrew O M Wilkie
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- MRC Weatherall Institute of Molecular Medicine, University of Oxford, John Radcliffe Hospital, Oxford, OX3 9DS, UK
| | - Niko Popitsch
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK
- Department of Biochemistry and Cell Biology, Max Perutz Labs, University of Vienna, Vienna BioCenter(VBC), Dr.-Bohr-Gasse 9, 1030, Vienna, Austria
| | - Jenny C Taylor
- Wellcome Centre for Human Genetics, University of Oxford, Old Road Campus, Roosevelt Drive, Oxford, OX3 7BN, UK.
- NIHR Oxford Biomedical Research Centre, John Radcliffe Hospital, Oxford University Hospitals NHS Foundation Trust, Oxford, OX3 9DU, UK.
| |
Collapse
|
3
|
Majidian S, Agustinho DP, Chin CS, Sedlazeck FJ, Mahmoud M. Genomic variant benchmark: if you cannot measure it, you cannot improve it. Genome Biol 2023; 24:221. [PMID: 37798733 PMCID: PMC10552390 DOI: 10.1186/s13059-023-03061-1] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Accepted: 09/18/2023] [Indexed: 10/07/2023] Open
Abstract
Genomic benchmark datasets are essential to driving the field of genomics and bioinformatics. They provide a snapshot of the performances of sequencing technologies and analytical methods and highlight future challenges. However, they depend on sequencing technology, reference genome, and available benchmarking methods. Thus, creating a genomic benchmark dataset is laborious and highly challenging, often involving multiple sequencing technologies, different variant calling tools, and laborious manual curation. In this review, we discuss the available benchmark datasets and their utility. Additionally, we focus on the most recent benchmark of genes with medical relevance and challenging genomic complexity.
Collapse
Affiliation(s)
- Sina Majidian
- Department of Computational Biology, University of Lausanne, 1015, Lausanne, Switzerland
- SIB Swiss Institute of Bioinformatics, 1015, Lausanne, Switzerland
| | | | | | - Fritz J Sedlazeck
- Baylor College of Medicine, Human Genome Sequencing Center, Houston, TX, 77030, USA.
- Department of Computer Science, Rice University, 6100 Main Street, Houston, TX, 77005, USA.
| | - Medhat Mahmoud
- Baylor College of Medicine, Human Genome Sequencing Center, Houston, TX, 77030, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA.
| |
Collapse
|
4
|
Jiang T, Liu S, Guo H. Reply: Correspondence on NanoVar's performance outlined by Jiang T. et al. in 'Long-read sequencing settings for efficient structural variation detection based on comprehensive evaluation'. BMC Bioinformatics 2023; 24:352. [PMID: 37730581 PMCID: PMC10510213 DOI: 10.1186/s12859-023-05483-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Accepted: 09/13/2023] [Indexed: 09/22/2023] Open
Abstract
We published a paper in BMC Bioinformatics comprehensively evaluating the performance of structural variation (SV) calling with long-read SV detection methods based on simulated error-prone long-read data under various sequencing settings. Recently, C.Y.T. et al. wrote a correspondence claiming that the performance of NanoVar was underestimated in our benchmarking and listed some errors in our previous manuscripts. To clarify these matters, we reproduced our previous benchmarking results and carried out a series of parallel experiments on both the newly generated simulated datasets and the ones provided by C.Y.T. et al. The robust benchmark results indicate that NanoVar has unstable performance on simulated data produced from different versions of VISOR, while other tools do not exhibit this phenomenon. Furthermore, the errors proposed by C.Y.T. et al. were due to them using another version of VISOR and Sniffles, which caused many changes in usage and results compared to the versions applied in our previous work. We hope that this commentary proves the validity of our previous publication, clarifies and eliminates the misunderstanding about the commands and results in our benchmarking. Furthermore, we welcome more experts and scholars in the scientific community to pay attention to our research and help us better optimize these valuable works.
Collapse
Affiliation(s)
- Tao Jiang
- Faculty of Computing, Harbin Institute of Technology, Harbin, 150001, China
| | - Shiqi Liu
- Faculty of Computing, Harbin Institute of Technology, Harbin, 150001, China
| | - Hongzhe Guo
- Faculty of Computing, Harbin Institute of Technology, Harbin, 150001, China.
| |
Collapse
|
5
|
Tschernoster N, Erger F, Kohl S, Reusch B, Wenzel A, Walsh S, Thiele H, Becker C, Franitza M, Bartram MP, Kömhoff M, Schumacher L, Kukat C, Borodina T, Quedenau C, Nürnberg P, Rinschen MM, Driller JH, Pedersen BP, Schlingmann KP, Hüttel B, Bockenhauer D, Beck B, Altmüller J. Long-read sequencing identifies a common transposition haplotype predisposing for CLCNKB deletions. Genome Med 2023; 15:62. [PMID: 37612755 PMCID: PMC10464140 DOI: 10.1186/s13073-023-01215-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2023] [Accepted: 07/27/2023] [Indexed: 08/25/2023] Open
Abstract
BACKGROUND Long-read sequencing is increasingly used to uncover structural variants in the human genome, both functionally neutral and deleterious. Structural variants occur more frequently in regions with a high homology or repetitive segments, and one rearrangement may predispose to additional events. Bartter syndrome type 3 (BS 3) is a monogenic tubulopathy caused by deleterious variants in the chloride channel gene CLCNKB, a high proportion of these being large gene deletions. Multiplex ligation-dependent probe amplification, the current diagnostic gold standard for this type of mutation, will indicate a simple homozygous gene deletion in biallelic deletion carriers. However, since the phenotypic spectrum of BS 3 is broad even among biallelic deletion carriers, we undertook a more detailed analysis of precise breakpoint regions and genomic structure. METHODS Structural variants in 32 BS 3 patients from 29 families and one BS4b patient with CLCNKB deletions were investigated using long-read and synthetic long-read sequencing, as well as targeted long-read sequencing approaches. RESULTS We report a ~3 kb duplication of 3'-UTR CLCNKB material transposed to the corresponding locus of the neighbouring CLCNKA gene, also found on ~50 % of alleles in healthy control individuals. This previously unknown common haplotype is significantly enriched in our cohort of patients with CLCNKB deletions (45 of 51 alleles with haplotype information, 2.2 kb and 3.0 kb transposition taken together, p=9.16×10-9). Breakpoint coordinates for the CLCNKB deletion were identifiable in 28 patients, with three being compound heterozygous. In total, eight different alleles were found, one of them a complex rearrangement with three breakpoint regions. Two patients had different CLCNKA/CLCNKB hybrid genes encoding a predicted CLCNKA/CLCNKB hybrid protein with likely residual function. CONCLUSIONS The presence of multiple different deletion alleles in our cohort suggests that large CLCNKB gene deletions originated from many independently recurring genomic events clustered in a few hot spots. The uncovered associated sequence transposition haplotype apparently predisposes to these additional events. The spectrum of CLCNKB deletion alleles is broader than expected and likely still incomplete, but represents an obvious candidate for future genotype/phenotype association studies. We suggest a sensitive and cost-efficient approach, consisting of indirect sequence capture and long-read sequencing, to analyse disease-relevant structural variant hotspots in general.
Collapse
Affiliation(s)
- Nikolai Tschernoster
- Cologne Center for Genomics (CCG), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
- Institute of Human Genetics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Kerpener Str. 34, 50931, Cologne, Germany
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Florian Erger
- Institute of Human Genetics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Kerpener Str. 34, 50931, Cologne, Germany
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Stefan Kohl
- Department of Pediatrics, Cologne Children's Hospital, Cologne, Germany
| | - Björn Reusch
- Institute of Human Genetics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Kerpener Str. 34, 50931, Cologne, Germany
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Andrea Wenzel
- Institute of Human Genetics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Kerpener Str. 34, 50931, Cologne, Germany
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Stephen Walsh
- Department of Renal Medicine, UCL, University College London, London, UK
| | - Holger Thiele
- Cologne Center for Genomics (CCG), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Christian Becker
- Cologne Center for Genomics (CCG), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Marek Franitza
- Cologne Center for Genomics (CCG), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Malte P Bartram
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
- Department II of Internal Medicine, University of Cologne, Cologne, Germany
| | - Martin Kömhoff
- Department of Pediatrics, University Marburg, Marburg, Germany
| | - Lena Schumacher
- FACS & Imaging Core Facility, Max Planck Institute for Biology of Ageing, Cologne, Germany
| | - Christian Kukat
- FACS & Imaging Core Facility, Max Planck Institute for Biology of Ageing, Cologne, Germany
| | - Tatiana Borodina
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Straße 28, 10115, Berlin, Germany
| | - Claudia Quedenau
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Straße 28, 10115, Berlin, Germany
| | - Peter Nürnberg
- Cologne Center for Genomics (CCG), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany
| | - Markus M Rinschen
- Department of Biomedicine, Aarhus University, Aarhus, Denmark
- Aarhus Institute of Advanced Studies, Aarhus University, Aarhus, Denmark
- Department III of Medicine, University Medical Center Hamburg-Eppendorf, Hamburg, Germany
| | - Jan H Driller
- Department of Molecular Biology and Genetics, Aarhus University, Universitetsbyen 81, DK-8000, Aarhus C, Denmark
| | - Bjørn P Pedersen
- Department of Molecular Biology and Genetics, Aarhus University, Universitetsbyen 81, DK-8000, Aarhus C, Denmark
| | - Karl P Schlingmann
- Department of General Pediatrics, University Children's Hospital, Münster, Germany
| | - Bruno Hüttel
- Max Planck Genome-Centre Cologne, Max Planck Institute for Plant Breeding Research, Cologne, Germany
| | - Detlef Bockenhauer
- Department of Renal Medicine, UCL, University College London, London, UK
- Great Ormond Street Hospital for Children, NHS Foundation Trust, London, UK
| | - Bodo Beck
- Institute of Human Genetics, Faculty of Medicine and University Hospital Cologne, University of Cologne, Kerpener Str. 34, 50931, Cologne, Germany.
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany.
| | - Janine Altmüller
- Center for Molecular Medicine Cologne (CMMC), University of Cologne, Faculty of Medicine and University Hospital Cologne, Cologne, Germany.
- Max Delbrück Center for Molecular Medicine in the Helmholtz Association (MDC), Hannoversche Straße 28, 10115, Berlin, Germany.
- Berlin Institute of Health at Charité - Universitätsmedizin Berlin, Core Facility Genomics, Berlin, Germany.
| |
Collapse
|
6
|
Laufer VA, Glover TW, Wilson TE. Applications of advanced technologies for detecting genomic structural variation. Mutat Res Rev Mutat Res 2023; 792:108475. [PMID: 37931775 PMCID: PMC10792551 DOI: 10.1016/j.mrrev.2023.108475] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Revised: 09/07/2023] [Accepted: 11/02/2023] [Indexed: 11/08/2023]
Abstract
Chromosomal structural variation (SV) encompasses a heterogenous class of genetic variants that exerts strong influences on human health and disease. Despite their importance, many structural variants (SVs) have remained poorly characterized at even a basic level, a discrepancy predicated upon the technical limitations of prior genomic assays. However, recent advances in genomic technology can identify and localize SVs accurately, opening new questions regarding SV risk factors and their impacts in humans. Here, we first define and classify human SVs and their generative mechanisms, highlighting characteristics leveraged by various SV assays. We next examine the first-ever gapless assembly of the human genome and the technical process of assembling it, which required third-generation sequencing technologies to resolve structurally complex loci. The new portions of that "telomere-to-telomere" and subsequent pangenome assemblies highlight aspects of SV biology likely to develop in the near-term. We consider the strengths and limitations of the most promising new SV technologies and when they or longstanding approaches are best suited to meeting salient goals in the study of human SV in population-scale genomics research, clinical, and public health contexts. It is a watershed time in our understanding of human SV when new approaches are expected to fundamentally change genomic applications.
Collapse
Affiliation(s)
- Vincent A Laufer
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| | - Thomas W Glover
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| | - Thomas E Wilson
- Department of Pathology, University of Michigan Medical School, Ann Arbor, MI 48109, USA; Department of Human Genetics, University of Michigan Medical School, Ann Arbor, MI 48109, USA.
| |
Collapse
|
7
|
Li BJ, Chen L, Yan MZ, Zou XQ, Bai YL, Xue YG, Jiang Z, Chen BH, Li CY, He Q, Feng JX, Zhou T, Xu P, Zhou T, Xu P. Intercross population study reveals that co-mutation of mitfa genes in two subgenomes induces red skin color in common carp ( Cyprinus carpio wuyuanensis). Zool Res 2023; 44:276-286. [PMID: 36785895 PMCID: PMC10083221 DOI: 10.24272/j.issn.2095-8137.2022.388] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/13/2023] Open
Abstract
Common carp are among the oldest domesticated fish in the world. As such, there are many food and ornamental carp strains with abundant phenotypic variations due to natural and artificial selection. Hebao red carp (HB, Cyprinus carpio wuyuanensis), an indigenous strain in China, is renowned for its unique body morphology and reddish skin. To reveal the genetic basis underlying the distinct skin color of HB, we constructed an improved high-fidelity (HiFi) HB genome with good contiguity, completeness, and correctness. Genome structure comparison was conducted between HB and a representative wild strain, Yellow River carp (YR, C. carpio haematopterus), to identify structural variants and genes under positive selection. Signatures of artificial selection during domestication were identified in HB and YR populations, while phenotype mapping was performed in a segregating population generated by HB×YR crosses. Body color in HB was associated with regions with fixed mutations. The simultaneous mutation and superposition of a pair of homologous genes ( mitfa) in chromosomes A06 and B06 conferred the reddish color in domesticated HB. Transcriptome analysis of common carp with different alleles of the mitfa mutation confirmed that gene duplication can buffer the deleterious effects of mutation in allotetraploids. This study provides new insights into genotype-phenotype associations in allotetraploid species and lays a foundation for future breeding of common carp.
Collapse
Affiliation(s)
- Bi-Jun Li
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Lin Chen
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Meng-Zhen Yan
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Xiao-Qing Zou
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Yu-Lin Bai
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Ya-Guo Xue
- College of Fisheries, Henan Normal University, Xinxiang, Henan 453007, China
| | - Zhou Jiang
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,College of Fisheries, Henan Normal University, Xinxiang, Henan 453007, China
| | - Bao-Hua Chen
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Cheng-Yu Li
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Qian He
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Jian-Xin Feng
- Henan Academy of Fishery Science, Zhengzhou, Henan 450039, China
| | - Tao Zhou
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China
| | - Peng Xu
- State Key Laboratory of Marine Environmental Science, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China.,Fujian Key Laboratory of Genetics and Breeding of Marine Organisms, College of Ocean and Earth Sciences, Xiamen University, Xiamen, Fujian 361102, China. E-mail:
| | | | | | | | | | | | | | | | | |
Collapse
|
8
|
Haga Y, Sakamoto Y, Arai M, Suzuki Y, Suzuki A. Long-Read Whole-Genome Sequencing Using a Nanopore Sequencer and Detection of Structural Variants in Cancer Genomes. Methods Mol Biol 2023; 2632:177-89. [PMID: 36781729 DOI: 10.1007/978-1-0716-2996-3_13] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/15/2023]
Abstract
Long-read sequencing technologies enable us to precisely identify structural variants (SVs), which would be occasionally associated with various types of diseases, including cancers. In this section, we introduce experimental and computational procedures for conducting long-read whole-genome sequencing (WGS) of cancer genomes from fresh frozen tissues/cells. We also demonstrate the analysis of SVs in cancer genomes using long-read WGS data from lung cancer cell lines by several representative computational tools, such as cuteSV and Sniffles2, as examples.
Collapse
|
9
|
Danis D, Jacobsen JOB, Balachandran P, Zhu Q, Yilmaz F, Reese J, Haimel M, Lyon GJ, Helbig I, Mungall CJ, Beck CR, Lee C, Smedley D, Robinson PN. SvAnna: efficient and accurate pathogenicity prediction of coding and regulatory structural variants in long-read genome sequencing. Genome Med 2022; 14:44. [PMID: 35484572 PMCID: PMC9047340 DOI: 10.1186/s13073-022-01046-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2021] [Accepted: 04/12/2022] [Indexed: 01/18/2023] Open
Abstract
Structural variants (SVs) are implicated in the etiology of Mendelian diseases but have been systematically underascertained owing to sequencing technology limitations. Long-read sequencing enables comprehensive detection of SVs, but approaches for prioritization of candidate SVs are needed. Structural variant Annotation and analysis (SvAnna) assesses all classes of SVs and their intersection with transcripts and regulatory sequences, relating predicted effects on gene function with clinical phenotype data. SvAnna places 87% of deleterious SVs in the top ten ranks. The interpretable prioritizations offered by SvAnna will facilitate the widespread adoption of long-read sequencing in diagnostic genomics. SvAnna is available at https://github.com/TheJacksonLaboratory/SvAnn a .
Collapse
Affiliation(s)
- Daniel Danis
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
| | - Julius O. B. Jacobsen
- grid.4868.20000 0001 2171 1133William Harvey Research Institute, Charterhouse Square, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, EC1M 6BQ UK
| | - Parithi Balachandran
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
| | - Qihui Zhu
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
| | - Feyza Yilmaz
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
| | - Justin Reese
- grid.184769.50000 0001 2231 4551Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| | - Matthias Haimel
- grid.511293.d0000 0004 6104 8403Ludwig Boltzmann Institute for Rare and Undiagnosed Diseases, Vienna, Austria ,grid.416346.2St. Anna Children’s Cancer Research Institute, Vienna, Austria ,grid.418729.10000 0004 0392 6802CeMM Research Center for Molecular Medicine of the Austrian Academy of Sciences, Vienna, Austria ,grid.486422.e0000000405446183Present address: Global Computational Biology and Digital Sciences, Boehringer Ingelheim Regional Center Vienna GmbH & Co KG, 1120 Vienna, Austria
| | - Gholson J. Lyon
- grid.420001.70000 0000 9813 9625Department of Human Genetics, New York State Institute for Basic Research in Developmental Disabilities, Staten Island, New York, USA ,grid.212340.60000000122985718Biology PhD Program, The Graduate Center, The City University of New York, New York, USA
| | - Ingo Helbig
- grid.239552.a0000 0001 0680 8770Division of Neurology, Children’s Hospital of Philadelphia, Philadelphia, PA USA ,grid.239552.a0000 0001 0680 8770The Epilepsy NeuroGenetics Initiative (ENGIN), Children’s Hospital of Philadelphia, Philadelphia, PA USA ,grid.239552.a0000 0001 0680 8770Department of Biomedical and Health Informatics (DBHi), Children’s Hospital of Philadelphia, Philadelphia, PA USA ,grid.25879.310000 0004 1936 8972Department of Neurology, University of Pennsylvania, Philadelphia, PA USA
| | - Christopher J. Mungall
- grid.184769.50000 0001 2231 4551Division of Environmental Genomics and Systems Biology, Lawrence Berkeley National Laboratory, Berkeley, CA 94720 USA
| | - Christine R. Beck
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA ,grid.208078.50000000419370394Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT 06032 USA ,grid.63054.340000 0001 0860 4915Institute for Systems Genomics, University of Connecticut, Storrs, CT 06269 USA
| | - Charles Lee
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA
| | - Damian Smedley
- grid.4868.20000 0001 2171 1133William Harvey Research Institute, Charterhouse Square, Barts and the London School of Medicine and Dentistry, Queen Mary University of London, London, EC1M 6BQ UK
| | - Peter N. Robinson
- grid.249880.f0000 0004 0374 0039The Jackson Laboratory for Genomic Medicine, Farmington, CT 06032 USA ,grid.208078.50000000419370394Department of Genetics and Genome Sciences, University of Connecticut Health Center, Farmington, CT 06032 USA
| |
Collapse
|
10
|
Theunissen F, Flynn LL, Anderton RS, Akkari PA. Short structural variants as informative genetic markers for ALS disease risk and progression. BMC Med 2022; 20:11. [PMID: 35034660 PMCID: PMC8762977 DOI: 10.1186/s12916-021-02206-y] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 09/29/2021] [Accepted: 12/06/2021] [Indexed: 02/07/2023] Open
Abstract
There is considerable variability in disease progression for patients with amyotrophic lateral sclerosis (ALS) including the age of disease onset, site of disease onset, and survival time. There is growing evidence that short structural variations (SSVs) residing in frequently overlooked genomic regions can contribute to complex disease mechanisms and can explain, in part, the phenotypic variability in ALS patients. Here, we discuss SSVs recently characterized by our laboratory and how these discoveries integrate into the current literature on ALS, particularly in the context of application to future clinical trials. These markers may help to identify and differentiate patients for clinical trials that have a similar ALS disease mechanism(s), thereby reducing the impact of participant heterogeneity. As evidence accumulates for the genetic markers discovered in SQSTM1, SCAF4, and STMN2, we hope to improve the outcomes of future ALS clinical trials.
Collapse
Affiliation(s)
- Frances Theunissen
- Perron Institute for Neurological and Translational Science, First floor, RR block, QEII Medical Centre, 8 Verdun St, Nedlands, WA, 6009, Australia.
- Centre for Neuromuscular and Neurological Disorders, University of Western Australia, Nedlands, WA, Australia.
| | - Loren L Flynn
- Perron Institute for Neurological and Translational Science, First floor, RR block, QEII Medical Centre, 8 Verdun St, Nedlands, WA, 6009, Australia
- Centre for Neuromuscular and Neurological Disorders, University of Western Australia, Nedlands, WA, Australia
- Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Perth, WA, Australia
- Black Swan Pharmaceuticals, Wake Forrest, NC, USA
| | - Ryan S Anderton
- Centre for Neuromuscular and Neurological Disorders, University of Western Australia, Nedlands, WA, Australia
- Faculty of Medicine, Nursing, Midwifery and Health Sciences, University of Notre Dame Australia, Fremantle, WA, 6160, Australia
| | - P Anthony Akkari
- Perron Institute for Neurological and Translational Science, First floor, RR block, QEII Medical Centre, 8 Verdun St, Nedlands, WA, 6009, Australia
- Centre for Neuromuscular and Neurological Disorders, University of Western Australia, Nedlands, WA, Australia
- Centre for Molecular Medicine and Innovative Therapeutics, Murdoch University, Perth, WA, Australia
- Black Swan Pharmaceuticals, Wake Forrest, NC, USA
- Division of Neurology, Duke University Medical Centre, Duke University, Durham, NC, USA
| |
Collapse
|
11
|
Mahmoud M, Doddapaneni H, Timp W, Sedlazeck FJ. PRINCESS: comprehensive detection of haplotype resolved SNVs, SVs, and methylation. Genome Biol 2021; 22:268. [PMID: 34521442 PMCID: PMC8442460 DOI: 10.1186/s13059-021-02486-w] [Citation(s) in RCA: 18] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2021] [Accepted: 09/02/2021] [Indexed: 12/11/2022] Open
Abstract
Long-read sequencing has been shown to have advantages in structural variation (SV) detection and methylation calling. Many studies focus either on SV, methylation, or phasing of SNV; however, only the combination of variants provides a comprehensive insight into the sample and thus enables novel findings in biology or medicine. PRINCESS is a structured workflow that takes raw sequence reads and generates a fully phased SNV, SV, and methylation call set within a few hours. PRINCESS achieves high accuracy and long phasing even on low coverage datasets and can resolve repetitive, complex medical relevant genes that often escape detection. PRINCESS is publicly available at https://github.com/MeHelmy/princess under the MIT license.
Collapse
Affiliation(s)
- Medhat Mahmoud
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, 77030, USA.
- Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, 77030, USA.
| | | | - Winston Timp
- Department of Biomedical Engineering, Johns Hopkins University, Baltimore, MD, 21218, USA
| | - Fritz J Sedlazeck
- Human Genome Sequencing Center, Baylor College of Medicine, Houston, TX, 77030, USA.
| |
Collapse
|
12
|
Singh AK, Olsen MF, Lavik LAS, Vold T, Drabløs F, Sjursen W. Detecting copy number variation in next generation sequencing data from diagnostic gene panels. BMC Med Genomics 2021; 14:214. [PMID: 34465341 PMCID: PMC8406611 DOI: 10.1186/s12920-021-01059-x] [Citation(s) in RCA: 19] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/14/2021] [Accepted: 08/16/2021] [Indexed: 01/21/2023] Open
Abstract
Background Detection of copy number variation (CNV) in genes associated with disease is important in genetic diagnostics, and next generation sequencing (NGS) technology provides data that can be used for CNV detection. However, CNV detection based on NGS data is in general not often used in diagnostic labs as the data analysis is challenging, especially with data from targeted gene panels. Wet lab methods like MLPA (MRC Holland) are widely used, but are expensive, time consuming and have gene-specific limitations. Our aim has been to develop a bioinformatic tool for CNV detection from NGS data in medical genetic diagnostic samples. Results Our computational pipeline for detection of CNVs in NGS data from targeted gene panels utilizes coverage depth of the captured regions and calculates a copy number ratio score for each region. This is computed by comparing the mean coverage of the sample with the mean coverage of the same region in other samples, defined as a pool. The pipeline selects pools for comparison dynamically from previously sequenced samples, using the pool with an average coverage depth that is nearest to the one of the samples. A sliding window-based approach is used to analyze each region, where length of sliding window and sliding distance can be chosen dynamically to increase or decrease the resolution. This helps in detecting CNVs in small or partial exons. With this pipeline we have correctly identified the CNVs in 36 positive control samples, with sensitivity of 100% and specificity of 91%. We have detected whole gene level deletion/duplication, single/multi exonic level deletion/duplication, partial exonic deletion and mosaic deletion. Since its implementation in mid-2018 it has proven its diagnostic value with more than 45 CNV findings in routine tests. Conclusions With this pipeline as part of our diagnostic practices it is now possible to detect partial, single or multi-exonic, and intragenic CNVs in all genes in our target panel. This has helped our diagnostic lab to expand the portfolio of genes where we offer CNV detection, which previously was limited by the availability of MLPA kits. Supplementary Information The online version contains supplementary material available at 10.1186/s12920-021-01059-x.
Collapse
Affiliation(s)
- Ashish Kumar Singh
- Department of Medical Genetics, St. Olavs Hospital, Trondheim, Norway. .,Department of Clinical and Molecular Medicine, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and Technology, Trondheim, Norway.
| | | | | | - Trine Vold
- Department of Medical Genetics, St. Olavs Hospital, Trondheim, Norway
| | - Finn Drabløs
- Department of Clinical and Molecular Medicine, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| | - Wenche Sjursen
- Department of Medical Genetics, St. Olavs Hospital, Trondheim, Norway.,Department of Clinical and Molecular Medicine, Faculty of Medicine and Health Sciences, NTNU - Norwegian University of Science and Technology, Trondheim, Norway
| |
Collapse
|
13
|
Sakamoto Y, Zaha S, Suzuki Y, Seki M, Suzuki A. Application of long-read sequencing to the detection of structural variants in human cancer genomes. Comput Struct Biotechnol J 2021; 19:4207-4216. [PMID: 34527193 PMCID: PMC8350331 DOI: 10.1016/j.csbj.2021.07.030] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2021] [Revised: 07/20/2021] [Accepted: 07/25/2021] [Indexed: 01/02/2023] Open
Abstract
In recent years, the so-called long-read sequencing technology has had a substantial impact on various aspects of genome sciences. Here, we introduce recent studies of cancerous structural variants (SVs) using long-read sequencing technologies, namely Pacific Biosciences (PacBio) sequencers, Oxford Nanopore Technologies (ONT) sequencers, and linked-read methods. By taking advantage of long-read lengths, these technologies have enabled the precise detection of SVs, including long insertions by transposable elements, such as LINE-1. In addition to SV detection, the epigenome status (including DNA methylation and haplotype information) surrounding SV loci has also been unveiled by long-read sequencing technologies, to identify the effects of SVs. Among the various research fields in which long-read sequencing has been applied, cancer genomics has shown the most remarkable advances. In fact, many studies are beginning to shed light on the detection of SVs and the elucidation of their complex structures in various types of cancer. In the particular case of cancers, we summarize the technical limitations of the application of this technology to the analysis of clinical samples. We will introduce recent achievements from this viewpoint. However, a similar approach will be started for other applications in the near future. Therefore, by complementing the current short-read sequencing analysis, long-read sequencing should reveal the complex nature of human genomes in their healthy and disease states, which will open a new opportunity for a better understanding of disease development and for a novel strategy for drug development.
Collapse
Affiliation(s)
- Yoshitaka Sakamoto
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
| | - Suzuko Zaha
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
| | - Yutaka Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
| | - Masahide Seki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
| | - Ayako Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, 5-1-5 Kashiwanoha, Kashiwa, Chiba 277-8561, Japan
| |
Collapse
|
14
|
Kent M, Moser M, Boman IA, Lindtveit K, Árnyasi M, Sundsaasen KK, Våge DI. Insertion of an endogenous Jaagsiekte sheep retrovirus element into the BCO2 - gene abolishes its function and leads to yellow discoloration of adipose tissue in Norwegian Spælsau (Ovis aries). BMC Genomics 2021; 22:492. [PMID: 34193038 PMCID: PMC8247158 DOI: 10.1186/s12864-021-07826-5] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/11/2021] [Accepted: 06/21/2021] [Indexed: 11/19/2022] Open
Abstract
Background The accumulation of carotenoids in adipose tissue leading to yellow fat is, in sheep, a heritable recessive trait that can be attributed to a nonsense mutation in the beta-carotene oxygenase 2 (BCO2) gene. However, not all sheep breeds suffering from yellow fat have this nonsense mutation, meaning that other functional mechanisms must exist. We investigated one such breed, the Norwegian spælsau. Results In spælsau we detected an aberration in BCO2 mRNA. Nanopore sequencing of genomic DNA revealed the insertion of a 7.9 kb endogenous Jaagsiekte Sheep Retrovirus (enJSRV) sequence in the first intron of the BCO2 gene. Close examination of its cDNA revealed that the BCO2 genes first exon was spliced together with enJSRV-sequence immediately downstream of a potential -AG splice acceptor site at enJSRV position 415. The hybrid protein product consists of 29 amino acids coded by the BCO2 exon 1, one amino acid coded by the junction sequence, followed by 28 amino acids arbitrary coded for by the enJSRV-sequence, before a translation stop codon is reached. Conclusions Considering that the functional BCO2 protein consists of 575 amino acids, it is unlikely that the 58 amino acid BCO2/enJSRV hybrid protein can display any enzymatic function. The existence of this novel BCO2 allele represents an alternative functional mechanism accounting for BCO2 inactivation and is a perfect example of the potential benefits for searching for structural variants using long-read sequencing data. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-021-07826-5.
Collapse
Affiliation(s)
- Matthew Kent
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Faculty of Biosciences, Norwegian University of Life Sciences, No-1432, Ås, Norway
| | - Michel Moser
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Faculty of Biosciences, Norwegian University of Life Sciences, No-1432, Ås, Norway
| | - Inger Anne Boman
- The Norwegian Association of Sheep and Goat Breeders, No-1431, Ås, Norway
| | - Kristine Lindtveit
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Faculty of Biosciences, Norwegian University of Life Sciences, No-1432, Ås, Norway
| | - Mariann Árnyasi
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Faculty of Biosciences, Norwegian University of Life Sciences, No-1432, Ås, Norway
| | - Kristil Kindem Sundsaasen
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Faculty of Biosciences, Norwegian University of Life Sciences, No-1432, Ås, Norway
| | - Dag Inge Våge
- Department of Animal and Aquacultural Sciences, Centre for Integrative Genetics (CIGENE), Faculty of Biosciences, Norwegian University of Life Sciences, No-1432, Ås, Norway.
| |
Collapse
|
15
|
Li X, Kumar S, Harmanci A, Li S, Kitchen RR, Zhang Y, Wali VB, Reddy SM, Woodward WA, Reuben JM, Rozowsky J, Hatzis C, Ueno NT, Krishnamurthy S, Pusztai L, Gerstein M. Whole-genome sequencing of phenotypically distinct inflammatory breast cancers reveals similar genomic alterations to non-inflammatory breast cancers. Genome Med 2021; 13:70. [PMID: 33902690 PMCID: PMC8077918 DOI: 10.1186/s13073-021-00879-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/09/2020] [Accepted: 03/25/2021] [Indexed: 02/08/2023] Open
Abstract
BACKGROUND Inflammatory breast cancer (IBC) has a highly invasive and metastatic phenotype. However, little is known about its genetic drivers. To address this, we report the largest cohort of whole-genome sequencing (WGS) of IBC cases. METHODS We performed WGS of 20 IBC samples and paired normal blood DNA to identify genomic alterations. For comparison, we used 23 matched non-IBC samples from the Cancer Genome Atlas Program (TCGA). We also validated our findings using WGS data from the International Cancer Genome Consortium (ICGC) and the Pan-Cancer Analysis of Whole Genomes (PCAWG) Consortium. We examined a wide selection of genomic features to search for differences between IBC and conventional breast cancer. These include (i) somatic and germline single-nucleotide variants (SNVs), in both coding and non-coding regions; (ii) the mutational signature and the clonal architecture derived from these SNVs; (iii) copy number and structural variants (CNVs and SVs); and (iv) non-human sequence in the tumors (i.e., exogenous sequences of bacterial origin). RESULTS Overall, IBC has similar genomic characteristics to non-IBC, including specific alterations, overall mutational load and signature, and tumor heterogeneity. In particular, we observed similar mutation frequencies between IBC and non-IBC, for each gene and most cancer-related pathways. Moreover, we found no exogenous sequences of infectious agents specific to IBC samples. Even though we could not find any strongly statistically distinguishing genomic features between the two groups, we did find some suggestive differences in IBC: (i) The MAST2 gene was more frequently mutated (20% IBC vs. 0% non-IBC). (ii) The TGF β pathway was more frequently disrupted by germline SNVs (50% vs. 13%). (iii) Different copy number profiles were observed in several genomic regions harboring cancer genes. (iv) Complex SVs were more frequent. (v) The clonal architecture was simpler, suggesting more homogenous tumor-evolutionary lineages. CONCLUSIONS Whole-genome sequencing of IBC manifests a similar genomic architecture to non-IBC. We found no unique genomic alterations shared in just IBCs; however, subtle genomic differences were observed including germline alterations in TGFβ pathway genes and somatic mutations in the MAST2 kinase that could represent potential therapeutic targets.
Collapse
Affiliation(s)
- Xiaotong Li
- Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Yale Cancer Center, Breast Medical Oncology, Yale School of Medicine, 300 George Street, Suite 120, Rm133, New Haven, CT 06511 USA
| | - Sushant Kumar
- Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
| | - Arif Harmanci
- Center for Precision Health, School of Biomedical Informatics, University of Texas Health Science Center Houston, Houston, TX USA
| | - Shantao Li
- Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
| | - Robert R. Kitchen
- Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Medicine, Massachusetts General Hospital, Harvard Medical School, Boston, MA USA
| | - Yan Zhang
- Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Biomedical Informatics, College of Medicine, The Ohio State University, Columbus, OH USA
- The Ohio State University Comprehensive Cancer Center (OSUCCC – James), Columbus, OH USA
| | - Vikram B. Wali
- Yale Cancer Center, Breast Medical Oncology, Yale School of Medicine, 300 George Street, Suite 120, Rm133, New Haven, CT 06511 USA
| | - Sangeetha M. Reddy
- Division of Hematology/Oncology, Department of Internal Medicine, University of Texas Southwestern Medical Center, Dallas, TX USA
- Department of Breast Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX USA
| | - Wendy A. Woodward
- Morgan Welch Inflammatory Breast Cancer Research Program and Clinic, The University of Texas MD Anderson Cancer Center, Houston, TX USA
- Department of Radiation Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX USA
| | - James M. Reuben
- Morgan Welch Inflammatory Breast Cancer Research Program and Clinic, The University of Texas MD Anderson Cancer Center, Houston, TX USA
- Department of Hematopathology, The University of Texas MD Anderson Cancer Center, Houston, TX USA
| | - Joel Rozowsky
- Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
| | - Christos Hatzis
- Yale Cancer Center, Breast Medical Oncology, Yale School of Medicine, 300 George Street, Suite 120, Rm133, New Haven, CT 06511 USA
| | - Naoto T. Ueno
- Department of Breast Medical Oncology, The University of Texas MD Anderson Cancer Center, Houston, TX USA
- Morgan Welch Inflammatory Breast Cancer Research Program and Clinic, The University of Texas MD Anderson Cancer Center, Houston, TX USA
| | - Savitri Krishnamurthy
- Morgan Welch Inflammatory Breast Cancer Research Program and Clinic, The University of Texas MD Anderson Cancer Center, Houston, TX USA
| | - Lajos Pusztai
- Yale Cancer Center, Breast Medical Oncology, Yale School of Medicine, 300 George Street, Suite 120, Rm133, New Haven, CT 06511 USA
| | - Mark Gerstein
- Program in Computational Biology and Bioinformatics, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Molecular Biophysics and Biochemistry, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Computer Science, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
- Department of Statistics and Data Science, Yale University, 266 Whitney Ave., Bass 432A, New Haven, CT 06520 USA
| |
Collapse
|
16
|
Mc Cartney AM, Mahmoud M, Jochum M, Agustinho DP, Zorman B, Al Khleifat A, Dabbaghie F, K Kesharwani R, Smolka M, Dawood M, Albin D, Aliyev E, Almabrazi H, Arslan A, Balaji A, Behera S, Billingsley K, L Cameron D, Daw J, T. Dawson E, De Coster W, Du H, Dunn C, Esteban R, Jolly A, Kalra D, Liao C, Liu Y, Lu TY, M Havrilla J, M Khayat M, Marin M, Monlong J, Price S, Rafael Gener A, Ren J, Sagayaradj S, Sapoval N, Sinner C, C. Soto D, Soylev A, Subramaniyan A, Syed N, Tadimeti N, Tater P, Vats P, Vaughn J, Walker K, Wang G, Zeng Q, Zhang S, Zhao T, Kille B, Biederstedt E, Chaisson M, English A, Kronenberg Z, J. Treangen T, Hefferon T, Chin CS, Busby B, J Sedlazeck F. An international virtual hackathon to build tools for the analysis of structural variants within species ranging from coronaviruses to vertebrates. F1000Res 2021; 10:246. [PMID: 34621504 PMCID: PMC8479851 DOI: 10.12688/f1000research.51477.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 08/23/2021] [Indexed: 11/20/2022] Open
Abstract
In October 2020, 62 scientists from nine nations worked together remotely in the Second Baylor College of Medicine & DNAnexus hackathon, focusing on different related topics on Structural Variation, Pan-genomes, and SARS-CoV-2 related research. The overarching focus was to assess the current status of the field and identify the remaining challenges. Furthermore, how to combine the strengths of the different interests to drive research and method development forward. Over the four days, eight groups each designed and developed new open-source methods to improve the identification and analysis of variations among species, including humans and SARS-CoV-2. These included improvements in SV calling, genotyping, annotations and filtering. Together with advancements in benchmarking existing methods. Furthermore, groups focused on the diversity of SARS-CoV-2. Daily discussion summary and methods are available publicly at https://github.com/collaborativebioinformatics provides valuable insights for both participants and the research community.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Fawaz Dabbaghie
- Institute for Medical Biometry and Bioinformatics, Düsseldorf, Germany
| | | | | | | | | | | | | | - Ahmed Arslan
- Stanford University School of Medicine, California, USA
| | | | | | | | - Daniel L Cameron
- Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
| | - Joyjit Daw
- NVIDIA Corporation, Santa Clara, California, USA
| | | | | | - Haowei Du
- Baylor College of Medicine, Houston, USA
| | | | | | | | | | | | | | | | | | | | | | - Jean Monlong
- UC Santa Cruz Genomics Institute, Santa Cruz, USA
| | | | | | | | | | | | | | | | - Arda Soylev
- Konya Food and Agriculture University, Konya, Turkey
| | | | | | | | | | - Pankaj Vats
- NVIDIA Corporation, Santa Clara, California, USA
| | | | | | | | - Qiandong Zeng
- Laboratory Corporation of America Holdings, Westborough, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
17
|
Mc Cartney AM, Mahmoud M, Jochum M, Agustinho DP, Zorman B, Al Khleifat A, Dabbaghie F, K Kesharwani R, Smolka M, Dawood M, Albin D, Aliyev E, Almabrazi H, Arslan A, Balaji A, Behera S, Billingsley K, L Cameron D, Daw J, T. Dawson E, De Coster W, Du H, Dunn C, Esteban R, Jolly A, Kalra D, Liao C, Liu Y, Lu TY, M Havrilla J, M Khayat M, Marin M, Monlong J, Price S, Rafael Gener A, Ren J, Sagayaradj S, Sapoval N, Sinner C, C. Soto D, Soylev A, Subramaniyan A, Syed N, Tadimeti N, Tater P, Vats P, Vaughn J, Walker K, Wang G, Zeng Q, Zhang S, Zhao T, Kille B, Biederstedt E, Chaisson M, English A, Kronenberg Z, J. Treangen T, Hefferon T, Chin CS, Busby B, J Sedlazeck F. An international virtual hackathon to build tools for the analysis of structural variants within species ranging from coronaviruses to vertebrates. F1000Res 2021; 10:246. [PMID: 34621504 PMCID: PMC8479851 DOI: 10.12688/f1000research.51477.1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Accepted: 03/04/2021] [Indexed: 11/08/2023] Open
Abstract
In October 2020, 62 scientists from nine nations worked together remotely in the Second Baylor College of Medicine & DNAnexus hackathon, focusing on different related topics on Structural Variation, Pan-genomes, and SARS-CoV-2 related research. The overarching focus was to assess the current status of the field and identify the remaining challenges. Furthermore, how to combine the strengths of the different interests to drive research and method development forward. Over the four days, eight groups each designed and developed new open-source methods to improve the identification and analysis of variations among species, including humans and SARS-CoV-2. These included improvements in SV calling, genotyping, annotations and filtering. Together with advancements in benchmarking existing methods. Furthermore, groups focused on the diversity of SARS-CoV-2. Daily discussion summary and methods are available publicly at https://github.com/collaborativebioinformatics provides valuable insights for both participants and the research community.
Collapse
Affiliation(s)
| | | | | | | | | | | | - Fawaz Dabbaghie
- Institute for Medical Biometry and Bioinformatics, Düsseldorf, Germany
| | | | | | | | | | | | | | - Ahmed Arslan
- Stanford University School of Medicine, California, USA
| | | | | | | | - Daniel L Cameron
- Walter and Eliza Hall Institute of Medical Research, Melbourne, Australia
| | - Joyjit Daw
- NVIDIA Corporation, Santa Clara, California, USA
| | | | | | - Haowei Du
- Baylor College of Medicine, Houston, USA
| | | | | | | | | | | | | | | | | | | | | | - Jean Monlong
- UC Santa Cruz Genomics Institute, Santa Cruz, USA
| | | | | | | | | | | | | | | | - Arda Soylev
- Konya Food and Agriculture University, Konya, Turkey
| | | | | | | | | | - Pankaj Vats
- NVIDIA Corporation, Santa Clara, California, USA
| | | | | | | | - Qiandong Zeng
- Laboratory Corporation of America Holdings, Westborough, USA
| | | | | | | | | | | | | | | | | | | | | | | | | |
Collapse
|
18
|
Kimura H, Mori D, Aleksic B, Ozaki N. Elucidation of molecular pathogenesis and drug development for psychiatric disorders from rare disease-susceptibility variants. Neurosci Res 2020; 170:24-31. [PMID: 33316300 DOI: 10.1016/j.neures.2020.11.008] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2020] [Revised: 11/19/2020] [Accepted: 11/26/2020] [Indexed: 10/22/2022]
Abstract
Recent rapid progress in genome analysis and large-scale consortia has made it possible to discover variants with a variety of allele frequencies and effect sizes associated with psychiatric disorders. Among psychiatric disorder-susceptibility variants, rare variants with large effect sizes detected by sequencing analysis or array comparative genomic hybridization would be particularly useful for elucidating pathophysiology by developing disease models, such as genome-edited mouse or induced pluripotent stem cells. In the last decade, investigations of rare variants with large effect size have revealed an important role of neurodevelopment in the pathogenesis of psychiatric disorders. In future research, integration of recent evidence concerning the contribution of the immune system or gut microbiota will enhance our understanding of psychiatric disorders and facilitate novel drug development.
Collapse
Affiliation(s)
- Hiroki Kimura
- Department of Psychiatry, Nagoya University Graduate School of Medicine, Nagoya, Japan.
| | - Daisuke Mori
- Department of Psychiatry, Nagoya University Graduate School of Medicine, Nagoya, Japan; Brain & Mind Research Center, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Branko Aleksic
- Department of Psychiatry, Nagoya University Graduate School of Medicine, Nagoya, Japan
| | - Norio Ozaki
- Department of Psychiatry, Nagoya University Graduate School of Medicine, Nagoya, Japan; Brain & Mind Research Center, Nagoya University Graduate School of Medicine, Nagoya, Japan; Medical Genomics Center, Nagoya University Hospital, Nagoya, Japan
| |
Collapse
|
19
|
Van L, Heung T, Malecki SL, Fenn C, Tyrer A, Sanches M, Chow EW, Boot E, Corral M, Dash S, George SR, Bassett AS. 22q11.2 microdeletion and increased risk for type 2 diabetes. EClinicalMedicine 2020; 26:100528. [PMID: 33089125 PMCID: PMC7565196 DOI: 10.1016/j.eclinm.2020.100528] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/12/2020] [Revised: 08/12/2020] [Accepted: 08/14/2020] [Indexed: 01/21/2023] Open
Abstract
BACKGROUND The 22q11.2 microdeletion is the pathogenic copy number variation (CNV) associated with 22q11.2 deletion syndrome (22q11.2DS, formerly known as DiGeorge syndrome). Familiar endocrinological manifestations include hypoparathyroidism and hypothyroidism, with recent elucidation of elevated risk for obesity in adults. In this study, we aimed to determine whether adults with 22q11.2DS have an increased risk of developing type 2 diabetes (T2D). METHODS We studied the effect of the 22q11.2 microdeletion on risk for T2D, defined by history and glycosylated hemoglobin (HbA1c), using weighted survey data from the adult Canadian population (based on n = 11,874) and from a clinical cohort of adults with 22q11.2DS (n = 314), aged 17-69 years. Binomial logistic regression models accounted for age, sex, non-European ethnicity, family history of T2D, obesity, and antipsychotic medication use. FINDINGS The 22q11.2 microdeletion was a significant independent risk factor for T2D (OR 2·44, 95% CI 1·39-4·31), accounting for other factors (p < 0·0001). All factors except sex were also significant within 22q11.2DS. The median age at diagnosis of T2D was significantly younger in 22q11.2DS than in the Canadian population sample (32 vs 50 years, p < 0·0001). In adults without T2D, HbA1c was significantly higher in 22q11.2DS than the population (p = 0·042), after accounting for younger age of the 22q11.2DS group. INTERPRETATION The results support the 22q11.2 microdeletion as a novel independent risk factor and potential model for early onset T2D. The findings complement emerging evidence that rare CNVs may contribute to risk for T2D. The results have implications for precision medicine and research into the underlying pathogenesis of T2D.
Collapse
Affiliation(s)
- Lily Van
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- Department of Psychiatry, University of Toronto, Toronto, Ontario, Canada
| | - Tracy Heung
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- The Dalglish Family 22q Clinic, Toronto General Hospital, University Health Network, Toronto, Ontario, Canada
| | - Sarah L. Malecki
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Christian Fenn
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- Undergraduate Medical Education, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Andrea Tyrer
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- Undergraduate Medical Education, Faculty of Medicine, University of Toronto, Toronto, Ontario, Canada
| | - Marcos Sanches
- Biostatistical Consulting Service, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
| | - Eva W.C. Chow
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- Department of Psychiatry, University of Toronto, Toronto, Ontario, Canada
| | - Erik Boot
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- The Dalglish Family 22q Clinic, Toronto General Hospital, University Health Network, Toronto, Ontario, Canada
- Advisium, ’s Heeren Loo Zorggroep, Amersfoort, the Netherlands
| | - Maria Corral
- The Dalglish Family 22q Clinic, Toronto General Hospital, University Health Network, Toronto, Ontario, Canada
| | - Satya Dash
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada
- Banting & Best Diabetes Center, University of Toronto, Toronto, Ontario, Canada
- Division of Endocrinology, Toronto General Hospital, University Health Network, Toronto, Ontario, Canada
| | - Susan R. George
- Department of Medicine, University of Toronto, Toronto, Ontario, Canada
- Division of Endocrinology, Toronto General Hospital, University Health Network, Toronto, Ontario, Canada
- Department of Pharmacology, University of Toronto, Toronto, Ontario, Canada
| | - Anne S. Bassett
- Clinical Genetics Research Program, Centre for Addiction and Mental Health, Toronto, Ontario, Canada
- Department of Psychiatry, University of Toronto, Toronto, Ontario, Canada
- The Dalglish Family 22q Clinic, Toronto General Hospital, University Health Network, Toronto, Ontario, Canada
- Toronto Congenital Cardiac Centre for Adults, and Division of Cardiology, Department of Medicine, University Health Network, Toronto, Ontario, Canada
- Toronto General Research Institute and Campbell Family Mental Health Research Institute, Toronto, Ontario, Canada
- Corresponding author at: The Dalglish Family 22q Clinic, Toronto General Hospital, 200 Elizabeth Street, 8NU-802 Toronto, ON M4G 2C5, Canada.
| |
Collapse
|
20
|
Bi C, Wang L, Yuan B, Zhou X, Li Y, Wang S, Pang Y, Gao X, Huang Y, Li M. Long-read individual-molecule sequencing reveals CRISPR-induced genetic heterogeneity in human ESCs. Genome Biol 2020; 21:213. [PMID: 32831134 PMCID: PMC7444080 DOI: 10.1186/s13059-020-02143-8] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Accepted: 08/11/2020] [Indexed: 12/12/2022] Open
Abstract
Quantifying the genetic heterogeneity of a cell population is essential to understanding of biological systems. We develop a universal method to label individual DNA molecules for single-base-resolution haplotype-resolved quantitative characterization of diverse types of rare variants, with frequency as low as 4 × 10-5, using both short- or long-read sequencing platforms. It provides the first quantitative evidence of persistent nonrandom large structural variants and an increase in single-nucleotide variants at the on-target locus following repair of double-strand breaks induced by CRISPR-Cas9 in human embryonic stem cells.
Collapse
Affiliation(s)
- Chongwei Bi
- Laboratory of Stem Cell and Regeneration, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - Lin Wang
- Laboratory of Stem Cell and Regeneration, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
- Present address: Key Laboratory of Zoonosis Research, Ministry of Education, Institute of Zoonosis, College of Veterinary Medicine, Jilin University, Changchun, China
| | - Baolei Yuan
- Laboratory of Stem Cell and Regeneration, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - Xuan Zhou
- Laboratory of Stem Cell and Regeneration, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia
| | - Yu Li
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Sheng Wang
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Yuhong Pang
- Beijing Advanced Innovation Center for Genomics (ICG), Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, College of Chemistry, College of Engineering, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China
| | - Xin Gao
- Computational Bioscience Research Center (CBRC), Computer, Electrical and Mathematical Science and Engineering (CEMSE) Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Saudi Arabia
| | - Yanyi Huang
- Beijing Advanced Innovation Center for Genomics (ICG), Biomedical Pioneering Innovation Center (BIOPIC), School of Life Sciences, College of Chemistry, College of Engineering, Peking-Tsinghua Center for Life Sciences, Peking University, Beijing, China.
- Institute for Cell Analysis, Shenzhen Bay Laboratory, Shenzhen, China.
| | - Mo Li
- Laboratory of Stem Cell and Regeneration, Biological and Environmental Science and Engineering Division, King Abdullah University of Science and Technology (KAUST), Thuwal, 23955-6900, Kingdom of Saudi Arabia.
| |
Collapse
|
21
|
Magnusson M, Eisfeldt J, Nilsson D, Rosenbaum A, Wirta V, Lindstrand A, Wedell A, Stranneheim H. Loqusdb: added value of an observations database of local genomic variation. BMC Bioinformatics 2020; 21:273. [PMID: 32611382 PMCID: PMC7329469 DOI: 10.1186/s12859-020-03609-z] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/17/2020] [Accepted: 06/17/2020] [Indexed: 01/23/2023] Open
Abstract
BACKGROUND Exome and genome sequencing is becoming the method of choice for rare disease diagnostics. One of the key challenges remaining is distinguishing the disease causing variants from the benign background variation. After analysis and annotation of the sequencing data there are typically thousands of candidate variants requiring further investigation. One of the most effective and least biased ways to reduce this number is to assess the rarity of a variant in any population. Currently, there are a number of reliable sources of information for major population frequencies when considering single nucleotide variants (SNVs) and small insertion and deletions (INDELs), with gnomAD as the most prominent public resource available. However, local variation or frequencies in sub-populations may be underrepresented in these public resources. In contrast, for structural variation (SV), the background frequency in the general population is more or less unknown mostly due to challenges in calling SVs in a consistent way. Keeping track of local variation is one way to overcome these problems and significantly reduce the number of potential disease causing variants retained for manual inspection, both for SNVs and SVs. RESULTS Here, we present loqusdb, a tool to solve the challenge of keeping track of any type of variant observations from genome sequencing data. Loqusdb was designed to handle a large flow of samples and unlike other solutions, samples can be added continuously to the database without rebuilding it, facilitating improvements and additions. We assessed the added value of a local observations database using 98 samples annotated with information from a background of 888 unrelated individuals. CONCLUSIONS We show both how powerful SV analysis can be when filtering for population frequencies and how the number of apparently rare SNVs/INDELs can be reduced by adding local population information even after annotating the data with other large frequency databases, such as gnomAD. In conclusion, we show that a local frequency database is an attractive, and a necessary addition to the publicly available databases that facilitate the analysis of exome and genome data in a clinical setting.
Collapse
Affiliation(s)
- Måns Magnusson
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology, and Health, KTH Royal Institute of Technology, Stockholm, Sweden. .,Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden. .,Centre for Inherited Metabolic Diseases, Karolinska University Hospital, Stockholm, Sweden.
| | - Jesper Eisfeldt
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden.,Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden.,Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Daniel Nilsson
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden.,Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden
| | - Adam Rosenbaum
- Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology, and Health, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Valtteri Wirta
- Science for Life Laboratory, Department of Microbiology, Tumor and Cell Biology, Karolinska Institutet, Stockholm, Sweden.,Science for Life Laboratory, School of Engineering Sciences in Chemistry, Biotechnology, and Health, KTH Royal Institute of Technology, Stockholm, Sweden
| | - Anna Lindstrand
- Department of Clinical Genetics, Karolinska University Hospital, Stockholm, Sweden.,Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden.,Center for Molecular Medicine, Karolinska Institutet, Stockholm, Sweden
| | - Anna Wedell
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden.,Centre for Inherited Metabolic Diseases, Karolinska University Hospital, Stockholm, Sweden
| | - Henrik Stranneheim
- Department of Molecular Medicine and Surgery, Karolinska Institutet, Stockholm, Sweden.,Centre for Inherited Metabolic Diseases, Karolinska University Hospital, Stockholm, Sweden
| |
Collapse
|
22
|
Cremers FPM, Lee W, Collin RWJ, Allikmets R. Clinical spectrum, genetic complexity and therapeutic approaches for retinal disease caused by ABCA4 mutations. Prog Retin Eye Res 2020; 79:100861. [PMID: 32278709 PMCID: PMC7544654 DOI: 10.1016/j.preteyeres.2020.100861] [Citation(s) in RCA: 149] [Impact Index Per Article: 37.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2019] [Revised: 03/13/2020] [Accepted: 03/18/2020] [Indexed: 12/18/2022]
Abstract
The ABCA4 protein (then called a “rim protein”) was first
identified in 1978 in the rims and incisures of rod photoreceptors. The
corresponding gene, ABCA4, was cloned in 1997, and variants
were identified as the cause of autosomal recessive Stargardt disease (STGD1).
Over the next two decades, variation in ABCA4 has been
attributed to phenotypes other than the classically defined STGD1 or fundus
flavimaculatus, ranging from early onset and fast progressing cone-rod dystrophy
and retinitis pigmentosa-like phenotypes to very late onset cases of mostly mild
disease sometimes resembling, and confused with, age-related macular
degeneration. Similarly, analysis of the ABCA4 locus uncovered
a trove of genetic information, including >1200 disease-causing mutations
of varying severity, and of all types – missense, nonsense, small
deletions/insertions, and splicing affecting variants, of which many are located
deep-intronic. Altogether, this has greatly expanded our understanding of
complexity not only of the diseases caused by ABCA4 mutations,
but of all Mendelian diseases in general. This review provides an in depth
assessment of the cumulative knowledge of ABCA4-associated retinopathy –
clinical manifestations, genetic complexity, pathophysiology as well as current
and proposed therapeutic approaches.
Collapse
Affiliation(s)
- Frans P M Cremers
- Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, the Netherlands; Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, PO Box 9104, 6500 HE, Nijmegen, the Netherlands.
| | - Winston Lee
- Department of Ophthalmology, Columbia University, New York, NY, 10032, USA; Department of Genetics & Development, Columbia University, New York, NY, 10032, USA
| | - Rob W J Collin
- Department of Human Genetics, Radboud University Medical Center, PO Box 9101, 6500 HB, Nijmegen, the Netherlands; Donders Institute for Brain, Cognition and Behaviour, Radboud University Medical Center, PO Box 9104, 6500 HE, Nijmegen, the Netherlands
| | - Rando Allikmets
- Department of Ophthalmology, Columbia University, New York, NY, 10032, USA; Department of Pathology & Cell Biology, Columbia University, New York, NY, 10032, USA.
| |
Collapse
|
23
|
Amiri Ghanatsaman Z, Wang GD, Asadollahpour Nanaei H, Asadi Fozi M, Peng MS, Esmailizadeh A, Zhang YP. Whole genome resequencing of the Iranian native dogs and wolves to unravel variome during dog domestication. BMC Genomics 2020; 21:207. [PMID: 32131720 PMCID: PMC7057629 DOI: 10.1186/s12864-020-6619-8] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/25/2019] [Accepted: 02/25/2020] [Indexed: 11/21/2022] Open
Abstract
BACKGROUND Advances in genome technology have simplified a new comprehension of the genetic and historical processes crucial to rapid phenotypic evolution under domestication. To get new insight into the genetic basis of the dog domestication process, we conducted whole-genome sequence analysis of three wolves and three dogs from Iran which covers the eastern part of the Fertile Crescent located in Southwest Asia where the independent domestication of most of the plants and animals has been documented and also high haplotype sharing between wolves and dog breeds has been reported. RESULTS Higher diversity was found within the wolf genome compared with the dog genome. A total number of 12.45 million SNPs were detected in all individuals (10.45 and 7.82 million SNPs were identified for all the studied wolves and dogs, respectively) and a total number of 3.49 million small Indels were detected in all individuals (3.11 and 2.24 million small Indels were identified for all the studied wolves and dogs, respectively). A total of 10,571 copy number variation regions (CNVRs) were detected across the 6 individual genomes, covering 154.65 Mb, or 6.41%, of the reference genome (canFam3.1). Further analysis showed that the distribution of deleterious variants in the dog genome is higher than the wolf genome. Also, genomic annotation results from intron and intergenic regions showed that the proportion of variations in the wolf genome is higher than that in the dog genome, while the proportion of the coding sequences and 3'-UTR in the dog genome is higher than that in the wolf genome. The genes related to the olfactory and immune systems were enriched in the set of the structural variants (SVs) identified in this work. CONCLUSIONS Our results showed more deleterious mutations and coding sequence variants in the domestic dog genome than those in wolf genome. By providing the first Iranian dog and wolf variome map, our findings contribute to understanding the genetic architecture of the dog domestication.
Collapse
Affiliation(s)
- Zeinab Amiri Ghanatsaman
- Department of Animal Science, Faculty of Agriculture, Shahid Bahonar University of Kerman, PB 76169-133, Kerman, Iran
- Yong Researchers Society, Shahid Bahonar University of Kerman, PB 76169-133, Kerman, Iran
| | - Guo-Dong Wang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No. 32 Jiaochang Donglu, Kunming, 650223, Yunnan, China
| | - Hojjat Asadollahpour Nanaei
- Department of Animal Science, Faculty of Agriculture, Shahid Bahonar University of Kerman, PB 76169-133, Kerman, Iran
- Yong Researchers Society, Shahid Bahonar University of Kerman, PB 76169-133, Kerman, Iran
| | - Masood Asadi Fozi
- Department of Animal Science, Faculty of Agriculture, Shahid Bahonar University of Kerman, PB 76169-133, Kerman, Iran
| | - Min-Sheng Peng
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No. 32 Jiaochang Donglu, Kunming, 650223, Yunnan, China
| | - Ali Esmailizadeh
- Department of Animal Science, Faculty of Agriculture, Shahid Bahonar University of Kerman, PB 76169-133, Kerman, Iran.
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No. 32 Jiaochang Donglu, Kunming, 650223, Yunnan, China.
| | - Ya-Ping Zhang
- State Key Laboratory of Genetic Resources and Evolution, Kunming Institute of Zoology, Chinese Academy of Sciences, No. 32 Jiaochang Donglu, Kunming, 650223, Yunnan, China.
- State Key Laboratory for Conservation and Utilization of Bio-Resources in Yunnan, Yunnan University, Kunming, 650091, China.
| |
Collapse
|
24
|
Abstract
Structural variant (SV) differences between human genomes can cause germline and mosaic disease as well as inter-individual variation. De-regulation of accurate DNA repair and genomic surveillance mechanisms results in a large number of SVs in cancer. Analysis of the DNA sequences at SV breakpoints can help identify pathways of mutagenesis and regions of the genome that are more susceptible to rearrangement. Large-scale SV analyses have been enabled by high-throughput genome-level sequencing on humans in the past decade. These studies have shed light on the mechanisms and prevalence of complex genomic rearrangements. Recent advancements in both sequencing and other mapping technologies as well as calling algorithms for detection of genomic rearrangements have helped propel SV detection into population-scale studies, and have begun to elucidate previously inaccessible regions of the genome. Here, we discuss the genomic organization of simple and complex SVs, the molecular mechanisms of their formation, and various ways to detect them. We also introduce methods for characterizing SVs and their consequences on human genomes.
Collapse
Affiliation(s)
| | - Christine R Beck
- The Jackson Laboratory for Genomic Medicine, Farmington, CT, 06032, USA.
- Department of Genetics and Genome Sciences, Institute for Systems Genomics, University of Connecticut Health Center, Farmington, CT, 06030, USA.
| |
Collapse
|
25
|
Yokoyama TT, Sakamoto Y, Seki M, Suzuki Y, Kasahara M. MoMI-G: modular multi-scale integrated genome graph browser. BMC Bioinformatics 2019; 20:548. [PMID: 31690272 PMCID: PMC6833150 DOI: 10.1186/s12859-019-3145-2] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/11/2019] [Accepted: 10/09/2019] [Indexed: 01/30/2023] Open
Abstract
Background Genome graph is an emerging approach for representing structural variants on genomes with branches. For example, representing structural variants of cancer genomes as a genome graph is more natural than representing such genomes as differences from the linear reference genome. While more and more structural variants are being identified by long-read sequencing, many of them are difficult to visualize using existing structural variants visualization tools. To this end, visualization method for large genome graphs such as human cancer genome graphs is demanded. Results We developed MOdular Multi-scale Integrated Genome graph browser, MoMI-G, a web-based genome graph browser that can visualize genome graphs with structural variants and supporting evidences such as read alignments, read depth, and annotations. This browser allows more intuitive recognition of large, nested, and potentially more complex structural variations. MoMI-G has view modules for different scales, which allow users to view the whole genome down to nucleotide-level alignments of long reads. Alignments spanning reference alleles and those spanning alternative alleles are shown in the same view. Users can customize the view, if they are not satisfied with the preset views. In addition, MoMI-G has Interval Card Deck, a feature for rapid manual inspection of hundreds of structural variants. Herein, we describe the utility of MoMI-G by using representative examples of large and nested structural variations found in two cell lines, LC-2/ad and CHM1. Conclusions Users can inspect complex and large structural variations found by long-read analysis in large genomes such as human genomes more smoothly and more intuitively. In addition, users can easily filter out false positives by manually inspecting hundreds of identified structural variants with supporting long-read alignments and annotations in a short time. Software availability MoMI-G is freely available at https://github.com/MoMI-G/MoMI-G under the MIT license.
Collapse
Affiliation(s)
- Toshiyuki T Yokoyama
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Yoshitaka Sakamoto
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Masahide Seki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Yutaka Suzuki
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan
| | - Masahiro Kasahara
- Department of Computational Biology and Medical Sciences, Graduate School of Frontier Sciences, The University of Tokyo, Chiba, Japan.
| |
Collapse
|
26
|
Qi Y, Pradhan D, El-Kebir M. Implications of non-uniqueness in phylogenetic deconvolution of bulk DNA samples of tumors. Algorithms Mol Biol 2019; 14:19. [PMID: 31497065 PMCID: PMC6719395 DOI: 10.1186/s13015-019-0155-6] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/11/2019] [Accepted: 08/17/2019] [Indexed: 12/11/2022] Open
Abstract
Background Tumors exhibit extensive intra-tumor heterogeneity, the presence of groups of cellular populations with distinct sets of somatic mutations. This heterogeneity is the result of an evolutionary process, described by a phylogenetic tree. In addition to enabling clinicians to devise patient-specific treatment plans, phylogenetic trees of tumors enable researchers to decipher the mechanisms of tumorigenesis and metastasis. However, the problem of reconstructing a phylogenetic tree T given bulk sequencing data from a tumor is more complicated than the classic phylogeny inference problem. Rather than observing the leaves of T directly, we are given mutation frequencies that are the result of mixtures of the leaves of T. The majority of current tumor phylogeny inference methods employ the perfect phylogeny evolutionary model. The underlying Perfect Phylogeny Mixture (PPM) combinatorial problem typically has multiple solutions. Results We prove that determining the exact number of solutions to the PPM problem is #P-complete and hard to approximate within a constant factor. Moreover, we show that sampling solutions uniformly at random is hard as well. On the positive side, we provide a polynomial-time computable upper bound on the number of solutions and introduce a simple rejection-sampling based scheme that works well for small instances. Using simulated and real data, we identify factors that contribute to and counteract non-uniqueness of solutions. In addition, we study the sampling performance of current methods, identifying significant biases. Conclusions Awareness of non-uniqueness of solutions to the PPM problem is key to drawing accurate conclusions in downstream analyses based on tumor phylogenies. This work provides the theoretical foundations for non-uniqueness of solutions in tumor phylogeny inference from bulk DNA samples.
Collapse
|
27
|
Au CH, Ho DN, Ip BBK, Wan TSK, Ng MHL, Chiu EKW, Chan TL, Ma ESK. Rapid detection of chromosomal translocation and precise breakpoint characterization in acute myeloid leukemia by nanopore long-read sequencing. Cancer Genet 2019; 239:22-25. [PMID: 31473470 DOI: 10.1016/j.cancergen.2019.08.005] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 07/21/2019] [Accepted: 08/21/2019] [Indexed: 12/11/2022]
Abstract
Detection of chromosomal translocation is a key component in diagnosis and management of acute myeloid leukemia (AML). Targeted RNA next-generation sequencing (NGS) is emerging as a powerful and clinically practical tool, but it depends on expression of RNA transcript from the underlying DNA translocation. Here, we show the clinical utility of nanopore long-read sequencing in rapidly detecting DNA translocation with exact breakpoints. In a newly diagnosed patient with AML, conventional karyotyping showed translocation t(10;12)(q22;p13) but RNA NGS detected NUP98-NSD1 fusion transcripts from a known cryptic translocation t(5;11)(q35;p15). Rapid PCR-free nanopore whole-genome sequencing yielded a 26,194 bp sequencing read and revealed the t(10;12) breakpoint to be DUSP13 and GRIN2B in head-to-head configuration. This translocation was then classified as a passenger structural variant. The sequencing also yielded a 20,709 bp sequencing read and revealed the t(5;11) breakpoint of the driver NUP98-NSD1 fusion. The identified DNA breakpoints also served as markers for molecular monitoring, in addition to fusion transcript expression by digital PCR and sequence mutations by NGS. We illustrate that third-generation nanopore sequencing is a simple and low-cost workflow for DNA translocation detection.
Collapse
Affiliation(s)
- Chun Hang Au
- Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium and Hospital, 1/F Li Shu Fan Block, 2 Village Road, Happy Valley, Hong Kong.
| | - Dona N Ho
- Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium and Hospital, 1/F Li Shu Fan Block, 2 Village Road, Happy Valley, Hong Kong.
| | - Beca B K Ip
- Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium and Hospital, 1/F Li Shu Fan Block, 2 Village Road, Happy Valley, Hong Kong.
| | - Thomas S K Wan
- Blood Cancer Cytogenetics and Genomics Laboratory, Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Prince of Wales Hospital, Hong Kong.
| | - Margaret H L Ng
- Blood Cancer Cytogenetics and Genomics Laboratory, Department of Anatomical and Cellular Pathology, The Chinese University of Hong Kong, Prince of Wales Hospital, Hong Kong.
| | - Edmond K W Chiu
- Honorary Consultant in Hematology and Hematological Oncology, Hong Kong Sanatorium and Hospital, Hong Kong.
| | - Tsun Leung Chan
- Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium and Hospital, 1/F Li Shu Fan Block, 2 Village Road, Happy Valley, Hong Kong.
| | - Edmond S K Ma
- Division of Molecular Pathology, Department of Pathology, Hong Kong Sanatorium and Hospital, 1/F Li Shu Fan Block, 2 Village Road, Happy Valley, Hong Kong.
| |
Collapse
|
28
|
Truty R, Paul J, Kennemer M, Lincoln SE, Olivares E, Nussbaum RL, Aradhya S. Prevalence and properties of intragenic copy-number variation in Mendelian disease genes. Genet Med 2018; 21:114-123. [PMID: 29895855 PMCID: PMC6752305 DOI: 10.1038/s41436-018-0033-5] [Citation(s) in RCA: 113] [Impact Index Per Article: 18.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2018] [Accepted: 03/22/2018] [Indexed: 12/20/2022] Open
Abstract
Purpose We investigated the frequencies and characteristics of intragenic copy-number variants (CNVs) in a deep sampling of disease genes associated with monogenic disorders. Methods Subsets of 1507 genes were tested using next-generation sequencing to simultaneously detect sequence variants and CNVs in >143,000 individuals referred for genetic testing. We analyzed CNVs in gene panels for hereditary cancer syndromes and cardiovascular, neurological, or pediatric disorders. Results Our analysis identified 2844 intragenic CNVs in 384 clinically tested genes. CNVs were observed in 1.9% of the entire cohort but in a disproportionately high fraction (9.8%) of individuals with a clinically significant result. CNVs accounted for 4.7–35% of pathogenic variants, depending on clinical specialty. Distinct patterns existed among CNVs in terms of copy number, location, exons affected, clinical classification, and genes affected. Separately, analysis of de-identified data for 599 genes unrelated to the clinical phenotype yielded 4054 CNVs. Most of these CNVs were novel rare events, present as duplications, and enriched in genes associated with recessive disorders or lacking loss-of-function mutational mechanisms. Conclusion Universal intragenic CNV analysis adds substantial clinical sensitivity to genetic testing. Clinically relevant CNVs have distinct properties that distinguish them from CNVs contributing to normal variation in human disease genes.
Collapse
Affiliation(s)
| | | | | | | | | | - Robert L Nussbaum
- Invitae, San Francisco, CA, USA.,Volunteer Clinical Faculty, University of California, San Francisco, CA, USA
| | | |
Collapse
|
29
|
Abstract
Background Genomic structural variants (SV) play a significant role in the onset and progression of cancer. Genomic deletions can create oncogenic fusion genes or cause the loss of tumor suppressing gene function which can lead to tumorigenesis by downregulating these genes. Detecting these variants has clinical importance in the treatment of diseases. Furthermore, it is also clinically important to detect their breakpoint boundaries at high resolution. We have generalized the framework of a previously-published algorithm that located translocations, and we have applied that framework to develop a method to locate deletions at base pair level using next-generation sequencing data. Our method uses abnormally mapped read pairs, and then subsequently maps split reads to identify precise breakpoints. Results On a primary prostate cancer dataset and a simulated dataset, our method predicted the number, type, and breakpoints of biologically validated SVs at high accuracy. It also outperformed two existing algorithms on precise breakpoint prediction, which is clinically important. Conclusion Our algorithm, called Pegasus, accurately calls deletion breakpoints. However, the method must be extended to allow for germline variant filtering and heterozygous deletion detection. The source code that implements Pegasus can be downloaded from the following URL: http://github.com/mhayes20/Pegasus.
Collapse
|
30
|
Harrison A, Mashon RS, Kakkar N, Das S. Clinico-Hematological Profile of Hb Q India: An Uncommon Hemoglobin Variant. Indian J Hematol Blood Transfus 2017; 34:299-303. [PMID: 29622873 DOI: 10.1007/s12288-017-0864-2] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/13/2017] [Accepted: 08/08/2017] [Indexed: 11/29/2022] Open
Abstract
Inherited hemoglobin disorders include thalassemias and structural variants like HbS, HbE, and HbD, Hb Lepore, HbD-Iran, Hb-H disease and HbQ India. HbQ India is an uncommon alpha-chain structural hemoglobin variant seen in North and West India. Patients are mostly asymptomatic and often present in the heterozygous state or co-inherited with beta-thalassaemia. This study was done in a tertiary care teaching hospital in North India over a period of 7 years among patients referred from antenatal and other clinics for screening of hemoglobin disorders. Complete blood count, peripheral blood smear examination and cation exchange high performance liquid chromatography (HPLC) was done to quantify various hemoglobins. HbQ India was diagnosed if the unknown variant hemoglobin was detected within the characteristic retention window. Of a total of 7530 patients screened, 31 (0.4%) were detected to have HbQ India. Of these, 25 (0.3%) patients had HbQ India trait and 6 (0.1%) patients had compound heterozygosity for HbQ India and Beta Thalassemia trait (HbQ India-BTT). All patients were clinically asymptomatic and were detected as part of the screening for hemoglobin disorders. Only two patients with HbQ India-BTT had hemoglobin less than 10 g/dL. In 25 patients with HbQ India trait, HbQ ranged from 13.6 to 24.4% and in 6 patients with HbQ India-BTT, HbQ India ranged from 7.4 to 9.0%. HbQ India is an uncommon structural hemoglobin variant. Although asymptomatic, it may cause diagnostic difficulty in the compound heterozygous state with beta thalassemia. HPLC provides a rapid, accurate and reproducible method for screening of this condition to identify and counsel individuals.
Collapse
Affiliation(s)
- Aradhana Harrison
- 1Department of Pathology, Christian Medical College & Hospital, Brown Road, Ludhiana, Punjab 141008 India
| | - Ranjeet Singh Mashon
- 1Department of Pathology, Christian Medical College & Hospital, Brown Road, Ludhiana, Punjab 141008 India
| | - Naveen Kakkar
- 1Department of Pathology, Christian Medical College & Hospital, Brown Road, Ludhiana, Punjab 141008 India
| | - Sheila Das
- 2HOD Hematopathology Section, Believers Church Medical College Hospital, St. Thomas Nagar, Kuttapuzha, Pathanamthitta, Thiruvalla, Kerala India
| |
Collapse
|
31
|
Guo Y, Dai Y, Yu H, Zhao S, Samuels DC, Shyr Y. Improvements and impacts of GRCh38 human reference on high throughput sequencing data analysis. Genomics 2017; 109:83-90. [PMID: 28131802 DOI: 10.1016/j.ygeno.2017.01.005] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2016] [Revised: 01/16/2017] [Accepted: 01/24/2017] [Indexed: 12/31/2022]
Abstract
Analyses of high throughput sequencing data starts with alignment against a reference genome, which is the foundation for all re-sequencing data analyses. Each new release of the human reference genome has been augmented with improved accuracy and completeness. It is presumed that the latest release of human reference genome, GRCh38 will contribute more to high throughput sequencing data analysis by providing more accuracy. But the amount of improvement has not yet been quantified. We conducted a study to compare the genomic analysis results between the GRCh38 reference and its predecessor GRCh37. Through analyses of alignment, single nucleotide polymorphisms, small insertion/deletions, copy number and structural variants, we show that GRCh38 offers overall more accurate analysis of human sequencing data. More importantly, GRCh38 produced fewer false positive structural variants. In conclusion, GRCh38 is an improvement over GRCh37 not only from the genome assembly aspect, but also yields more reliable genomic analysis results.
Collapse
|
32
|
Knief U, Hemmrich-Stanisak G, Wittig M, Franke A, Griffith SC, Kempenaers B, Forstmeier W. Fitness consequences of polymorphic inversions in the zebra finch genome. Genome Biol 2016; 17:199. [PMID: 27687629 PMCID: PMC5043542 DOI: 10.1186/s13059-016-1056-3] [Citation(s) in RCA: 40] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2016] [Accepted: 09/05/2016] [Indexed: 12/21/2022] Open
Abstract
Background Inversion polymorphisms constitute an evolutionary puzzle: they should increase embryo mortality in heterokaryotypic individuals but still they are widespread in some taxa. Some insect species have evolved mechanisms to reduce the cost of embryo mortality but humans have not. In birds, a detailed analysis is missing although intraspecific inversion polymorphisms are regarded as common. In Australian zebra finches (Taeniopygia guttata), two polymorphic inversions are known cytogenetically and we set out to detect these two and potentially additional inversions using genomic tools and study their effects on embryo mortality and other fitness-related and morphological traits. Results Using whole-genome SNP data, we screened 948 wild zebra finches for polymorphic inversions and describe four large (12–63 Mb) intraspecific inversion polymorphisms with allele frequencies close to 50 %. Using additional data from 5229 birds and 9764 eggs from wild and three captive zebra finch populations, we show that only the largest inversions increase embryo mortality in heterokaryotypic males, with surprisingly small effect sizes. We test for a heterozygote advantage on other fitness components but find no evidence for heterosis for any of the inversions. Yet, we find strong additive effects on several morphological traits. Conclusions The mechanism that has carried the derived inversion haplotypes to such high allele frequencies remains elusive. It appears that selection has effectively minimized the costs associated with inversions in zebra finches. The highly skewed distribution of recombination events towards the chromosome ends in zebra finches and other estrildid species may function to minimize crossovers in the inverted regions. Electronic supplementary material The online version of this article (doi:10.1186/s13059-016-1056-3) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Ulrich Knief
- Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Ornithology, 82319, Seewiesen, Germany. .,Current address: Division of Evolutionary Biology, Faculty of Biology, Ludwig Maximilian University of Munich, 82152, Planegg-Martinsried, Germany.
| | - Georg Hemmrich-Stanisak
- Institute of Clinical Molecular Biology, Christian-Albrechts-University, 24105, Kiel, Germany
| | - Michael Wittig
- Institute of Clinical Molecular Biology, Christian-Albrechts-University, 24105, Kiel, Germany
| | - Andre Franke
- Institute of Clinical Molecular Biology, Christian-Albrechts-University, 24105, Kiel, Germany
| | - Simon C Griffith
- Department of Biological Sciences, Macquarie University, Sydney, NSW, 2109, Australia.,School of Biological, Earth & Environmental Sciences, University of New South Wales, Sydney, NSW, 2057, Australia
| | - Bart Kempenaers
- Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Ornithology, 82319, Seewiesen, Germany
| | - Wolfgang Forstmeier
- Department of Behavioural Ecology and Evolutionary Genetics, Max Planck Institute for Ornithology, 82319, Seewiesen, Germany
| |
Collapse
|
33
|
Abstract
Next-generation sequencing (NGS) technologies have rapidly evolved in the last 5 years, leading to the generation of millions of short reads in a single run. Consequently, various sequence alignment algorithms have been developed to compare these reads to an appropriate reference in order to perform important downstream analysis. SOAP2 from the SOAP series is one of the most commonly used alignment programs to handle NGS data, and it efficiently does so using low computer memory usage and fast alignment speed. This chapter describes the protocol used to align short reads to a reference genome using SOAP2, and highlights the significance of using the in-built command-line options to tune the behavior of the algorithm according to the inputs and the desired results.
Collapse
|