1
|
Schlapbach LJ, Ganesamoorthy D, Wilson C, Raman S, George S, Snelling PJ, Phillips N, Irwin A, Sharp N, Le Marsney R, Chavan A, Hempenstall A, Bialasiewicz S, MacDonald AD, Grimwood K, Kling JC, McPherson SJ, Blumenthal A, Kaforou M, Levin M, Herberg JA, Gibbons KS, Coin LJM. Host gene expression signatures to identify infection type and organ dysfunction in children evaluated for sepsis: a multicentre cohort study. Lancet Child Adolesc Health 2024; 8:325-338. [PMID: 38513681 DOI: 10.1016/s2352-4642(24)00017-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/19/2023] [Revised: 01/14/2024] [Accepted: 01/15/2024] [Indexed: 03/23/2024]
Abstract
BACKGROUND Sepsis is defined as dysregulated host response to infection that leads to life-threatening organ dysfunction. Biomarkers characterising the dysregulated host response in sepsis are lacking. We aimed to develop host gene expression signatures to predict organ dysfunction in children with bacterial or viral infection. METHODS This cohort study was done in emergency departments and intensive care units of four hospitals in Queensland, Australia, and recruited children aged 1 month to 17 years who, upon admission, underwent a diagnostic test, including blood cultures, for suspected sepsis. Whole-blood RNA sequencing of blood was performed with Illumina NovaSeq (San Diego, CA, USA). Samples with completed phenotyping, monitoring, and RNA extraction by March 31, 2020, were included in the discovery cohort; samples collected or completed thereafter and by Oct 27, 2021, constituted the Rapid Paediatric Infection Diagnosis in Sepsis (RAPIDS) internal validation cohort. An external validation cohort was assembled from RNA sequencing gene expression count data from the observational European Childhood Life-threatening Infectious Disease Study (EUCLIDS), which recruited children with severe infection in nine European countries between 2012 and 2016. Feature selection approaches were applied to derive novel gene signatures for disease class (bacterial vs viral infection) and disease severity (presence vs absence of organ dysfunction 24 h post-sampling). The primary endpoint was the presence of organ dysfunction 24 h after blood sampling in the presence of confirmed bacterial versus viral infection. Gene signature performance is reported as area under the receiver operating characteristic curves (AUCs) and 95% CI. FINDINGS Between Sept 25, 2017, and Oct 27, 2021, 907 patients were enrolled. Blood samples from 595 patients were included in the discovery cohort, and samples from 312 children were included in the RAPIDS validation cohort. We derived a ten-gene disease class signature that achieved an AUC of 94·1% (95% CI 90·6-97·7) in distinguishing bacterial from viral infections in the RAPIDS validation cohort. A ten-gene disease severity signature achieved an AUC of 82·2% (95% CI 76·3-88·1) in predicting organ dysfunction within 24 h of sampling in the RAPIDS validation cohort. Used in tandem, the disease class and disease severity signatures predicted organ dysfunction within 24 h of sampling with an AUC of 90·5% (95% CI 83·3-97·6) for patients with predicted bacterial infection and 94·7% (87·8-100·0) for patients with predicted viral infection. In the external EUCLIDS validation dataset (n=362), the disease class and disease severity predicted organ dysfunction at time of sampling with an AUC of 70·1% (95% CI 44·1-96·2) for patients with predicted bacterial infection and 69·6% (53·1-86·0) for patients with predicted viral infection. INTERPRETATION In children evaluated for sepsis, novel host transcriptomic signatures specific for bacterial and viral infection can identify dysregulated host response leading to organ dysfunction. FUNDING Australian Government Medical Research Future Fund Genomic Health Futures Mission, Children's Hospital Foundation Queensland, Brisbane Diamantina Health Partners, Emergency Medicine Foundation, Gold Coast Hospital Foundation, Far North Queensland Foundation, Townsville Hospital and Health Services SERTA Grant, and Australian Infectious Diseases Research Centre.
Collapse
Affiliation(s)
- Luregn J Schlapbach
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia; Department of Intensive Care and Neonatology, and Children's Research Center, University Children's Hospital Zurich, University of Zurich, Zurich, Switzerland; Paediatric Intensive Care Unit, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia.
| | - Devika Ganesamoorthy
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia
| | - Clare Wilson
- Section of Paediatric Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
| | - Sainath Raman
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia; Paediatric Intensive Care Unit, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia
| | - Shane George
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia; Department of Emergency Medicine, Gold Coast University Hospital, Southport, QLD, Australia; School of Medicine and Dentistry and the Menzies Health Institute Queensland, Griffith University, Southport, QLD, Australia
| | - Peter J Snelling
- Department of Emergency Medicine, Gold Coast University Hospital, Southport, QLD, Australia; School of Medicine and Dentistry and the Menzies Health Institute Queensland, Griffith University, Southport, QLD, Australia
| | - Natalie Phillips
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia; Emergency Department, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia
| | - Adam Irwin
- Faculty of Medicine, UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia; Infection Management and Prevention Services, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia
| | - Natalie Sharp
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia; Paediatric Intensive Care Unit, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia
| | - Renate Le Marsney
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia
| | - Arjun Chavan
- Paediatric Intensive Care Unit, Townsville University Hospital, Townsville, QLD, Australia
| | | | - Seweryn Bialasiewicz
- School of Chemistry and Molecular Biosciences, The Australian Centre for Ecogenomics, and Queensland Paediatric Infectious Diseases Laboratory, The University of Queensland, Brisbane, QLD, Australia
| | - Anna D MacDonald
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia
| | - Keith Grimwood
- School of Medicine and Dentistry and the Menzies Health Institute Queensland, Griffith University, Southport, QLD, Australia; Department of Infectious Disease and Paediatrics, Gold Coast Health, Southport, QLD, Australia
| | - Jessica C Kling
- Frazer Institute, The University of Queensland, Brisbane, QLD, Australia
| | | | - Antje Blumenthal
- Frazer Institute, The University of Queensland, Brisbane, QLD, Australia
| | - Myrsini Kaforou
- Section of Paediatric Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
| | - Michael Levin
- Section of Paediatric Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
| | - Jethro A Herberg
- Section of Paediatric Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
| | - Kristen S Gibbons
- Children's Intensive Care Research Program, Child Health Research Centre, The University of Queensland, Brisbane, QLD, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia; Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
2
|
Hall MB, Coin LJM. Pangenome databases improve host removal and mycobacteria classification from clinical metagenomic data. Gigascience 2024; 13:giae010. [PMID: 38573185 PMCID: PMC10993716 DOI: 10.1093/gigascience/giae010] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/18/2023] [Revised: 01/10/2024] [Accepted: 02/27/2024] [Indexed: 04/05/2024] Open
Abstract
BACKGROUND Culture-free real-time sequencing of clinical metagenomic samples promises both rapid pathogen detection and antimicrobial resistance profiling. However, this approach introduces the risk of patient DNA leakage. To mitigate this risk, we need near-comprehensive removal of human DNA sequences at the point of sequencing, typically involving the use of resource-constrained devices. Existing benchmarks have largely focused on the use of standardized databases and largely ignored the computational requirements of depletion pipelines as well as the impact of human genome diversity. RESULTS We benchmarked host removal pipelines on simulated and artificial real Illumina and Nanopore metagenomic samples. We found that construction of a custom kraken database containing diverse human genomes results in the best balance of accuracy and computational resource usage. In addition, we benchmarked pipelines using kraken and minimap2 for taxonomic classification of Mycobacterium reads using standard and custom databases. With a database representative of the Mycobacterium genus, both tools obtained improved specificity and sensitivity, compared to the standard databases for classification of Mycobacterium tuberculosis. Computational efficiency of these custom databases was superior to most standard approaches, allowing them to be executed on a laptop device. CONCLUSIONS Customized pangenome databases provide the best balance of accuracy and computational efficiency when compared to standard databases for the task of human read removal and M. tuberculosis read classification from metagenomic samples. Such databases allow for execution on a laptop, without sacrificing accuracy, an especially important consideration in low-resource settings. We make all customized databases and pipelines freely available.
Collapse
Affiliation(s)
- Michael B Hall
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, 3000 Victoria, Australia
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, 3000 Victoria, Australia
| |
Collapse
|
3
|
Jackson HR, Zandstra J, Menikou S, Hamilton MS, McArdle AJ, Fischer R, Thorne AM, Huang H, Tanck MW, Jansen MH, De T, Agyeman PKA, Von Both U, Carrol ED, Emonts M, Eleftheriou I, Van der Flier M, Fink C, Gloerich J, De Groot R, Moll HA, Pokorn M, Pollard AJ, Schlapbach LJ, Tsolia MN, Usuf E, Wright VJ, Yeung S, Zavadska D, Zenz W, Coin LJM, Casals-Pascual C, Cunnington AJ, Martinon-Torres F, Herberg JA, de Jonge MI, Levin M, Kuijpers TW, Kaforou M. A multi-platform approach to identify a blood-based host protein signature for distinguishing between bacterial and viral infections in febrile children (PERFORM): a multi-cohort machine learning study. Lancet Digit Health 2023; 5:e774-e785. [PMID: 37890901 DOI: 10.1016/s2589-7500(23)00149-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/28/2022] [Revised: 06/08/2023] [Accepted: 07/26/2023] [Indexed: 10/29/2023]
Abstract
BACKGROUND Differentiating between self-resolving viral infections and bacterial infections in children who are febrile is a common challenge, causing difficulties in identifying which individuals require antibiotics. Studying the host response to infection can provide useful insights and can lead to the identification of biomarkers of infection with diagnostic potential. This study aimed to identify host protein biomarkers for future development into an accurate, rapid point-of-care test that can distinguish between bacterial and viral infections, by recruiting children presenting to health-care settings with fever or a history of fever in the previous 72 h. METHODS In this multi-cohort machine learning study, patient data were taken from EUCLIDS, the Swiss Pediatric Sepsis study, the GENDRES study, and the PERFORM study, which were all based in Europe. We generated three high-dimensional proteomic datasets (SomaScan and two via liquid chromatography tandem mass spectrometry, referred to as MS-A and MS-B) using targeted and untargeted platforms (SomaScan and liquid chromatography mass spectrometry). Protein biomarkers were then shortlisted using differential abundance analysis, feature selection using forward selection-partial least squares (FS-PLS; 100 iterations), along with a literature search. Identified proteins were tested with Luminex and ELISA and iterative FS-PLS was done again (25 iterations) on the Luminex results alone, and the Luminex and ELISA results together. A sparse protein signature for distinguishing between bacterial and viral infections was identified from the selected proteins. The performance of this signature was finally tested using Luminex assays and by calculating disease risk scores. FINDINGS 376 children provided serum or plasma samples for use in the discovery of protein biomarkers. 79 serum samples were collected for the generation of the SomaScan dataset, 147 plasma samples for the MS-A dataset, and 150 plasma samples for the MS-B dataset. Differential abundance analysis, and the first round of feature selection using FS-PLS identified 35 protein biomarker candidates, of which 13 had commercial ELISA or Luminex tests available. 16 proteins with ELISA or Luminex tests available were identified by literature review. Further evaluation via Luminex and ELISA and the second round of feature selection using FS-PLS revealed a six-protein signature: three of the included proteins are elevated in bacterial infections (SELE, NGAL, and IFN-γ), and three are elevated in viral infections (IL18, NCAM1, and LG3BP). Performance testing of the signature using Luminex assays revealed area under the receiver operating characteristic curve values between 89·4% and 93·6%. INTERPRETATION This study has led to the identification of a protein signature that could be ultimately developed into a blood-based point-of-care diagnostic test for rapidly diagnosing bacterial and viral infections in febrile children. Such a test has the potential to greatly improve care of children who are febrile, ensuring that the correct individuals receive antibiotics. FUNDING European Union's Horizon 2020 research and innovation programme, the European Union's Seventh Framework Programme (EUCLIDS), Imperial Biomedical Research Centre of the National Institute for Health Research, the Wellcome Trust and Medical Research Foundation, Instituto de Salud Carlos III, Consorcio Centro de Investigación Biomédica en Red de Enfermedades Respiratorias, Grupos de Refeencia Competitiva, Swiss State Secretariat for Education, Research and Innovation.
Collapse
Affiliation(s)
- Heather R Jackson
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Judith Zandstra
- Sanquin Research and Landsteiner Laboratory, Department of Immunopathology, Sanquin Blood Supply, Amsterdam University Medical Center (UMC), Amsterdam, Netherlands; Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam University Medical Center (UMC), Amsterdam, Netherlands
| | - Stephanie Menikou
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Melissa Shea Hamilton
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Andrew J McArdle
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Roman Fischer
- Discovery Proteomics Facility, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Adam M Thorne
- Department of Surgery, Section of Hepatobiliary Surgery and Liver Transplantation, University of Groningen, University Medical Center Groningen, Groningen, Netherlands
| | - Honglei Huang
- Target Discovery Institute, Nuffield Department of Medicine, University of Oxford, Oxford, UK
| | - Michael W Tanck
- Department of Epidemiology and Data Science, Amsterdam University Medical Center (UMC), Amsterdam, Netherlands
| | - Machiel H Jansen
- Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam University Medical Center (UMC), Amsterdam, Netherlands
| | - Tisham De
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Philipp K A Agyeman
- Department of Pediatrics, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Ulrich Von Both
- Infectious Diseases, Department of Pediatrics, Dr von Hauner Children's Hospital, University Hospital, LMU Munich, Munich, Germany
| | - Enitan D Carrol
- Department of Clinical Infection Microbiology and Immunology, University of Liverpool Institute of Infection, Veterinary and Ecological Sciences, University of Liverpool, Liverpool, UK
| | - Marieke Emonts
- Paediatric Infectious Diseases and Immunology Department, Newcastle upon Tyne Hospitals Foundation Trust, Great North Children's Hospital, Newcastle upon Tyne, UK
| | - Irini Eleftheriou
- Second Department of Paediatrics, National and Kapodistrian University of Athens (NKUA), School of Medicine, Panagiotis & Aglaia, Kyriakou Children's Hospital, Athens, Greece
| | - Michiel Van der Flier
- Paediatric Infectious Diseases and Immunology, Wilhelmina Children's Hospital, University Medical Center Utrecht, Utrecht, Netherlands; Pediatric Infectious Diseases and Immunology Amalia Children's Hospital, Department of Laboratory Medicine, Radboud Institute of Molecular Life Sciences, Radboud UMC, Nijmegen, Netherlands; Laboratory of Infectious Diseases, Department of Laboratory Medicine, Radboud Institute of Molecular Life Sciences, Radboud UMC, Nijmegen, Netherlands
| | - Colin Fink
- Micropathology, University of Warwick, Warwick, UK
| | - Jolein Gloerich
- Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud Institute of Molecular Life Sciences, Radboud UMC, Nijmegen, Netherlands
| | - Ronald De Groot
- Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud Institute of Molecular Life Sciences, Radboud UMC, Nijmegen, Netherlands
| | | | - Marko Pokorn
- Division of Paediatrics, University Medical Centre Ljubljana and Medical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Andrew J Pollard
- Oxford Vaccine Group Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, UK
| | - Luregn J Schlapbach
- Department of Intensive Care and Neonatology and Children's Research Center, University Children's Hospital Zurich, Zurich, Switzerland; Child Health Research Centre, The University of Queensland, Brisbane, NSW, Australia
| | - Maria N Tsolia
- Second Department of Paediatrics, National and Kapodistrian University of Athens (NKUA), School of Medicine, Panagiotis & Aglaia, Kyriakou Children's Hospital, Athens, Greece
| | - Effua Usuf
- Medical Research Council Unit The Gambia at the London School of Hygiene & Tropical Medicine, Fajara, Gambia
| | - Victoria J Wright
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Shunmay Yeung
- Clinical Research Department, Faculty of Infectious and Tropical Disease, London School of Hygiene & Tropical Medicine, London, UK
| | - Dace Zavadska
- Children's Clinical University Hospital, Rīga Stradins University, Rïga, Latvia
| | - Werner Zenz
- University Clinic of Paediatrics and Adolescent Medicine, Department of General Paediatrics, Medical University Graz, Graz, Austria
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Climent Casals-Pascual
- Department of Clinical Microbiology, CDB, Hospital Clínic of Barcelona, University of Barcelona, Barcelona, Spain
| | - Aubrey J Cunnington
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Federico Martinon-Torres
- Translational Pediatrics and Infectious Diseases Section, Pediatrics Department, Universidade de Santiago de Compostela (USC), Santiago de Compostela, Spain; Genetics, Vaccines, Infectious Diseases, and Pediatrics research group GENVIP, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela (USC), Santiago de Compostela, Spain; Consorcio Centro de Investigación Biomédica en Red de Enfermedades Respiratorias (CIBERES), Instituto de Salud Carlos III, Madrid, Spain
| | - Jethro A Herberg
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Marien I de Jonge
- Translational Metabolic Laboratory, Department of Laboratory Medicine, Radboud Institute of Molecular Life Sciences, Radboud UMC, Nijmegen, Netherlands; Laboratory of Medical Immunology, Department of Laboratory Medicine, Radboud Institute of Molecular Life Sciences, Radboud UMC, Nijmegen, Netherlands
| | - Michael Levin
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK
| | - Taco W Kuijpers
- Sanquin Research and Landsteiner Laboratory, Department of Immunopathology, Sanquin Blood Supply, Amsterdam University Medical Center (UMC), Amsterdam, Netherlands; Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam University Medical Center (UMC), Amsterdam, Netherlands
| | - Myrsini Kaforou
- Section of Paediatric Infectious Disease, Faculty of Medicine, and Centre for Paediatrics and Child Health, Imperial College London, London, UK.
| |
Collapse
|
4
|
Li F, Wang C, Guo X, Akutsu T, Webb GI, Coin LJM, Kurgan L, Song J. ProsperousPlus: a one-stop and comprehensive platform for accurate protease-specific substrate cleavage prediction and machine-learning model construction. Brief Bioinform 2023; 24:bbad372. [PMID: 37874948 DOI: 10.1093/bib/bbad372] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2023] [Revised: 08/30/2023] [Accepted: 09/29/2023] [Indexed: 10/26/2023] Open
Abstract
Proteases contribute to a broad spectrum of cellular functions. Given a relatively limited amount of experimental data, developing accurate sequence-based predictors of substrate cleavage sites facilitates a better understanding of protease functions and substrate specificity. While many protease-specific predictors of substrate cleavage sites were developed, these efforts are outpaced by the growth of the protease substrate cleavage data. In particular, since data for 100+ protease types are available and this number continues to grow, it becomes impractical to publish predictors for new protease types, and instead it might be better to provide a computational platform that helps users to quickly and efficiently build predictors that address their specific needs. To this end, we conceptualized, developed, tested and released a versatile bioinformatics platform, ProsperousPlus, that empowers users, even those with no programming or little bioinformatics background, to build fast and accurate predictors of substrate cleavage sites. ProsperousPlus facilitates the use of the rapidly accumulating substrate cleavage data to train, empirically assess and deploy predictive models for user-selected substrate types. Benchmarking tests on test datasets show that our platform produces predictors that on average exceed the predictive performance of current state-of-the-art approaches. ProsperousPlus is available as a webserver and a stand-alone software package at http://prosperousplus.unimelb-biotools.cloud.edu.au/.
Collapse
Affiliation(s)
- Fuyi Li
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
- South Australian immunoGENomics Cancer Institute (SAiGENCI), Faculty of Health and Medical Sciences, The University of Adelaide, Adelaide, SA 5005, Australia
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, VIC 3000, Australia
| | - Cong Wang
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
| | - Xudong Guo
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
| | - Tatsuya Akutsu
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Kyoto 611-0011, Japan
| | - Geoffrey I Webb
- Monash Data Futures Institute, Monash University, VIC 3800, Australia
| | - Lachlan J M Coin
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, VIC 3000, Australia
| | - Lukasz Kurgan
- Department of Computer Science, Virginia Commonwealth University, Richmond, VA 23284, USA
| | - Jiangning Song
- Monash Data Futures Institute, Monash University, VIC 3800, Australia
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, VIC 3800, Australia
| |
Collapse
|
5
|
Habgood-Coote D, Wilson C, Shimizu C, Barendregt AM, Philipsen R, Galassini R, Calle IR, Workman L, Agyeman PKA, Ferwerda G, Anderson ST, van den Berg JM, Emonts M, Carrol ED, Fink CG, de Groot R, Hibberd ML, Kanegaye J, Nicol MP, Paulus S, Pollard AJ, Salas A, Secka F, Schlapbach LJ, Tremoulet AH, Walther M, Zenz W, Van der Flier M, Zar HJ, Kuijpers T, Burns JC, Martinón-Torres F, Wright VJ, Coin LJM, Cunnington AJ, Herberg JA, Levin M, Kaforou M. Diagnosis of childhood febrile illness using a multi-class blood RNA molecular signature. Med 2023; 4:635-654.e5. [PMID: 37597512 DOI: 10.1016/j.medj.2023.06.007] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/15/2021] [Revised: 06/08/2023] [Accepted: 06/19/2023] [Indexed: 08/21/2023]
Abstract
BACKGROUND Appropriate treatment and management of children presenting with fever depend on accurate and timely diagnosis, but current diagnostic tests lack sensitivity and specificity and are frequently too slow to inform initial treatment. As an alternative to pathogen detection, host gene expression signatures in blood have shown promise in discriminating several infectious and inflammatory diseases in a dichotomous manner. However, differential diagnosis requires simultaneous consideration of multiple diseases. Here, we show that diverse infectious and inflammatory diseases can be discriminated by the expression levels of a single panel of genes in blood. METHODS A multi-class supervised machine-learning approach, incorporating clinical consequence of misdiagnosis as a "cost" weighting, was applied to a whole-blood transcriptomic microarray dataset, incorporating 12 publicly available datasets, including 1,212 children with 18 infectious or inflammatory diseases. The transcriptional panel identified was further validated in a new RNA sequencing dataset comprising 411 febrile children. FINDINGS We identified 161 transcripts that classified patients into 18 disease categories, reflecting individual causative pathogen and specific disease, as well as reliable prediction of broad classes comprising bacterial infection, viral infection, malaria, tuberculosis, or inflammatory disease. The transcriptional panel was validated in an independent cohort and benchmarked against existing dichotomous RNA signatures. CONCLUSIONS Our data suggest that classification of febrile illness can be achieved with a single blood sample and opens the way for a new approach for clinical diagnosis. FUNDING European Union's Seventh Framework no. 279185; Horizon2020 no. 668303 PERFORM; Wellcome Trust (206508/Z/17/Z); Medical Research Foundation (MRF-160-0008-ELP-KAFO-C0801); NIHR Imperial BRC.
Collapse
Affiliation(s)
- Dominic Habgood-Coote
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK
| | - Clare Wilson
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK
| | - Chisato Shimizu
- Department of Pediatrics, Rady Children's Hospital San Diego/University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Anouk M Barendregt
- Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam University Medical Center (AUMC), University of Amsterdam, Amsterdam, the Netherlands
| | - Ria Philipsen
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Department of Laboratory Medicine, Nijmegen, the Netherlands
| | - Rachel Galassini
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK
| | - Irene Rivero Calle
- Pediatrics Department, Translational Pediatrics and Infectious Diseases Section, Santiago de Compostela, Spain; Genetics- Vaccines- Infectious Diseases and Pediatrics Research Group GENVIP, Instituto de Investigación Sanitaria de Santiago (IDIS), Santiago de Compostela, Spain
| | - Lesley Workman
- Department of Paediatrics & Child Health, Red Cross Childrens Hospital and SA-MRC Unit on Child & Adolescent Health, University of Cape Town, Cape Town, South Africa
| | - Philipp K A Agyeman
- Department of Pediatrics, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Gerben Ferwerda
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Department of Laboratory Medicine, Nijmegen, the Netherlands
| | - Suzanne T Anderson
- Medical Research Council Unit, Fajara, The Gambia at the London School of Hygiene and Tropical Medicine, MRCG at LSHTM Fajara, Banjul, The Gambia
| | - J Merlijn van den Berg
- Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam University Medical Center (AUMC), University of Amsterdam, Amsterdam, the Netherlands
| | - Marieke Emonts
- Great North Children's Hospital, Department of Paediatric Immunology, Infectious Diseases & Allergy and NIHR Newcastle Biomedical Research Centre, Newcastle Upon Tyne Hospitals NHS Foundation Trust, Newcastle Upon Tyne, UK; Translational and Clinical Research Institute, Newcastle University, Newcastle Upon Tyne, UK
| | - Enitan D Carrol
- Department of Clinical Infection, Microbiology and Immunology, University of Liverpool Institute of Infection, Veterinary and Ecological Sciences, Liverpool, UK
| | - Colin G Fink
- Micropathology Ltd Research and Diagnosis, Coventry, UK; University of Warwick, Coventry, UK
| | - Ronald de Groot
- Radboud Institute for Molecular Life Sciences, Radboud University Medical Center, Department of Laboratory Medicine, Nijmegen, the Netherlands
| | - Martin L Hibberd
- Department of Infection Biology, Faculty of Infectious and Tropical Disease, London School of Hygiene and Tropical Medicine, London, UK
| | - John Kanegaye
- Department of Pediatrics, Rady Children's Hospital San Diego/University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Mark P Nicol
- Marshall Centre, School of Biomedical Sciences, University of Western Australia, Perth, Australia
| | - Stéphane Paulus
- Department of Clinical Infection, Microbiology and Immunology, University of Liverpool Institute of Infection, Veterinary and Ecological Sciences, Liverpool, UK; Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, UK
| | - Andrew J Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, UK
| | - Antonio Salas
- Pediatrics Department, Translational Pediatrics and Infectious Diseases Section, Santiago de Compostela, Spain; Genetics- Vaccines- Infectious Diseases and Pediatrics Research Group GENVIP, Instituto de Investigación Sanitaria de Santiago (IDIS), Santiago de Compostela, Spain; Unidade de Xenética, Instituto de Ciencias Forenses (INCIFOR), Facultade de Medicina, Universidade de Santiago de Compostela, and GenPoB Research Group, Instituto de Investigación Sanitaria (IDIS), Hospital Clínico Universitario de Santiago (SERGAS), 15706 Galicia, Spain
| | - Fatou Secka
- Medical Research Council Unit, Fajara, The Gambia at the London School of Hygiene and Tropical Medicine, MRCG at LSHTM Fajara, Banjul, The Gambia
| | - Luregn J Schlapbach
- Pediatric and Neonatal Intensive Care Unit, and Children`s Research Center, University Children's Hospital Zurich, Zurich, Switzerland; Child Health Research Centre, The University of Queensland, and Paediatric Intensive Care Unit, Queensland Children's Hospital, Brisbane, QLD, Australia
| | - Adriana H Tremoulet
- Department of Pediatrics, Rady Children's Hospital San Diego/University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Michael Walther
- Medical Research Council Unit, Fajara, The Gambia at the London School of Hygiene and Tropical Medicine, MRCG at LSHTM Fajara, Banjul, The Gambia
| | - Werner Zenz
- University Clinic of Paediatrics and Adolescent Medicine, Department of General Paediatrics, Medical University of Graz, Graz, Austria
| | - Michiel Van der Flier
- Paediatric Infectious Diseases and Immunology, Wilhelmina Children's Hospital, University Medical Center Utrecht, Utrecht, the Netherlands; Paediatric Infectious Diseases and Immunology Amalia Children's Hospital, Radboudumc, Nijmegen, the Netherlands
| | - Heather J Zar
- Department of Paediatrics & Child Health, Red Cross Childrens Hospital and SA-MRC Unit on Child & Adolescent Health, University of Cape Town, Cape Town, South Africa
| | - Taco Kuijpers
- Department of Pediatric Immunology, Rheumatology and Infectious Diseases, Emma Children's Hospital, Amsterdam University Medical Center (AUMC), University of Amsterdam, Amsterdam, the Netherlands; Department of Blood Cell Research, Sanquin Blood Supply, Division Research and Landsteiner Laboratory of Amsterdam UMC (AUMC), University of Amsterdam, Amsterdam, the Netherlands
| | - Jane C Burns
- Department of Pediatrics, Rady Children's Hospital San Diego/University of California San Diego School of Medicine, La Jolla, CA, USA
| | - Federico Martinón-Torres
- Pediatrics Department, Translational Pediatrics and Infectious Diseases Section, Santiago de Compostela, Spain; Genetics- Vaccines- Infectious Diseases and Pediatrics Research Group GENVIP, Instituto de Investigación Sanitaria de Santiago (IDIS), Santiago de Compostela, Spain
| | - Victoria J Wright
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Aubrey J Cunnington
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK
| | - Jethro A Herberg
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK
| | - Michael Levin
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK
| | - Myrsini Kaforou
- Section of Paediatric Infectious Disease and Centre for Paediatrics & Child Health, Department of Infectious Disease, Imperial College London, London, UK.
| |
Collapse
|
6
|
Hall MB, Lima L, Coin LJM, Iqbal Z. Drug resistance prediction for Mycobacterium tuberculosis with reference graphs. Microb Genom 2023; 9:mgen001081. [PMID: 37552534 PMCID: PMC10483414 DOI: 10.1099/mgen.0.001081] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2023] [Accepted: 07/14/2023] [Indexed: 08/09/2023] Open
Abstract
Tuberculosis is a global pandemic disease with a rising burden of antimicrobial resistance. As a result, the World Health Organization (WHO) has a goal of enabling universal access to drug susceptibility testing (DST). Given the slowness of and infrastructure requirements for phenotypic DST, whole-genome sequencing, followed by genotype-based prediction of DST, now provides a route to achieving this. Since a central component of genotypic DST is to detect the presence of any known resistance-causing mutations, a natural approach is to use a reference graph that allows encoding of known variation. We have developed DrPRG (Drug resistance Prediction with Reference Graphs) using the bacterial reference graph method Pandora. First, we outline the construction of a Mycobacterium tuberculosis drug resistance reference graph. The graph is built from a global dataset of isolates with varying drug susceptibility profiles, thus capturing common and rare resistance- and susceptible-associated haplotypes. We benchmark DrPRG against the existing graph-based tool Mykrobe and the haplotype-based approach of TBProfiler using 44 709 and 138 publicly available Illumina and Nanopore samples with associated phenotypes. We find that DrPRG has significantly improved sensitivity and specificity for some drugs compared to these tools, with no significant decreases. It uses significantly less computational memory than both tools, and provides significantly faster runtimes, except when runtime is compared to Mykrobe with Nanopore data. We discover and discuss novel insights into resistance-conferring variation for M. tuberculosis - including deletion of genes katG and pncA - and suggest mutations that may warrant reclassification as associated with resistance.
Collapse
Affiliation(s)
- Michael B. Hall
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, UK
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Australia
| | - Leandro Lima
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, UK
| | - Lachlan J. M. Coin
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Australia
| | - Zamin Iqbal
- European Molecular Biology Laboratory, European Bioinformatics Institute, Hinxton, Cambridgeshire, UK
| |
Collapse
|
7
|
Jackson HR, Miglietta L, Habgood-Coote D, D’Souza G, Shah P, Nichols S, Vito O, Powell O, Davidson MS, Shimizu C, Agyeman PKA, Beudeker CR, Brengel-Pesce K, Carrol ED, Carter MJ, De T, Eleftheriou I, Emonts M, Epalza C, Georgiou P, De Groot R, Fidler K, Fink C, van Keulen D, Kuijpers T, Moll H, Papatheodorou I, Paulus S, Pokorn M, Pollard AJ, Rivero-Calle I, Rojo P, Secka F, Schlapbach LJ, Tremoulet AH, Tsolia M, Usuf E, Van Der Flier M, Von Both U, Vermont C, Yeung S, Zavadska D, Zenz W, Coin LJM, Cunnington A, Burns JC, Wright V, Martinon-Torres F, Herberg JA, Rodriguez-Manzano J, Kaforou M, Levin M. Diagnosis of Multisystem Inflammatory Syndrome in Children by a Whole-Blood Transcriptional Signature. J Pediatric Infect Dis Soc 2023; 12:322-331. [PMID: 37255317 PMCID: PMC10312302 DOI: 10.1093/jpids/piad035] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 11/09/2022] [Accepted: 05/30/2023] [Indexed: 06/01/2023]
Abstract
BACKGROUND To identify a diagnostic blood transcriptomic signature that distinguishes multisystem inflammatory syndrome in children (MIS-C) from Kawasaki disease (KD), bacterial infections, and viral infections. METHODS Children presenting with MIS-C to participating hospitals in the United Kingdom and the European Union between April 2020 and April 2021 were prospectively recruited. Whole-blood RNA Sequencing was performed, contrasting the transcriptomes of children with MIS-C (n = 38) to those from children with KD (n = 136), definite bacterial (DB; n = 188) and viral infections (DV; n = 138). Genes significantly differentially expressed (SDE) between MIS-C and comparator groups were identified. Feature selection was used to identify genes that optimally distinguish MIS-C from other diseases, which were subsequently translated into RT-qPCR assays and evaluated in an independent validation set comprising MIS-C (n = 37), KD (n = 19), DB (n = 56), DV (n = 43), and COVID-19 (n = 39). RESULTS In the discovery set, 5696 genes were SDE between MIS-C and combined comparator disease groups. Five genes were identified as potential MIS-C diagnostic biomarkers (HSPBAP1, VPS37C, TGFB1, MX2, and TRBV11-2), achieving an AUC of 96.8% (95% CI: 94.6%-98.9%) in the discovery set, and were translated into RT-qPCR assays. The RT-qPCR 5-gene signature achieved an AUC of 93.2% (95% CI: 88.3%-97.7%) in the independent validation set when distinguishing MIS-C from KD, DB, and DV. CONCLUSIONS MIS-C can be distinguished from KD, DB, and DV groups using a 5-gene blood RNA expression signature. The small number of genes in the signature and good performance in both discovery and validation sets should enable the development of a diagnostic test for MIS-C.
Collapse
Affiliation(s)
- Heather R Jackson
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Luca Miglietta
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Department of Electrical and Electronic Engineering, Faculty of Engineering, Imperial College London, London, UK
| | - Dominic Habgood-Coote
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Giselle D’Souza
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Priyen Shah
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Samuel Nichols
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Ortensia Vito
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Oliver Powell
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Maisey Salina Davidson
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Chisato Shimizu
- Department of Pediatrics, Rady Children’s Hospital and University of California San Diego, La Jolla, California, USA
| | - Philipp K A Agyeman
- Department of Pediatrics, Inselspital, Bern University Hospital, University of Bern, Bern, Switzerland
| | - Coco R Beudeker
- Department of Paediatric Infectious Diseases and Immunology, Wilhelmina Children’s Hospital, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Karen Brengel-Pesce
- Joint Research Unit Hospices Civils de Lyon-bioMérieux, Lyon Sud Hospital, Pierre-Bénite, France
| | - Enitan D Carrol
- Department of Clinical Infection Microbiology and Immunology, University of Liverpool Institute of Infection, Veterinary and Ecological Sciences, Liverpool, UK
| | - Michael J Carter
- Paediatric Intensive Care, Evelina London Children’s Hospital, Guy’s and St Thomas’ NHS Foundation Trust, London, UK
- Department of Women and Children’s Health, School of Life Course Sciences, King’s College London, St Thomas’ Hospital, London, UK
| | - Tisham De
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Irini Eleftheriou
- Second Department of Paediatrics, National and Kapodistrian University of Athens (NKUA), School of Medicine, P. and A. Kyriakou Children’s Hospital, Athens, Greece
| | - Marieke Emonts
- Translational and Clinical Research Institute, Newcastle University, Newcastle upon Tyne, UK
- Paediatric Infectious Diseases and Immunology Department, Newcastle upon Tyne Hospitals Foundation Trust, Great North Children’s Hospital, Newcastle upon Tyne, UK
- NIHR Newcastle Biomedical Research Centre, Newcastle upon Tyne Hospitals NHS Trust and Newcastle University, Newcastle upon Tyne, UK
| | - Cristina Epalza
- Pediatric Infectious Diseases Unit, Pediatric Department, Hospital Doce de Octubre, Madrid, Spain
| | - Pantelis Georgiou
- Department of Electrical and Electronic Engineering, Faculty of Engineering, Imperial College London, London, UK
| | - Ronald De Groot
- Department of Pediatrics, Division of Pediatric Infectious Diseases and Immunology and Laboratory of Infectious Diseases, Radboud Institute of Molecular Life Sciences, Radboudumc, Nijmegen, The Netherlands
| | - Katy Fidler
- Academic Department of Paediatrics, Royal Alexandra Children’s Hospital, University Hospitals Sussex, Brighton, UK
| | - Colin Fink
- Micropathology Ltd., University of Warwick, Warwick, UK
| | | | - Taco Kuijpers
- Department of Pediatric Immunology, Rheumatology, and Infectious Diseases, Emma Children’s Hospital, Amsterdam University Medical Centre, Amsterdam, The Netherlands
- Sanquin Research, Department of Blood Cell Research, and Landsteiner Laboratory, Amsterdam University Medical Centre, Amsterdam, The Netherlands
| | - Henriette Moll
- Department of Pediatrics, Erasmus MC Sophia Children’s Hospital, Rotterdam, The Netherlands
| | - Irene Papatheodorou
- Gene Expression Team, European Molecular Biology Laboratory, EMBL-European Bioinformatics Institute (EMBL-EBI), Hinxton, Cambridge, UK
| | - Stephane Paulus
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, UK
| | - Marko Pokorn
- Division of Pediatrics, University Medical Centre Ljubljana and Medical Faculty, University of Ljubljana, Ljubljana, Slovenia
| | - Andrew J Pollard
- Oxford Vaccine Group, Department of Paediatrics, University of Oxford and the NIHR Oxford Biomedical Research Centre, Oxford, UK
| | - Irene Rivero-Calle
- Pediatrics Department, Translational Pediatrics and Infectious Diseases Section, Santiago de Compostela, Spain
- Genetics–Vaccines–Infectious Diseases and Pediatrics Research Group GENVIP, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela (USC), Santiago de Compostela, Spain
- Unidade de Xenética, Departamento de Anatomía Patolóxica e Ciencias Forenses, Instituto de Ciencias Forenses, Facultade de Medicina, Universidade de Santiago de Compostela, Galicia, Spain
- GenPoB Research Group, Instituto de Investigaciones Sanitarias (IDIS), Hospital Clínico Universitario de Santiago (SERGAS), Galicia, Spain
| | - Pablo Rojo
- Pediatric Infectious Diseases Unit, Pediatric Department, Hospital Doce de Octubre, Madrid, Spain
| | - Fatou Secka
- Medical Research Council Unit, The Gambia at the London School of Hygiene and Tropical Medicine, Banjul, Gambia
| | - Luregn J Schlapbach
- Department of Intensive Care and Neonatology, and Children’s Research Center, University Children`s Hospital Zurich, Zurich, Switzerland
- Child Health Research Centre, The University of Queensland, Brisbane, Queensland, Australia
| | - Adriana H Tremoulet
- Department of Pediatrics, Rady Children’s Hospital and University of California San Diego, La Jolla, California, USA
| | - Maria Tsolia
- Second Department of Paediatrics, National and Kapodistrian University of Athens (NKUA), School of Medicine, P. and A. Kyriakou Children’s Hospital, Athens, Greece
| | - Effua Usuf
- Medical Research Council Unit, The Gambia at the London School of Hygiene and Tropical Medicine, Banjul, Gambia
| | - Michiel Van Der Flier
- Department of Paediatric Infectious Diseases and Immunology, Wilhelmina Children’s Hospital, University Medical Centre Utrecht, Utrecht, The Netherlands
| | - Ulrich Von Both
- Division of Pediatric Infectious Diseases, Department of Pediatrics, Dr von Hauner Children’s Hospital, University Hospital, LMU Munich, Munich, Germany
| | - Clementien Vermont
- Department of Paediatric Infectious Diseases and Immunology, Erasmus MC Sophia Children’s Hospital, Rotterdam, The Netherlands
| | - Shunmay Yeung
- Clinical Research Department, Faculty of Infectious and Tropical Disease, London School of Hygiene and Tropical Medicine, London, UK
| | - Dace Zavadska
- Department of Pediatrics, Children’s Clinical University Hospital, Rīga, Latvia
| | - Werner Zenz
- Department of General Paediatrics, University Clinic of Paediatrics and Adolescent Medicine, Medical University Graz, Austria
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, Australia
| | - Aubrey Cunnington
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Jane C Burns
- Department of Pediatrics, Rady Children’s Hospital and University of California San Diego, La Jolla, California, USA
| | - Victoria Wright
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Federico Martinon-Torres
- Pediatrics Department, Translational Pediatrics and Infectious Diseases Section, Santiago de Compostela, Spain
- Genetics–Vaccines–Infectious Diseases and Pediatrics Research Group GENVIP, Instituto de Investigación Sanitaria de Santiago (IDIS), Universidade de Santiago de Compostela (USC), Santiago de Compostela, Spain
- Unidade de Xenética, Departamento de Anatomía Patolóxica e Ciencias Forenses, Instituto de Ciencias Forenses, Facultade de Medicina, Universidade de Santiago de Compostela, Galicia, Spain
- GenPoB Research Group, Instituto de Investigaciones Sanitarias (IDIS), Hospital Clínico Universitario de Santiago (SERGAS), Galicia, Spain
| | - Jethro A Herberg
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | | | - Myrsini Kaforou
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| | - Michael Levin
- Department of Infectious Disease, Faculty of Medicine, Imperial College London, London, UK
- Centre for Paediatrics and Child Health, Imperial College London, London, SW7 2AZ, UK
| |
Collapse
|
8
|
Zhu Y, Li F, Guo X, Wang X, Coin LJM, Webb GI, Song J, Jia C. TIMER is a Siamese neural network-based framework for identifying both general and species-specific bacterial promoters. Brief Bioinform 2023:7192238. [PMID: 37291763 DOI: 10.1093/bib/bbad209] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/20/2023] [Revised: 05/04/2023] [Accepted: 05/20/2023] [Indexed: 06/10/2023] Open
Abstract
BACKGROUND Promoters are DNA regions that initiate the transcription of specific genes near the transcription start sites. In bacteria, promoters are recognized by RNA polymerases and associated sigma factors. Effective promoter recognition is essential for synthesizing the gene-encoded products by bacteria to grow and adapt to different environmental conditions. A variety of machine learning-based predictors for bacterial promoters have been developed; however, most of them were designed specifically for a particular species. To date, only a few predictors are available for identifying general bacterial promoters with limited predictive performance. RESULTS In this study, we developed TIMER, a Siamese neural network-based approach for identifying both general and species-specific bacterial promoters. Specifically, TIMER uses DNA sequences as the input and employs three Siamese neural networks with the attention layers to train and optimize the models for a total of 13 species-specific and general bacterial promoters. Extensive 10-fold cross-validation and independent tests demonstrated that TIMER achieves a competitive performance and outperforms several existing methods on both general and species-specific promoter prediction. As an implementation of the proposed method, the web server of TIMER is publicly accessible at http://web.unimelb-bioinfortools.cloud.edu.au/TIMER/.
Collapse
Affiliation(s)
- Yan Zhu
- School of Science, Dalian Maritime University, Dalian 116026, China
| | - Fuyi Li
- College of Information Engineering, Northwest A&F University, Yangling 712100, China
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Victoria, Australia
| | - Xudong Guo
- College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Xiaoyu Wang
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, Victoria, Australia
| | - Geoffrey I Webb
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Jiangning Song
- Monash Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Cangzhi Jia
- School of Science, Dalian Maritime University, Dalian 116026, China
| |
Collapse
|
9
|
Chen R, Li F, Guo X, Bi Y, Li C, Pan S, Coin LJM, Song J. ATTIC is an integrated approach for predicting A-to-I RNA editing sites in three species. Brief Bioinform 2023; 24:bbad170. [PMID: 37150785 PMCID: PMC10565902 DOI: 10.1093/bib/bbad170] [Citation(s) in RCA: 6] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 04/12/2023] [Accepted: 04/14/2023] [Indexed: 05/09/2023] Open
Abstract
A-to-I editing is the most prevalent RNA editing event, which refers to the change of adenosine (A) bases to inosine (I) bases in double-stranded RNAs. Several studies have revealed that A-to-I editing can regulate cellular processes and is associated with various human diseases. Therefore, accurate identification of A-to-I editing sites is crucial for understanding RNA-level (i.e. transcriptional) modifications and their potential roles in molecular functions. To date, various computational approaches for A-to-I editing site identification have been developed; however, their performance is still unsatisfactory and needs further improvement. In this study, we developed a novel stacked-ensemble learning model, ATTIC (A-To-I ediTing predICtor), to accurately identify A-to-I editing sites across three species, including Homo sapiens, Mus musculus and Drosophila melanogaster. We first comprehensively evaluated 37 RNA sequence-derived features combined with 14 popular machine learning algorithms. Then, we selected the optimal base models to build a series of stacked ensemble models. The final ATTIC framework was developed based on the optimal models improved by the feature selection strategy for specific species. Extensive cross-validation and independent tests illustrate that ATTIC outperforms state-of-the-art tools for predicting A-to-I editing sites. We also developed a web server for ATTIC, which is publicly available at http://web.unimelb-bioinfortools.cloud.edu.au/ATTIC/. We anticipate that ATTIC can be utilized as a useful tool to accelerate the identification of A-to-I RNA editing events and help characterize their roles in post-transcriptional regulation.
Collapse
Affiliation(s)
- Ruyi Chen
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, VIC 3000, Australia
| | - Fuyi Li
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, VIC 3000, Australia
| | - Xudong Guo
- College of Information Engineering, Northwest A&F University, Shaanxi 712100, China
| | - Yue Bi
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, VIC 3800, Australia
| | - Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, VIC 3800, Australia
| | - Shirui Pan
- School of Information and Communication Technology, Griffith University, QLD 4222, Australia
| | - Lachlan J M Coin
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, VIC 3000, Australia
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, VIC 3800, Australia
- Monash Data Futures Institute, Monash University, VIC 3800, Australia
| |
Collapse
|
10
|
Davies MR, Keller N, Brouwer S, Jespersen MG, Cork AJ, Hayes AJ, Pitt ME, De Oliveira DMP, Harbison-Price N, Bertolla OM, Mediati DG, Curren BF, Taiaroa G, Lacey JA, Smith HV, Fang NX, Coin LJM, Stevens K, Tong SYC, Sanderson-Smith M, Tree JJ, Irwin AD, Grimwood K, Howden BP, Jennison AV, Walker MJ. Detection of Streptococcus pyogenes M1 UK in Australia and characterization of the mutation driving enhanced expression of superantigen SpeA. Nat Commun 2023; 14:1051. [PMID: 36828918 PMCID: PMC9951164 DOI: 10.1038/s41467-023-36717-4] [Citation(s) in RCA: 23] [Impact Index Per Article: 23.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/21/2022] [Accepted: 02/13/2023] [Indexed: 02/26/2023] Open
Abstract
A new variant of Streptococcus pyogenes serotype M1 (designated 'M1UK') has been reported in the United Kingdom, linked with seasonal scarlet fever surges, marked increase in invasive infections, and exhibiting enhanced expression of the superantigen SpeA. The progenitor S. pyogenes 'M1global' and M1UK clones can be differentiated by 27 SNPs and 4 indels, yet the mechanism for speA upregulation is unknown. Here we investigate the previously unappreciated expansion of M1UK in Australia, now isolated from the majority of serious infections caused by serotype M1 S. pyogenes. M1UK sub-lineages circulating in Australia also contain a novel toxin repertoire associated with epidemic scarlet fever causing S. pyogenes in Asia. A single SNP in the 5' transcriptional leader sequence of the transfer-messenger RNA gene ssrA drives enhanced SpeA superantigen expression as a result of ssrA terminator read-through in the M1UK lineage. This represents a previously unappreciated mechanism of toxin expression and urges enhanced international surveillance.
Collapse
Affiliation(s)
- Mark R Davies
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia.
| | - Nadia Keller
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Stephan Brouwer
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Magnus G Jespersen
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Amanda J Cork
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Andrew J Hayes
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Miranda E Pitt
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - David M P De Oliveira
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Nichaela Harbison-Price
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Olivia M Bertolla
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Daniel G Mediati
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Bodie F Curren
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - George Taiaroa
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Jake A Lacey
- Department of Infectious Diseases, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Helen V Smith
- Public Health Microbiology, Queensland Health Forensic and Scientific Services, Queensland Health, Coopers Plains, QLD, Australia
| | - Ning-Xia Fang
- Public Health Microbiology, Queensland Health Forensic and Scientific Services, Queensland Health, Coopers Plains, QLD, Australia
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Kerrie Stevens
- Microbiological Diagnostic Unit Public Health Laboratory, The Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Steven Y C Tong
- Department of Infectious Diseases, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia.,Victorian Infectious Diseases Service, The Royal Melbourne Hospital, at the Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Martina Sanderson-Smith
- Illawarra Health and Medical Research Institute and Molecular Horizons, School of Chemistry and Molecular Bioscience, University of Wollongong, Wollongong, NSW, Australia
| | - Jai J Tree
- School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
| | - Adam D Irwin
- University of Queensland Centre for Clinical Research, Brisbane, QLD, Australia.,Queensland Children's Hospital, Brisbane, QLD, Australia
| | - Keith Grimwood
- School of Medicine and Dentistry and Menzies Health Institute Queensland, Griffith University, Gold Coast, QLD, Australia.,Departments of Infectious Diseases and Paediatrics, Gold Coast Health, Gold Coast, QLD, Australia
| | - Benjamin P Howden
- Microbiological Diagnostic Unit Public Health Laboratory, The Department of Microbiology and Immunology, The University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Amy V Jennison
- Public Health Microbiology, Queensland Health Forensic and Scientific Services, Queensland Health, Coopers Plains, QLD, Australia
| | - Mark J Walker
- Australian Infectious Diseases Research Centre and School of Chemistry and Molecular Biosciences and Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia.
| |
Collapse
|
11
|
Liu Q, Fang H, Wang X, Wang M, Li S, Coin LJM, Li F, Song J. DeepGenGrep: a general deep learning-based predictor for multiple genomic signals and regions. Bioinformatics 2022; 38:4053-4061. [PMID: 35799358 DOI: 10.1093/bioinformatics/btac454] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2022] [Revised: 04/11/2022] [Accepted: 07/06/2022] [Indexed: 12/24/2022] Open
Abstract
MOTIVATION Accurate annotation of different genomic signals and regions (GSRs) from DNA sequences is fundamentally important for understanding gene structure, regulation and function. Numerous efforts have been made to develop machine learning-based predictors for in silico identification of GSRs. However, it remains a great challenge to identify GSRs as the performance of most existing approaches is unsatisfactory. As such, it is highly desirable to develop more accurate computational methods for GSRs prediction. RESULTS In this study, we propose a general deep learning framework termed DeepGenGrep, a general predictor for the systematic identification of multiple different GSRs from genomic DNA sequences. DeepGenGrep leverages the power of hybrid neural networks comprising a three-layer convolutional neural network and a two-layer long short-term memory to effectively learn useful feature representations from sequences. Benchmarking experiments demonstrate that DeepGenGrep outperforms several state-of-the-art approaches on identifying polyadenylation signals, translation initiation sites and splice sites across four eukaryotic species including Homo sapiens, Mus musculus, Bos taurus and Drosophila melanogaster. Overall, DeepGenGrep represents a useful tool for the high-throughput and cost-effective identification of potential GSRs in eukaryotic genomes. AVAILABILITY AND IMPLEMENTATION The webserver and source code are freely available at http://bigdata.biocie.cn/deepgengrep/home and Github (https://github.com/wx-cie/DeepGenGrep/). SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Quanzhong Liu
- Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Honglin Fang
- Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Xiao Wang
- Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Miao Wang
- Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Shuqin Li
- Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Fuyi Li
- Department of Software Engineering, College of Information Engineering, Northwest A&F University, Yangling 712100, China.,Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, Melbourne, VIC 3000, Australia
| | - Jiangning Song
- Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia.,Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| |
Collapse
|
12
|
Hall MB, Coin LJM. Assessment of the 2021 WHO Mycobacterium tuberculosis drug resistance mutation catalogue on an independent dataset. The Lancet Microbe 2022; 3:e645. [DOI: 10.1016/s2666-5247(22)00151-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2022] [Accepted: 05/18/2022] [Indexed: 11/25/2022] Open
|
13
|
Chang JJY, Gleeson J, Rawlinson D, De Paoli-Iseppi R, Zhou C, Mordant FL, Londrigan SL, Clark MB, Subbarao K, Stinear TP, Coin LJM, Pitt ME. Long-Read RNA Sequencing Identifies Polyadenylation Elongation and Differential Transcript Usage of Host Transcripts During SARS-CoV-2 In Vitro Infection. Front Immunol 2022; 13:832223. [PMID: 35464437 PMCID: PMC9019466 DOI: 10.3389/fimmu.2022.832223] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/09/2021] [Accepted: 03/14/2022] [Indexed: 12/04/2022] Open
Abstract
Better methods to interrogate host-pathogen interactions during Severe Acute Respiratory Syndrome Coronavirus 2 (SARS-CoV-2) infections are imperative to help understand and prevent this disease. Here we implemented RNA-sequencing (RNA-seq) using Oxford Nanopore Technologies (ONT) long-reads to measure differential host gene expression, transcript polyadenylation and isoform usage within various epithelial cell lines permissive and non-permissive for SARS-CoV-2 infection. SARS-CoV-2-infected and mock-infected Vero (African green monkey kidney epithelial cells), Calu-3 (human lung adenocarcinoma epithelial cells), Caco-2 (human colorectal adenocarcinoma epithelial cells) and A549 (human lung carcinoma epithelial cells) were analyzed over time (0, 2, 24, 48 hours). Differential polyadenylation was found to occur in both infected Calu-3 and Vero cells during a late time point (48 hpi), with Gene Ontology (GO) terms such as viral transcription and translation shown to be significantly enriched in Calu-3 data. Poly(A) tails showed increased lengths in the majority of the differentially polyadenylated transcripts in Calu-3 and Vero cell lines (up to ~101 nt in mean poly(A) length, padj = 0.029). Of these genes, ribosomal protein genes such as RPS4X and RPS6 also showed downregulation in expression levels, suggesting the importance of ribosomal protein genes during infection. Furthermore, differential transcript usage was identified in Caco-2, Calu-3 and Vero cells, including transcripts of genes such as GSDMB and KPNA2, which have previously been implicated in SARS-CoV-2 infections. Overall, these results highlight the potential role of differential polyadenylation and transcript usage in host immune response or viral manipulation of host mechanisms during infection, and therefore, showcase the value of long-read sequencing in identifying less-explored host responses to disease.
Collapse
Affiliation(s)
- Jessie J-Y Chang
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| | - Josie Gleeson
- Centre for Stem Cell Systems, Department of Anatomy and Physiology, University of Melbourne, Melbourne, VIC, Australia
| | - Daniel Rawlinson
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| | - Ricardo De Paoli-Iseppi
- Centre for Stem Cell Systems, Department of Anatomy and Physiology, University of Melbourne, Melbourne, VIC, Australia
| | - Chenxi Zhou
- Department of Clinical Pathology, University of Melbourne, Melbourne, VIC, Australia
| | - Francesca L Mordant
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| | - Sarah L Londrigan
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| | - Michael B Clark
- Centre for Stem Cell Systems, Department of Anatomy and Physiology, University of Melbourne, Melbourne, VIC, Australia
| | - Kanta Subbarao
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia.,World Health Organization (WHO) Collaborating Centre for Reference and Research on Influenza, Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia
| | - Timothy P Stinear
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia.,Department of Clinical Pathology, University of Melbourne, Melbourne, VIC, Australia.,Department of Infectious Disease, Imperial College London, London, United Kingdom.,Institute for Molecular Bioscience, University of Queensland, Brisbane, QLD, Australia
| | - Miranda E Pitt
- Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, VIC, Australia
| |
Collapse
|
14
|
Ganesamoorthy D, Robertson AJ, Chen W, Hall MB, Cao MD, Ferguson K, Lakhani SR, Nones K, Simpson PT, Coin LJM. Whole genome deep sequencing analysis of cell-free DNA in samples with low tumour content. BMC Cancer 2022; 22:85. [PMID: 35057759 PMCID: PMC8772083 DOI: 10.1186/s12885-021-09160-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2021] [Accepted: 12/27/2021] [Indexed: 12/03/2022] Open
Abstract
Background Circulating cell-free DNA (cfDNA) in the plasma of cancer patients contains cell-free tumour DNA (ctDNA) derived from tumour cells and it has been widely recognized as a non-invasive source of tumour DNA for diagnosis and prognosis of cancer. Molecular profiling of ctDNA is often performed using targeted sequencing or low-coverage whole genome sequencing (WGS) to identify tumour specific somatic mutations or somatic copy number aberrations (sCNAs). However, these approaches cannot efficiently detect all tumour-derived genomic changes in ctDNA. Methods We performed WGS analysis of cfDNA from 4 breast cancer patients and 2 patients with benign tumours. We sequenced matched germline DNA for all 6 patients and tumour samples from the breast cancer patients. All samples were sequenced on Illumina HiSeqXTen sequencing platform and achieved approximately 30x, 60x and 100x coverage on germline, tumour and plasma DNA samples, respectively. Results The mutational burden of the plasma samples (1.44 somatic mutations/Mb of genome) was higher than the matched tumour samples. However, 90% of high confidence somatic cfDNA variants were not detected in matched tumour samples and were found to comprise two background plasma mutational signatures. In contrast, cfDNA from the di-nucleosome fraction (300 bp–350 bp) had much higher proportion (30%) of variants shared with tumour. Despite high coverage sequencing we were unable to detect sCNAs in plasma samples. Conclusions Deep sequencing analysis of plasma samples revealed higher fraction of unique somatic mutations in plasma samples, which were not detected in matched tumour samples. Sequencing of di-nucleosome bound cfDNA fragments may increase recovery of tumour mutations from plasma. Supplementary Information The online version contains supplementary material available at 10.1186/s12885-021-09160-1.
Collapse
|
15
|
Zhang M, Jia C, Li F, Li C, Zhu Y, Akutsu T, Webb GI, Zou Q, Coin LJM, Song J. Critical assessment of computational tools for prokaryotic and eukaryotic promoter prediction. Brief Bioinform 2022; 23:6502561. [PMID: 35021193 PMCID: PMC8921625 DOI: 10.1093/bib/bbab551] [Citation(s) in RCA: 6] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/09/2021] [Revised: 11/12/2021] [Accepted: 11/30/2021] [Indexed: 01/13/2023] Open
Abstract
Promoters are crucial regulatory DNA regions for gene transcriptional activation. Rapid advances in next-generation sequencing technologies have accelerated the accumulation of genome sequences, providing increased training data to inform computational approaches for both prokaryotic and eukaryotic promoter prediction. However, it remains a significant challenge to accurately identify species-specific promoter sequences using computational approaches. To advance computational support for promoter prediction, in this study, we curated 58 comprehensive, up-to-date, benchmark datasets for 7 different species (i.e. Escherichia coli, Bacillus subtilis, Homo sapiens, Mus musculus, Arabidopsis thaliana, Zea mays and Drosophila melanogaster) to assist the research community to assess the relative functionality of alternative approaches and support future research on both prokaryotic and eukaryotic promoters. We revisited 106 predictors published since 2000 for promoter identification (40 for prokaryotic promoter, 61 for eukaryotic promoter, and 5 for both). We systematically evaluated their training datasets, computational methodologies, calculated features, performance and software usability. On the basis of these benchmark datasets, we benchmarked 19 predictors with functioning webservers/local tools and assessed their prediction performance. We found that deep learning and traditional machine learning-based approaches generally outperformed scoring function-based approaches. Taken together, the curated benchmark dataset repository and the benchmarking analysis in this study serve to inform the design and implementation of computational approaches for promoter prediction and facilitate more rigorous comparison of new techniques in the future.
Collapse
Affiliation(s)
| | - Cangzhi Jia
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| | | | | | | | | | - Geoffrey I Webb
- Department of Data Science and Artificial Intelligence, Monash University, Melbourne, VIC 3800, Australia,Monash Data Futures Institute, Monash University, Melbourne, VIC 3800, Australia
| | - Quan Zou
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| | - Lachlan J M Coin
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| | - Jiangning Song
- Corresponding authors: Jiangning Song, Biomedicine Discovery Institute and Department of Biochemistry and Molecular Biology, Monash University, Melbourne, VIC 3800, Australia. E-mail: ; Lachlan J.M. Coin, Department of Microbiology and Immunology, The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria 3000, Australia. E-mail: ; Quan Zou, Institute of Fundamental and Frontier Sciences, University of Electronic Science and Technology of China, Chengdu, China. E-mail: ; Cangzhi Jia, School of Science, Dalian Maritime University, Dalian 116026, China. E-mail:
| |
Collapse
|
16
|
Li F, Dong S, Leier A, Han M, Guo X, Xu J, Wang X, Pan S, Jia C, Zhang Y, Webb GI, Coin LJM, Li C, Song J. Positive-unlabeled learning in bioinformatics and computational biology: a brief review. Brief Bioinform 2021; 23:6415313. [PMID: 34729589 DOI: 10.1093/bib/bbab461] [Citation(s) in RCA: 7] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/30/2021] [Revised: 09/27/2021] [Accepted: 10/07/2021] [Indexed: 12/14/2022] Open
Abstract
Conventional supervised binary classification algorithms have been widely applied to address significant research questions using biological and biomedical data. This classification scheme requires two fully labeled classes of data (e.g. positive and negative samples) to train a classification model. However, in many bioinformatics applications, labeling data is laborious, and the negative samples might be potentially mislabeled due to the limited sensitivity of the experimental equipment. The positive unlabeled (PU) learning scheme was therefore proposed to enable the classifier to learn directly from limited positive samples and a large number of unlabeled samples (i.e. a mixture of positive or negative samples). To date, several PU learning algorithms have been developed to address various biological questions, such as sequence identification, functional site characterization and interaction prediction. In this paper, we revisit a collection of 29 state-of-the-art PU learning bioinformatic applications to address various biological questions. Various important aspects are extensively discussed, including PU learning methodology, biological application, classifier design and evaluation strategy. We also comment on the existing issues of PU learning and offer our perspectives for the future development of PU learning applications. We anticipate that our work serves as an instrumental guideline for a better understanding of the PU learning framework in bioinformatics and further developing next-generation PU learning frameworks for critical biological applications.
Collapse
Affiliation(s)
- Fuyi Li
- Monash University, Australia
| | | | - André Leier
- Department of Genetics, UAB School of Medicine, USA
| | - Meiya Han
- Department of Biochemistry and Molecular Biology, Monash University, Australia
| | | | - Jing Xu
- Computer Science and Technology from Nankai University, China
| | - Xiaoyu Wang
- Department of Biochemistry and Molecular Biology and Biomedicine Discovery Institute, Monash University, Australia
| | - Shirui Pan
- University of Technology Sydney (UTS), Ultimo, NSW, Australia
| | - Cangzhi Jia
- College of Science, Dalian Maritime University, Australia
| | - Yang Zhang
- Northwestern Polytechnical University, China
| | - Geoffrey I Webb
- Faculty of Information Technology at Monash University, Australia
| | - Lachlan J M Coin
- Department of Clinical Pathology, University of Melbourne, Australia
| | - Chen Li
- Biomedicine Discovery Institute and Department of Biochemistry of Molecular Biology, Monash University, Australia
| | - Jiangning Song
- Monash Biomedicine Discovery Institute, Monash University, Melbourne, Australia
| |
Collapse
|
17
|
Li F, Guo X, Jin P, Chen J, Xiang D, Song J, Coin LJM. Porpoise: a new approach for accurate prediction of RNA pseudouridine sites. Brief Bioinform 2021; 22:6314697. [PMID: 34226915 DOI: 10.1093/bib/bbab245] [Citation(s) in RCA: 16] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2021] [Revised: 05/19/2021] [Accepted: 06/08/2021] [Indexed: 12/14/2022] Open
Abstract
Pseudouridine is a ubiquitous RNA modification type present in eukaryotes and prokaryotes, which plays a vital role in various biological processes. Almost all kinds of RNAs are subject to this modification. However, it remains a great challenge to identify pseudouridine sites via experimental approaches, requiring expensive and time-consuming experimental research. Therefore, computational approaches that can be used to perform accurate in silico identification of pseudouridine sites from the large amount of RNA sequence data are highly desirable and can aid in the functional elucidation of this critical modification. Here, we propose a new computational approach, termed Porpoise, to accurately identify pseudouridine sites from RNA sequence data. Porpoise builds upon a comprehensive evaluation of 18 frequently used feature encoding schemes based on the selection of four types of features, including binary features, pseudo k-tuple composition, nucleotide chemical property and position-specific trinucleotide propensity based on single-strand (PSTNPss). The selected features are fed into the stacked ensemble learning framework to enable the construction of an effective stacked model. Both cross-validation tests on the benchmark dataset and independent tests show that Porpoise achieves superior predictive performance than several state-of-the-art approaches. The application of model interpretation tools demonstrates the importance of PSTNPs for the performance of the trained models. This new method is anticipated to facilitate community-wide efforts to identify putative pseudouridine sites and formulate novel testable biological hypothesis.
Collapse
Affiliation(s)
- Fuyi Li
- Department of Microbiology and Immunology, Peter Doherty Institute for Infection and Immunity, the University of Melbourne, Australia
| | | | - Peipei Jin
- Department of Clinical Laboratory of Ruijin Hospital, affiliated with Shanghai Jiao Tong University School of Medicine, Shanghai, China
| | | | - Dongxu Xiang
- Faculty of Engineering and Information Technology, The University of Melbourne, Australia
| | - Jiangning Song
- Monash Biomedicine Discovery Institute, Monash University, Australia
| | - Lachlan J M Coin
- Department of Microbiology and Immunology at the University of Melbourne, Australia
| |
Collapse
|
18
|
Irwin AD, Coin LJM, Harris PNA, Cotta MO, Bauer MJ, Buckley C, Balch R, Kruger P, Meyer J, Shekar K, Brady K, Fourie C, Sharp N, Vlad L, Whiley D, Beatson SA, Forde BM, Paterson D, Clark J, Hajkowicz K, Raman S, Bialasiewicz S, Lipman J, Schlapbach LJ, Roberts JA. Optimising Treatment Outcomes for Children and Adults Through Rapid Genome Sequencing of Sepsis Pathogens. A Study Protocol for a Prospective, Multi-Centre Trial (DIRECT). Front Cell Infect Microbiol 2021; 11:667680. [PMID: 34249774 PMCID: PMC8261237 DOI: 10.3389/fcimb.2021.667680] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2021] [Accepted: 05/20/2021] [Indexed: 11/23/2022] Open
Abstract
Background Sepsis contributes significantly to morbidity and mortality globally. In Australia, 20,000 develop sepsis every year, resulting in 5,000 deaths, and more than AUD$846 million in expenditure. Prompt, appropriate antibiotic therapy is effective in improving outcomes in sepsis. Conventional culture-based methods to identify appropriate therapy have limited yield and take days to complete. Recently, nanopore technology has enabled rapid sequencing with real-time analysis of pathogen DNA. We set out to demonstrate the feasibility and diagnostic accuracy of pathogen sequencing direct from clinical samples, and estimate the impact of this approach on time to effective therapy when integrated with personalised software-guided antimicrobial dosing in children and adults on ICU with sepsis. Methods The DIRECT study is a pilot prospective, non-randomized multicentre trial of an integrated diagnostic and therapeutic algorithm combining rapid direct pathogen sequencing and software-guided, personalised antibiotic dosing in children and adults with sepsis on ICU. Participants and interventions DIRECT will collect microbiological and pharmacokinetic samples from approximately 200 children and adults with sepsis admitted to one of four ICUs in Brisbane. In Phase 1, we will evaluate Oxford Nanopore Technologies MinION sequencing direct from blood in 50 blood culture-proven sepsis patients recruited from consecutive patients with suspected sepsis. In Phase 2, a further 50 consecutive patients with suspected sepsis will be recruited in whom MinION sequencing will be combined with Bayesian software-guided (ID-ODS) personalised antimicrobial dosing. Outcome measures The primary outcome is time to effective antimicrobial therapy, defined as trough drug concentrations above the MIC of the pathogen. Secondary outcomes are diagnostic accuracy of MinION sequencing from whole blood, time to pathogen identification and susceptibility testing using sequencing direct from whole blood and from positive blood culture broth. Discussion Rapid pathogen sequencing coupled with antimicrobial dosing software has great potential to overcome the limitations of conventional diagnostics which often result in prolonged inappropriate antimicrobial therapy. Reduced time to optimal antimicrobial therapy may reduce sepsis mortality and ICU length of stay. This pilot study will yield key feasibility data to inform further, urgently needed sepsis studies. Phase 2 of the trial protocol is registered with the ANZCTR (ACTRN12620001122943). Trial registration Registered with the Australia New Zealand Clinical Trials Registry Number ACTRN12620001122943
Collapse
Affiliation(s)
- Adam D Irwin
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia.,Infection Management and Prevention Service, Queensland Children's Hospital, Brisbane, QLD, Australia
| | - Lachlan J M Coin
- Department of Microbiology and Immunology, University of Melbourne at The Peter Doherty Institute for Infection and Immunity, Melbourne, VIC, Australia.,Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia
| | - Patrick N A Harris
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Menino Osbert Cotta
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Michelle J Bauer
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Cameron Buckley
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Ross Balch
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Peter Kruger
- Intensive Care Unit, Princess Alexandra Hospital, Brisbane, QLD, Australia
| | - Jason Meyer
- Intensive Care Unit, Princess Alexandra Hospital, Brisbane, QLD, Australia
| | - Kiran Shekar
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia.,Adult Intensive Care Services and Critical Care Research Group, The Prince Charles Hospital, Brisbane, QLD, Australia
| | - Kara Brady
- Adult Intensive Care Services and Critical Care Research Group, The Prince Charles Hospital, Brisbane, QLD, Australia
| | - Cheryl Fourie
- Department of Infectious Diseases, Royal Brisbane and Women's Hospital, Brisbane, Brisbane, QLD, Australia
| | - Natalie Sharp
- Paediatric Intensive Care Unit, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia
| | - Luminita Vlad
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - David Whiley
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Scott A Beatson
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD, Australia
| | - Brian M Forde
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - David Paterson
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Julia Clark
- Infection Management and Prevention Service, Queensland Children's Hospital, Brisbane, QLD, Australia
| | - Krispin Hajkowicz
- Department of Infectious Diseases, Royal Brisbane and Women's Hospital, Brisbane, Brisbane, QLD, Australia
| | - Sainath Raman
- Paediatric Intensive Care Unit, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia
| | - Seweryn Bialasiewicz
- School of Chemistry and Molecular Biosciences, The University of Queensland, Brisbane, QLD, Australia
| | - Jeffrey Lipman
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| | - Luregn J Schlapbach
- Paediatric Intensive Care Unit, Queensland Children's Hospital, Children's Health Queensland, Brisbane, QLD, Australia.,Department of Pediatric and Neonatal Intensive Care, University Children's Hospital Zurich, Zurich, Switzerland
| | - Jason A Roberts
- UQ Centre for Clinical Research, The University of Queensland, Brisbane, QLD, Australia
| |
Collapse
|
19
|
Nguyen SH, Cao MD, Coin LJM. Real-time resolution of short-read assembly graph using ONT long reads. PLoS Comput Biol 2021; 17:e1008586. [PMID: 33471816 PMCID: PMC7850483 DOI: 10.1371/journal.pcbi.1008586] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/08/2020] [Revised: 02/01/2021] [Accepted: 11/30/2020] [Indexed: 11/19/2022] Open
Abstract
A streaming assembly pipeline utilising real-time Oxford Nanopore Technology (ONT) sequencing data is important for saving sequencing resources and reducing time-to-result. A previous approach implemented in npScarf provided an efficient streaming algorithm for hybrid assembly but was relatively prone to mis-assemblies compared to other graph-based methods. Here we present npGraph, a streaming hybrid assembly tool using the assembly graph instead of the separated pre-assembly contigs. It is able to produce more complete genome assembly by resolving the path finding problem on the assembly graph using long reads as the traversing guide. Application to synthetic and real data from bacterial isolate genomes show improved accuracy while still maintaining a low computational cost. npGraph also provides a graphical user interface (GUI) which provides a real-time visualisation of the progress of assembly. The tool and source code is available at https://github.com/hsnguyen/assembly.
Collapse
Affiliation(s)
- Son Hoang Nguyen
- Institute for Molecular Bioscience, the University of Queensland, St Lucia, Brisbane, Australia
- * E-mail: (SHN); (LC)
| | - Minh Duc Cao
- Institute for Molecular Bioscience, the University of Queensland, St Lucia, Brisbane, Australia
| | - Lachlan J. M. Coin
- Institute for Molecular Bioscience, the University of Queensland, St Lucia, Brisbane, Australia
- Department of Microbiology and Immunology, The University of Melbourne, Parkville, Australia
- Department of Clinical Pathology, The University of Melbourne, Parkville, Australia
- Department of Infectious Disease, Imperial College London, London, UK
- * E-mail: (SHN); (LC)
| |
Collapse
|
20
|
Murigneux V, Rai SK, Furtado A, Bruxner TJC, Tian W, Harliwong I, Wei H, Yang B, Ye Q, Anderson E, Mao Q, Drmanac R, Wang O, Peters BA, Xu M, Wu P, Topp B, Coin LJM, Henry RJ. Comparison of long-read methods for sequencing and assembly of a plant genome. Gigascience 2020; 9:giaa146. [PMID: 33347571 PMCID: PMC7751402 DOI: 10.1093/gigascience/giaa146] [Citation(s) in RCA: 43] [Impact Index Per Article: 10.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Revised: 07/07/2020] [Accepted: 11/22/2020] [Indexed: 01/25/2023] Open
Abstract
BACKGROUND Sequencing technologies have advanced to the point where it is possible to generate high-accuracy, haplotype-resolved, chromosome-scale assemblies. Several long-read sequencing technologies are available, and a growing number of algorithms have been developed to assemble the reads generated by those technologies. When starting a new genome project, it is therefore challenging to select the most cost-effective sequencing technology, as well as the most appropriate software for assembly and polishing. It is thus important to benchmark different approaches applied to the same sample. RESULTS Here, we report a comparison of 3 long-read sequencing technologies applied to the de novo assembly of a plant genome, Macadamia jansenii. We have generated sequencing data using Pacific Biosciences (Sequel I), Oxford Nanopore Technologies (PromethION), and BGI (single-tube Long Fragment Read) technologies for the same sample. Several assemblers were benchmarked in the assembly of Pacific Biosciences and Nanopore reads. Results obtained from combining long-read technologies or short-read and long-read technologies are also presented. The assemblies were compared for contiguity, base accuracy, and completeness, as well as sequencing costs and DNA material requirements. CONCLUSIONS The 3 long-read technologies produced highly contiguous and complete genome assemblies of M. jansenii. At the time of sequencing, the cost associated with each method was significantly different, but continuous improvements in technologies have resulted in greater accuracy, increased throughput, and reduced costs. We propose updating this comparison regularly with reports on significant iterations of the sequencing technologies.
Collapse
Affiliation(s)
- Valentine Murigneux
- Genome Innovation Hub, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
| | - Subash Kumar Rai
- Genome Innovation Hub, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
| | - Agnelo Furtado
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Timothy J C Bruxner
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
| | - Wei Tian
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Ivon Harliwong
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Hanmin Wei
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Bicheng Yang
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Qianyu Ye
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Ellis Anderson
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Qing Mao
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Radoje Drmanac
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Ou Wang
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
| | - Brock A Peters
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Mengyang Xu
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Qingdao, Building 2, No. 2 Hengyunshan Road, Qingdao 266555, China
| | - Pei Wu
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Tianjin, Airport Business Park, Building E3, Airport Economics Area, Tianjin 300308, China
| | - Bruce Topp
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Lachlan J M Coin
- Genome Innovation Hub, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Department of Microbiology and Immunology, University of Melbourne at The Peter Doherty Institute for Infection and Immunity, 792 Elizabeth Street, Melbourne, VIC 3004, Australia
| | - Robert J Henry
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
21
|
Murigneux V, Rai SK, Furtado A, Bruxner TJC, Tian W, Harliwong I, Wei H, Yang B, Ye Q, Anderson E, Mao Q, Drmanac R, Wang O, Peters BA, Xu M, Wu P, Topp B, Coin LJM, Henry RJ. Comparison of long-read methods for sequencing and assembly of a plant genome. Gigascience 2020; 9:6042729. [PMID: 33347571 DOI: 10.1101/2020.03.16.992933] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2020] [Revised: 07/07/2020] [Accepted: 11/22/2020] [Indexed: 05/23/2023] Open
Abstract
BACKGROUND Sequencing technologies have advanced to the point where it is possible to generate high-accuracy, haplotype-resolved, chromosome-scale assemblies. Several long-read sequencing technologies are available, and a growing number of algorithms have been developed to assemble the reads generated by those technologies. When starting a new genome project, it is therefore challenging to select the most cost-effective sequencing technology, as well as the most appropriate software for assembly and polishing. It is thus important to benchmark different approaches applied to the same sample. RESULTS Here, we report a comparison of 3 long-read sequencing technologies applied to the de novo assembly of a plant genome, Macadamia jansenii. We have generated sequencing data using Pacific Biosciences (Sequel I), Oxford Nanopore Technologies (PromethION), and BGI (single-tube Long Fragment Read) technologies for the same sample. Several assemblers were benchmarked in the assembly of Pacific Biosciences and Nanopore reads. Results obtained from combining long-read technologies or short-read and long-read technologies are also presented. The assemblies were compared for contiguity, base accuracy, and completeness, as well as sequencing costs and DNA material requirements. CONCLUSIONS The 3 long-read technologies produced highly contiguous and complete genome assemblies of M. jansenii. At the time of sequencing, the cost associated with each method was significantly different, but continuous improvements in technologies have resulted in greater accuracy, increased throughput, and reduced costs. We propose updating this comparison regularly with reports on significant iterations of the sequencing technologies.
Collapse
Affiliation(s)
- Valentine Murigneux
- Genome Innovation Hub, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
| | - Subash Kumar Rai
- Genome Innovation Hub, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
| | - Agnelo Furtado
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Timothy J C Bruxner
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
| | - Wei Tian
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Ivon Harliwong
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Hanmin Wei
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
| | - Bicheng Yang
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Qianyu Ye
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Australia, 300 Herston Road, Herston, QLD 4006, Australia
| | - Ellis Anderson
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Qing Mao
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Radoje Drmanac
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Ou Wang
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
| | - Brock A Peters
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- MGI, BGI-Shenzhen, Building 11, Beishan Industrial Zone, Yantian District, Shenzhen 518083, China
- Advanced Genomics Technology Lab, Complete Genomics Inc., 2904 Orchard Parkway, San Jose, CA 95134, USA
| | - Mengyang Xu
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Qingdao, Building 2, No. 2 Hengyunshan Road, Qingdao 266555, China
| | - Pei Wu
- BGI-Shenzhen, No.21 Hongan 3rd Street, Yantian District, Shenzhen 518083, China
- BGI-Tianjin, Airport Business Park, Building E3, Airport Economics Area, Tianjin 300308, China
| | - Bruce Topp
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Lachlan J M Coin
- Genome Innovation Hub, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, QLD 4072, Australia
- Department of Microbiology and Immunology, University of Melbourne at The Peter Doherty Institute for Infection and Immunity, 792 Elizabeth Street, Melbourne, VIC 3004, Australia
| | - Robert J Henry
- Queensland Alliance for Agriculture and Food Innovation, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
22
|
Zhou C, Olukolu B, Gemenet DC, Wu S, Gruneberg W, Cao MD, Fei Z, Zeng ZB, George AW, Khan A, Yencho GC, Coin LJM. Assembly of whole-chromosome pseudomolecules for polyploid plant genomes using outbred mapping populations. Nat Genet 2020; 52:1256-1264. [PMID: 33128049 DOI: 10.1038/s41588-020-00717-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2017] [Accepted: 09/15/2020] [Indexed: 12/31/2022]
Abstract
Despite advances in sequencing technologies, assembly of complex plant genomes remains elusive due to polyploidy and high repeat content. Here we report PolyGembler for grouping and ordering contigs into pseudomolecules by genetic linkage analysis. Our approach also provides an accurate method with which to detect and fix assembly errors. Using simulated data, we demonstrate that our approach is of high accuracy and outperforms three existing state-of-the-art genetic mapping tools. Particularly, our approach is more robust to the presence of missing genotype data and genotyping errors. We used our method to construct pseudomolecules for allotetraploid lawn grass utilizing PacBio long reads in combination with restriction site-associated DNA sequencing, and for diploid Ipomoea trifida and autotetraploid potato utilizing contigs assembled from Illumina reads in combination with genotype data generated by single-nucleotide polymorphism arrays and genotyping by sequencing, respectively. We resolved 13 assembly errors for a published I. trifida genome assembly and anchored eight unplaced scaffolds in the published potato genome.
Collapse
Affiliation(s)
- Chenxi Zhou
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
- Department of Clinical Pathology, University of Melbourne, Melbourne, Victoria, Australia
| | - Bode Olukolu
- Department of Horticultural Science, North Carolina State University, Raleigh, NC, USA
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, USA
| | - Dorcus C Gemenet
- International Potato Center, Lima, Peru
- CGIAR Excellence in Breeding Platform, International Maize and Wheat Improvement Center, Nairobi, Kenya
| | - Shan Wu
- Boyce Thompson Institute, Cornell University, Ithaca, NY, USA
| | | | - Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, NY, USA
| | - Zhao-Bang Zeng
- Department of Statistics, North Carolina State University, Raleigh, NC, USA
- Bioinformatics Research Center, North Carolina State University, Raleigh, NC, USA
| | - Andrew W George
- Data61, Commonwealth Scientific and Industrial Research Organisation, Brisbane, Queensland, Australia
| | - Awais Khan
- International Potato Center, Lima, Peru
- Department of Plant Pathology and Plant-Microbe Biology, Cornell University, Geneva, NY, USA
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, NC, USA
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia.
- Department of Clinical Pathology, University of Melbourne, Melbourne, Victoria, Australia.
- The Peter Doherty Institute for Infection and Immunity, University of Melbourne, Melbourne, Victoria, Australia.
| |
Collapse
|
23
|
Zhou C, Duarte T, Silvestre R, Rossel G, Mwanga ROM, Khan A, George AW, Fei Z, Yencho GC, Ellis D, Coin LJM. Insights into population structure of East African sweetpotato cultivars from hybrid assembly of chloroplast genomes. Gates Open Res 2020; 2:41. [PMID: 33062940 PMCID: PMC7536352 DOI: 10.12688/gatesopenres.12856.2] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/17/2020] [Indexed: 11/20/2022] Open
Abstract
Background: The chloroplast (cp) genome is an important resource for studying plant diversity and phylogeny. Assembly of the cp genomes from next-generation sequencing data is complicated by the presence of two large inverted repeats contained in the cp DNA. Methods: We constructed a complete circular cp genome assembly for the hexaploid sweetpotato using extremely low coverage (<1×) Oxford Nanopore whole-genome sequencing (WGS) data coupled with Illumina sequencing data for polishing. Results: The sweetpotato cp genome of 161,274 bp contains 152 genes, of which there are 96 protein coding genes, 8 rRNA genes and 48 tRNA genes. Using the cp genome assembly as a reference, we constructed complete cp genome assemblies for a further 17 sweetpotato cultivars from East Africa and an I. triloba line using Illumina WGS data. Analysis of the sweetpotato cp genomes demonstrated the presence of two distinct subpopulations in East Africa. Phylogenetic analysis of the cp genomes of the species from the Convolvulaceae Ipomoea section Batatas revealed that the most closely related diploid wild species of the hexaploid sweetpotato is I. trifida. Conclusions: Nanopore long reads are helpful in construction of cp genome assemblies, especially in solving the two long inverted repeats. We are generally able to extract cp sequences from WGS data of sufficiently high coverage for assembly of cp genomes. The cp genomes can be used to investigate the population structure and the phylogenetic relationship for the sweetpotato.
Collapse
Affiliation(s)
- Chenxi Zhou
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | - Tania Duarte
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | | | | | | | - Awais Khan
- International Potato Center, P.O. Box 1558, Lima 12, Peru.,Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University, Geneva, NY, 14456, USA
| | - Andrew W George
- Data61, CSIRO, Ecosciences Precinct, Brisbane, QLD, 4102, Australia
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
| | - G Craig Yencho
- Department of Horticulture, North Carolina State University, Raleigh, North Carolina, 27695, USA
| | - David Ellis
- International Potato Center, P.O. Box 1558, Lima 12, Peru
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| |
Collapse
|
24
|
Pitt ME, Cao MD, Butler MS, Ramu S, Ganesamoorthy D, Blaskovich MAT, Coin LJM, Cooper MA. Octapeptin C4 and polymyxin resistance occur via distinct pathways in an epidemic XDR Klebsiella pneumoniae ST258 isolate. J Antimicrob Chemother 2020; 74:582-593. [PMID: 30445429 DOI: 10.1093/jac/dky458] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2018] [Revised: 10/09/2018] [Accepted: 10/09/2018] [Indexed: 12/12/2022] Open
Abstract
BACKGROUND Polymyxin B and E (colistin) have been pivotal in the treatment of XDR Gram-negative bacterial infections; however, resistance has emerged. A structurally related lipopeptide, octapeptin C4, has shown significant potency against XDR bacteria, including polymyxin-resistant strains, but its mode of action remains undefined. OBJECTIVES We sought to compare and contrast the acquisition of resistance in an XDR Klebsiella pneumoniae (ST258) clinical isolate in vitro with all three lipopeptides to potentially unveil variations in their mode of action. METHODS The isolate was exposed to increasing concentrations of polymyxins and octapeptin C4 over 20 days. Day 20 strains underwent WGS, complementation assays, antimicrobial susceptibility testing and lipid A analysis. RESULTS Twenty days of exposure to the polymyxins resulted in a 1000-fold increase in the MIC, whereas for octapeptin C4 a 4-fold increase was observed. There was no cross-resistance observed between the polymyxin- and octapeptin-resistant strains. Sequencing of polymyxin-resistant isolates revealed mutations in previously known resistance-associated genes, including crrB, mgrB, pmrB, phoPQ and yciM, along with novel mutations in qseC. Octapeptin C4-resistant isolates had mutations in mlaDF and pqiB, genes related to phospholipid transport. These genetic variations were reflected in distinct phenotypic changes to lipid A. Polymyxin-resistant isolates increased 4-amino-4-deoxyarabinose fortification of lipid A phosphate groups, whereas the lipid A of octapeptin C4-resistant strains harboured a higher abundance of hydroxymyristate and palmitoylate. CONCLUSIONS Octapeptin C4 has a distinct mode of action compared with the polymyxins, highlighting its potential as a future therapeutic agent to combat the increasing threat of XDR bacteria.
Collapse
Affiliation(s)
- Miranda E Pitt
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Mark S Butler
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Soumya Ramu
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Devika Ganesamoorthy
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Mark A T Blaskovich
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Matthew A Cooper
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| |
Collapse
|
25
|
Pitt ME, Nguyen SH, Duarte TPS, Teng H, Blaskovich MAT, Cooper MA, Coin LJM. Evaluating the genome and resistome of extensively drug-resistant Klebsiella pneumoniae using native DNA and RNA Nanopore sequencing. Gigascience 2020; 9:giaa002. [PMID: 32016399 PMCID: PMC6998412 DOI: 10.1093/gigascience/giaa002] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/03/2019] [Revised: 10/17/2019] [Accepted: 01/10/2020] [Indexed: 01/06/2023] Open
Abstract
BACKGROUND Klebsiella pneumoniae frequently harbours multidrug resistance, and current diagnostics struggle to rapidly identify appropriate antibiotics to treat these bacterial infections. The MinION device can sequence native DNA and RNA in real time, providing an opportunity to compare the utility of DNA and RNA for prediction of antibiotic susceptibility. However, the effectiveness of bacterial direct RNA sequencing and base-calling has not previously been investigated. This study interrogated the genome and transcriptome of 4 extensively drug-resistant (XDR) K. pneumoniae clinical isolates; however, further antimicrobial susceptibility testing identified 3 isolates as pandrug-resistant (PDR). RESULTS The majority of acquired resistance (≥75%) resided on plasmids including several megaplasmids (≥100 kb). DNA sequencing detected most resistance genes (≥70%) within 2 hours of sequencing. Neural network-based base-calling of direct RNA achieved up to 86% identity rate, although ≤23% of reads could be aligned. Direct RNA sequencing (with ∼6 times slower pore translocation) was able to identify (within 10 hours) ≥35% of resistance genes, including those associated with resistance to aminoglycosides, β-lactams, trimethoprim, and sulphonamide and also quinolones, rifampicin, fosfomycin, and phenicol in some isolates. Direct RNA sequencing also identified the presence of operons containing up to 3 resistance genes. Polymyxin-resistant isolates showed a heightened transcription of phoPQ (≥2-fold) and the pmrHFIJKLM operon (≥8-fold). Expression levels estimated from direct RNA sequencing displayed strong correlation (Pearson: 0.86) compared to quantitative real-time PCR across 11 resistance genes. CONCLUSION Overall, MinION sequencing rapidly detected the XDR/PDR K. pneumoniae resistome, and direct RNA sequencing provided accurate estimation of expression levels of these genes.
Collapse
Affiliation(s)
- Miranda E Pitt
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria, 3000, Australia
| | - Son H Nguyen
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Tânia P S Duarte
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Haotian Teng
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Mark A T Blaskovich
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Matthew A Cooper
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Road, Brisbane, Queensland, 4072, Australia
- The Peter Doherty Institute for Infection and Immunity, The University of Melbourne, 792 Elizabeth Street, Melbourne, Victoria, 3000, Australia
| |
Collapse
|
26
|
Gemenet DC, da Silva Pereira G, De Boeck B, Wood JC, Mollinari M, Olukolu BA, Diaz F, Mosquera V, Ssali RT, David M, Kitavi MN, Burgos G, Felde TZ, Ghislain M, Carey E, Swanckaert J, Coin LJM, Fei Z, Hamilton JP, Yada B, Yencho GC, Zeng ZB, Mwanga ROM, Khan A, Gruneberg WJ, Buell CR. Quantitative trait loci and differential gene expression analyses reveal the genetic basis for negatively associated β-carotene and starch content in hexaploid sweetpotato [Ipomoea batatas (L.) Lam.]. Theor Appl Genet 2020; 133:23-36. [PMID: 31595335 PMCID: PMC6952332 DOI: 10.1007/s00122-019-03437-7] [Citation(s) in RCA: 17] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/12/2019] [Accepted: 09/17/2019] [Indexed: 05/10/2023]
Abstract
KEY MESSAGE β-Carotene content in sweetpotato is associated with the Orange and phytoene synthase genes; due to physical linkage of phytoene synthase with sucrose synthase, β-carotene and starch content are negatively correlated. In populations depending on sweetpotato for food security, starch is an important source of calories, while β-carotene is an important source of provitamin A. The negative association between the two traits contributes to the low nutritional quality of sweetpotato consumed, especially in sub-Saharan Africa. Using a biparental mapping population of 315 F1 progeny generated from a cross between an orange-fleshed and a non-orange-fleshed sweetpotato variety, we identified two major quantitative trait loci (QTL) on linkage group (LG) three (LG3) and twelve (LG12) affecting starch, β-carotene, and their correlated traits, dry matter and flesh color. Analysis of parental haplotypes indicated that these two regions acted pleiotropically to reduce starch content and increase β-carotene in genotypes carrying the orange-fleshed parental haplotype at the LG3 locus. Phytoene synthase and sucrose synthase, the rate-limiting and linked genes located within the QTL on LG3 involved in the carotenoid and starch biosynthesis, respectively, were differentially expressed in Beauregard versus Tanzania storage roots. The Orange gene, the molecular switch for chromoplast biogenesis, located within the QTL on LG12 while not differentially expressed was expressed in developing roots of the parental genotypes. We conclude that these two QTL regions act together in a cis and trans manner to inhibit starch biosynthesis in amyloplasts and enhance chromoplast biogenesis, carotenoid biosynthesis, and accumulation in orange-fleshed sweetpotato. Understanding the genetic basis of this negative association between starch and β-carotene will inform future sweetpotato breeding strategies targeting sweetpotato for food and nutritional security.
Collapse
Affiliation(s)
- Dorcus C Gemenet
- International Potato Center, ILRI Campus, Old Naivasha Road, P.O. Box 25171-00603, Nairobi, Kenya.
| | | | - Bert De Boeck
- International Potato Center, Av. La Molina 1895, Lima, Peru
| | - Joshua C Wood
- Michigan State University, East Lansing, MI, 48824, USA
| | | | - Bode A Olukolu
- North Carolina State University, Raleigh, NC, 27695, USA
- University of Tennessee, Knoxville, TN, 37996, USA
| | - Federico Diaz
- International Potato Center, Av. La Molina 1895, Lima, Peru
| | | | | | - Maria David
- International Potato Center, Av. La Molina 1895, Lima, Peru
| | - Mercy N Kitavi
- International Potato Center, ILRI Campus, Old Naivasha Road, P.O. Box 25171-00603, Nairobi, Kenya
| | | | | | - Marc Ghislain
- International Potato Center, ILRI Campus, Old Naivasha Road, P.O. Box 25171-00603, Nairobi, Kenya
| | | | | | - Lachlan J M Coin
- University of Queensland, St. Lucia, Brisbane, QLD, 4072, Australia
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
| | | | - Benard Yada
- National Crops Resources Research Institute (NaCCRI), Namulonge, P.O. Box 7084, Kampala, Uganda
| | - G Craig Yencho
- North Carolina State University, Raleigh, NC, 27695, USA
| | - Zhao-Bang Zeng
- North Carolina State University, Raleigh, NC, 27695, USA
| | | | - Awais Khan
- International Potato Center, Av. La Molina 1895, Lima, Peru
- Plant Pathology and Plant-Microbe Biology Section, Cornell University, Geneva, NY, 14456, USA
| | | | - C Robin Buell
- Michigan State University, East Lansing, MI, 48824, USA
| |
Collapse
|
27
|
Raftery LJ, Howard CB, Grewal YS, Vaidyanathan R, Jones ML, Anderson W, Korbie D, Duarte T, Cao MD, Nguyen SH, Coin LJM, Mahler SM, Trau M. Retooling phage display with electrohydrodynamic nanomixing and nanopore sequencing. Lab Chip 2019; 19:4083-4092. [PMID: 31712799 DOI: 10.1039/c9lc00978g] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/10/2023]
Abstract
Phage display methodologies offer a versatile platform for the isolation of single-chain Fv (scFv) molecules which may be rebuilt into monoclonal antibodies. Herein, we report on a complete workflow termed PhageXpress, for rapid selection of single-chain Fv sequences by leveraging electrohydrodynamic-manipulation of a solution containing phage library particles to enhance target binding whilst minimizing non-specific interactions. Our PhageXpress technique is combined with Oxford Nanopore Technologies' MinION sequencer and custom bioinformatics to achieve high-throughput screening of phage libraries. We performed 4 rounds of biopanning against Dengue virus (DENV) non-structural protein 1 (NS1) using traditional methods (4 week turnaround), which resulted in the isolation of 19 unique scFv clones. We validated the feasibility and efficiency of the PhageXpress method utilizing the same phage library and antigen target. Notably, we successfully mapped 14 of the 19 anti-NS1 scFv sequences (∼74%) with our new method, despite using ∼30-fold less particles during screening and conducting only a single round of biopanning. We believe this approach supersedes traditional methods for the discovery of bio-recognition molecules such as antibodies by speeding up the process for the development of therapeutic and diagnostic biologics.
Collapse
Affiliation(s)
- Lyndon J Raftery
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia.
| | - Christopher B Howard
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and Centre for Personalised Nanomedicine, AIBN, University of Queensland, Brisbane, Australia and ARC Training Centre for Biopharmaceutical Innovation, AIBN, University of Queensland, Brisbane, Australia
| | - Yadveer S Grewal
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and Centre for Personalised Nanomedicine, AIBN, University of Queensland, Brisbane, Australia
| | - Ramanathan Vaidyanathan
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and Centre for Personalised Nanomedicine, AIBN, University of Queensland, Brisbane, Australia
| | - Martina L Jones
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and ARC Training Centre for Biopharmaceutical Innovation, AIBN, University of Queensland, Brisbane, Australia
| | - Will Anderson
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and Centre for Personalised Nanomedicine, AIBN, University of Queensland, Brisbane, Australia
| | - Darren Korbie
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and Centre for Personalised Nanomedicine, AIBN, University of Queensland, Brisbane, Australia
| | - Tania Duarte
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Son Hoang Nguyen
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| | - Stephen M Mahler
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and ARC Training Centre for Biopharmaceutical Innovation, AIBN, University of Queensland, Brisbane, Australia
| | - Matt Trau
- Australian Institute for Bioengineering and Nanotechnology (AIBN), University of Queensland, Brisbane, Australia. and Centre for Personalised Nanomedicine, AIBN, University of Queensland, Brisbane, Australia and School of Chemistry and Molecular Biosciences (SCMB), University of Queensland, Brisbane, Australia
| |
Collapse
|
28
|
Couto Alves A, De Silva NMG, Karhunen V, Sovio U, Das S, Taal HR, Warrington NM, Lewin AM, Kaakinen M, Cousminer DL, Thiering E, Timpson NJ, Bond TA, Lowry E, Brown CD, Estivill X, Lindi V, Bradfield JP, Geller F, Speed D, Coin LJM, Loh M, Barton SJ, Beilin LJ, Bisgaard H, Bønnelykke K, Alili R, Hatoum IJ, Schramm K, Cartwright R, Charles MA, Salerno V, Clément K, Claringbould AAJ, van Duijn CM, Moltchanova E, Eriksson JG, Elks C, Feenstra B, Flexeder C, Franks S, Frayling TM, Freathy RM, Elliott P, Widén E, Hakonarson H, Hattersley AT, Rodriguez A, Banterle M, Heinrich J, Heude B, Holloway JW, Hofman A, Hyppönen E, Inskip H, Kaplan LM, Hedman AK, Läärä E, Prokisch H, Grallert H, Lakka TA, Lawlor DA, Melbye M, Ahluwalia TS, Marinelli M, Millwood IY, Palmer LJ, Pennell CE, Perry JR, Ring SM, Savolainen MJ, Rivadeneira F, Standl M, Sunyer J, Tiesler CMT, Uitterlinden AG, Schierding W, O’Sullivan JM, Prokopenko I, Herzig KH, Smith GD, O'Reilly P, Felix JF, Buxton JL, Blakemore AIF, Ong KK, Jaddoe VWV, Grant SFA, Sebert S, McCarthy MI, Järvelin MR. GWAS on longitudinal growth traits reveals different genetic factors influencing infant, child, and adult BMI. Sci Adv 2019; 5:eaaw3095. [PMID: 31840077 PMCID: PMC6904961 DOI: 10.1126/sciadv.aaw3095] [Citation(s) in RCA: 66] [Impact Index Per Article: 13.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/07/2018] [Accepted: 08/06/2019] [Indexed: 05/29/2023]
Abstract
Early childhood growth patterns are associated with adult health, yet the genetic factors and the developmental stages involved are not fully understood. Here, we combine genome-wide association studies with modeling of longitudinal growth traits to study the genetics of infant and child growth, followed by functional, pathway, genetic correlation, risk score, and colocalization analyses to determine how developmental timings, molecular pathways, and genetic determinants of these traits overlap with those of adult health. We found a robust overlap between the genetics of child and adult body mass index (BMI), with variants associated with adult BMI acting as early as 4 to 6 years old. However, we demonstrated a completely distinct genetic makeup for peak BMI during infancy, influenced by variation at the LEPR/LEPROT locus. These findings suggest that different genetic factors control infant and child BMI. In light of the obesity epidemic, these findings are important to inform the timing and targets of prevention strategies.
Collapse
Affiliation(s)
- Alexessander Couto Alves
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- School of Biosciences and Medicine, Faculty of Health and Medical Sciences, University of Surrey, Surrey, UK
| | - N. Maneka G. De Silva
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
| | - Ville Karhunen
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
| | - Ulla Sovio
- Department of Obstetrics and Gynaecology, University of Cambridge, Cambridge, UK
- NIHR Cambridge Biomedical Research Centre, Cambridge, UK
| | - Shikta Das
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- MRC Unit for Lifelong Health and Ageing at UCL, University College London, London, UK
| | - H. Rob Taal
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Paediatrics, Erasmus MC, Sophia Children’s Hospital, Rotterdam, Netherlands
| | - Nicole M. Warrington
- Division of Obstetrics and Gynaecology, The University of Western Australia, Perth, Western Australia, Australia
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia
| | - Alexandra M. Lewin
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK
| | - Marika Kaakinen
- Department of Genomics of Common Disease, School of Public Health, Imperial College London, Hammersmith Hospital, London, UK
- Centre for Pharmacology and Therapeutics, Division of Experimental Medicine, Department of Medicine, Imperial College London, Hammersmith Hospital, London, UK
- Department of Clinical and Experimental Medicine, School of Biosciences and Medicine, University of Surrey, Surrey, UK
| | - Diana L. Cousminer
- Division of Human Genetics, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Institute of Biomedicine, Department of Physiology, University of Eastern Finland, Kuopio, Finland
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Elisabeth Thiering
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich Neuherberg, Germany
- Division of Metabolic Diseases and Nutritional Medicine, Dr von Hauner Children’s Hospital, Ludwig-Maximilians University Munich, Munich, Germany
| | - Nicholas J. Timpson
- MRC Integrative Epidemiology Unit at the University of Bristol and NIHR Bristol Biomedical Research Center, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
| | - Tom A. Bond
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
| | - Estelle Lowry
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
| | - Christopher D. Brown
- Department of Genetics and Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Xavier Estivill
- Genomics and Disease Group, Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Catalonia, Spain
- Pompeu Fabra University (UPF), Barcelona, Catalonia, Spain
- Hospital del Mar Medical Research Institute (IMIM), Barcelona, Catalonia, Spain
- Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Madrid, Spain
| | - Virpi Lindi
- Institute of Biomedicine, Department of Physiology, University of Eastern Finland, Kuopio, Finland
| | - Jonathan P. Bradfield
- Center for Applied Genomics, Abramson Research Center, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
| | - Frank Geller
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
| | - Doug Speed
- Aarhus Institute of Advanced Studies (AIAS), Aarhus University, Aarhus, Denmark
- UCL Genetics Institute, University College London, London, UK
| | - Lachlan J. M. Coin
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
| | - Marie Loh
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Translational Laboratory in Genetic Medicine (TLGM), Agency for Science, Technology and Research (A*STAR) Singapore, Singapore
| | - Sheila J. Barton
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton General Hospital, Southampton, UK
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
| | - Lawrence J. Beilin
- Medical School, Royal Perth Hospital, University of Western Australia, Perth, Western Australia, Australia
| | - Hans Bisgaard
- COPSAC, The Copenhagen Prospective Studies on Asthma in Childhood, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Klaus Bønnelykke
- COPSAC, The Copenhagen Prospective Studies on Asthma in Childhood, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Rohia Alili
- CRNH Ile de France, Hôpital Pitié-Salpêtrière, Paris, France
| | - Ida J. Hatoum
- CRNH Ile de France, Hôpital Pitié-Salpêtrière, Paris, France
- Obesity, Metabolism, and Nutrition Institute and Gastrointestinal Unit, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Katharina Schramm
- Institute of Human Genetics, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
- Institute of Human Genetics, Technische Universität München, München, Germany
| | - Rufus Cartwright
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- Institute for Reproductive and Developmental Biology, Imperial College London, London, UK
| | - Marie-Aline Charles
- Inserm, UMR 1153 (CRESS), Paris Descartes University, Villejuif, Paris, France
| | - Vincenzo Salerno
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
| | - Karine Clément
- CRNH Ile de France, Hôpital Pitié-Salpêtrière, Paris, France
- Inserm, UMR 1153 (CRESS), Paris Descartes University, Villejuif, Paris, France
| | - Annique A. J. Claringbould
- University Medical Centre Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV Groningen, Netherlands
| | - BIOS Consortium
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- School of Biosciences and Medicine, Faculty of Health and Medical Sciences, University of Surrey, Surrey, UK
- Department of Obstetrics and Gynaecology, University of Cambridge, Cambridge, UK
- NIHR Cambridge Biomedical Research Centre, Cambridge, UK
- MRC Unit for Lifelong Health and Ageing at UCL, University College London, London, UK
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Paediatrics, Erasmus MC, Sophia Children’s Hospital, Rotterdam, Netherlands
- Division of Obstetrics and Gynaecology, The University of Western Australia, Perth, Western Australia, Australia
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK
- Department of Genomics of Common Disease, School of Public Health, Imperial College London, Hammersmith Hospital, London, UK
- Centre for Pharmacology and Therapeutics, Division of Experimental Medicine, Department of Medicine, Imperial College London, Hammersmith Hospital, London, UK
- Department of Clinical and Experimental Medicine, School of Biosciences and Medicine, University of Surrey, Surrey, UK
- Division of Human Genetics, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Institute of Biomedicine, Department of Physiology, University of Eastern Finland, Kuopio, Finland
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich Neuherberg, Germany
- Division of Metabolic Diseases and Nutritional Medicine, Dr von Hauner Children’s Hospital, Ludwig-Maximilians University Munich, Munich, Germany
- MRC Integrative Epidemiology Unit at the University of Bristol and NIHR Bristol Biomedical Research Center, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Department of Genetics and Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Genomics and Disease Group, Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Catalonia, Spain
- Pompeu Fabra University (UPF), Barcelona, Catalonia, Spain
- Hospital del Mar Medical Research Institute (IMIM), Barcelona, Catalonia, Spain
- Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Madrid, Spain
- Sidra Medical and Research Center, Doha, Qatar
- Center for Applied Genomics, Abramson Research Center, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
- Aarhus Institute of Advanced Studies (AIAS), Aarhus University, Aarhus, Denmark
- UCL Genetics Institute, University College London, London, UK
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
- Translational Laboratory in Genetic Medicine (TLGM), Agency for Science, Technology and Research (A*STAR) Singapore, Singapore
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton General Hospital, Southampton, UK
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
- Medical School, Royal Perth Hospital, University of Western Australia, Perth, Western Australia, Australia
- COPSAC, The Copenhagen Prospective Studies on Asthma in Childhood, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
- CRNH Ile de France, Hôpital Pitié-Salpêtrière, Paris, France
- Obesity, Metabolism, and Nutrition Institute and Gastrointestinal Unit, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Institute of Human Genetics, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
- Institute of Human Genetics, Technische Universität München, München, Germany
- Institute for Reproductive and Developmental Biology, Imperial College London, London, UK
- Inserm, UMR 1153 (CRESS), Paris Descartes University, Villejuif, Paris, France
- University Medical Centre Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV Groningen, Netherlands
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
- Department of General Practice and Primary Health Care, University of Helsinki, and Helsinki University Hospital, Helsinki, Finland
- Department of Chronic Disease Prevention, National Institute for Health and Welfare, Helsinki, Finland
- Folkhalsan Research Center, Helsinki, Finland
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge Biomedical Campus, Cambridge, UK
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, University of Exeter, Royal Devon and Exeter Hospital, Exeter, UK
- National Institute for Health Research, Imperial College Biomedical Research Centre, London, UK
- Health Data Research UK London, Imperial College London, London, UK
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Institute of Diabetes, Obesity and Metabolism, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- School of Psychology, College of Social Science, University of Lincoln Brayford Pool Lincoln, Lincolnshire, UK
- Human Genetics and Medical Genomics, Faculty of Medicine, University of Southampton, Southampton, UK
- South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia
- Great Ormond Street Hospital Institute of Child Health, University College London, London, UK
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, North Terrace, Adelaide, South Australia, Australia
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Cardiovascular Medicine Unit, Department of Medicine, Karolinska Institute, Stockholm, Sweden
- Research Unit of Mathematical Sciences, University of Oulu, Oulu, Finland
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- German Center for Diabetes Research (DZD), Neuherberg, Germany
- Kuopio Research Institute of Exercise Medicine, Kuopio, Finland
- Department of Clinical Physiology and Nuclear Medicine, Kuopio University Hospital, Kuopio, Finland
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
- Department of Medicine, Stanford University Medical School, Stanford, CA, USA
- ISGlobal, Centre for Research in Environmental Epidemiology (CREAL), Barcelona, Spain
- Clinical Trial Service Unit and Epidemiological Studies Unit (CTSU), University of Oxford, Old Road Campus, Oxford, UK
- Medical Research Council Population Health Research Unit (MRC PHRU) at the University of Oxford, Oxford, UK
- School of Public Health and Robinson Research Institute, University of Adelaide, Adelaide, Australia
- Avon Longitudinal Study of Parents and Children, School of Social and Community Medicine, University of Bristol, Bristol, UK
- Division of Internal Medicine, and Biocenter of Oulu, Faculty of Medicine, Oulu University, Oulu, Finland
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Liggins Institute, University of Auckland, Auckland, New Zealand
- A Better Start—National Science, Challenge, University of Auckland, Auckland, New Zealand
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Headington, Oxford, UK
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Research Unit of Biomedicine, University Oulu, Oulu, Finland
- Medical Research Center and Oulu University Hospital, University of Oulu, Oulu, Finland
- Department of Gastroenterology and Metabolism, Poznan University of Medical Sciences, Poznan, Poland
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, King’s College London, De Crespigny Park, London, UK
- School of Life Sciences, Pharmacy and Chemistry, Kingston University, Kingston upon Thames, UK
- Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK
- Section of Investigative Medicine, Division of Diabetes, Endocrinology and Metabolism, Imperial College London, London, UK
- Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
- Unit of Primary Care, Oulu University Hospital, Oulu, Finland
| | - Cornelia M. van Duijn
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Elena Moltchanova
- Department of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
| | - Johan G. Eriksson
- Department of General Practice and Primary Health Care, University of Helsinki, and Helsinki University Hospital, Helsinki, Finland
- Department of Chronic Disease Prevention, National Institute for Health and Welfare, Helsinki, Finland
- Folkhalsan Research Center, Helsinki, Finland
| | - Cathy Elks
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge Biomedical Campus, Cambridge, UK
| | - Bjarke Feenstra
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
| | - Claudia Flexeder
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich Neuherberg, Germany
| | - Stephen Franks
- Institute for Reproductive and Developmental Biology, Imperial College London, London, UK
| | - Timothy M. Frayling
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, University of Exeter, Royal Devon and Exeter Hospital, Exeter, UK
| | - Rachel M. Freathy
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, University of Exeter, Royal Devon and Exeter Hospital, Exeter, UK
| | - Paul Elliott
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- National Institute for Health Research, Imperial College Biomedical Research Centre, London, UK
- Health Data Research UK London, Imperial College London, London, UK
| | - Elisabeth Widén
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
| | - Hakon Hakonarson
- Division of Human Genetics, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Center for Applied Genomics, Abramson Research Center, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Institute of Diabetes, Obesity and Metabolism, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Andrew T. Hattersley
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, University of Exeter, Royal Devon and Exeter Hospital, Exeter, UK
| | - Alina Rodriguez
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- School of Psychology, College of Social Science, University of Lincoln Brayford Pool Lincoln, Lincolnshire, UK
| | - Marco Banterle
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK
| | - Joachim Heinrich
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich Neuherberg, Germany
| | - Barbara Heude
- Inserm, UMR 1153 (CRESS), Paris Descartes University, Villejuif, Paris, France
| | - John W. Holloway
- Human Genetics and Medical Genomics, Faculty of Medicine, University of Southampton, Southampton, UK
| | - Albert Hofman
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Elina Hyppönen
- South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia
- Great Ormond Street Hospital Institute of Child Health, University College London, London, UK
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, North Terrace, Adelaide, South Australia, Australia
| | - Hazel Inskip
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton General Hospital, Southampton, UK
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
| | - Lee M. Kaplan
- Obesity, Metabolism, and Nutrition Institute and Gastrointestinal Unit, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
| | - Asa K. Hedman
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Cardiovascular Medicine Unit, Department of Medicine, Karolinska Institute, Stockholm, Sweden
| | - Esa Läärä
- Research Unit of Mathematical Sciences, University of Oulu, Oulu, Finland
| | - Holger Prokisch
- Institute of Human Genetics, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
- Institute of Human Genetics, Technische Universität München, München, Germany
| | - Harald Grallert
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- German Center for Diabetes Research (DZD), Neuherberg, Germany
| | - Timo A. Lakka
- Institute of Biomedicine, Department of Physiology, University of Eastern Finland, Kuopio, Finland
- Kuopio Research Institute of Exercise Medicine, Kuopio, Finland
- Department of Clinical Physiology and Nuclear Medicine, Kuopio University Hospital, Kuopio, Finland
| | - Debbie A. Lawlor
- MRC Integrative Epidemiology Unit at the University of Bristol and NIHR Bristol Biomedical Research Center, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
| | - Mads Melbye
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
- Department of Medicine, Stanford University Medical School, Stanford, CA, USA
| | - Tarunveer S. Ahluwalia
- COPSAC, The Copenhagen Prospective Studies on Asthma in Childhood, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
| | - Marcella Marinelli
- Hospital del Mar Medical Research Institute (IMIM), Barcelona, Catalonia, Spain
- Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Madrid, Spain
- ISGlobal, Centre for Research in Environmental Epidemiology (CREAL), Barcelona, Spain
| | - Iona Y. Millwood
- Clinical Trial Service Unit and Epidemiological Studies Unit (CTSU), University of Oxford, Old Road Campus, Oxford, UK
- Medical Research Council Population Health Research Unit (MRC PHRU) at the University of Oxford, Oxford, UK
| | - Lyle J. Palmer
- School of Public Health and Robinson Research Institute, University of Adelaide, Adelaide, Australia
| | - Craig E. Pennell
- Division of Obstetrics and Gynaecology, The University of Western Australia, Perth, Western Australia, Australia
| | - John R. Perry
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge Biomedical Campus, Cambridge, UK
| | - Susan M. Ring
- MRC Integrative Epidemiology Unit at the University of Bristol and NIHR Bristol Biomedical Research Center, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
- Avon Longitudinal Study of Parents and Children, School of Social and Community Medicine, University of Bristol, Bristol, UK
| | - Markku J. Savolainen
- Division of Internal Medicine, and Biocenter of Oulu, Faculty of Medicine, Oulu University, Oulu, Finland
| | - Fernando Rivadeneira
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Marie Standl
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich Neuherberg, Germany
| | - Jordi Sunyer
- Pompeu Fabra University (UPF), Barcelona, Catalonia, Spain
- Hospital del Mar Medical Research Institute (IMIM), Barcelona, Catalonia, Spain
- Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Madrid, Spain
- ISGlobal, Centre for Research in Environmental Epidemiology (CREAL), Barcelona, Spain
| | - Carla M. T. Tiesler
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich Neuherberg, Germany
- Division of Metabolic Diseases and Nutritional Medicine, Dr von Hauner Children’s Hospital, Ludwig-Maximilians University Munich, Munich, Germany
| | - Andre G. Uitterlinden
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
| | | | - Justin M. O’Sullivan
- Liggins Institute, University of Auckland, Auckland, New Zealand
- A Better Start—National Science, Challenge, University of Auckland, Auckland, New Zealand
| | - Inga Prokopenko
- Department of Genomics of Common Disease, School of Public Health, Imperial College London, Hammersmith Hospital, London, UK
- Department of Clinical and Experimental Medicine, School of Biosciences and Medicine, University of Surrey, Surrey, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Headington, Oxford, UK
| | - Karl-Heinz Herzig
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Research Unit of Biomedicine, University Oulu, Oulu, Finland
- Medical Research Center and Oulu University Hospital, University of Oulu, Oulu, Finland
- Department of Gastroenterology and Metabolism, Poznan University of Medical Sciences, Poznan, Poland
| | - George Davey Smith
- MRC Integrative Epidemiology Unit at the University of Bristol and NIHR Bristol Biomedical Research Center, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
| | - Paul O'Reilly
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, King’s College London, De Crespigny Park, London, UK
| | - Janine F. Felix
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Paediatrics, Erasmus MC, Sophia Children’s Hospital, Rotterdam, Netherlands
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Jessica L. Buxton
- School of Life Sciences, Pharmacy and Chemistry, Kingston University, Kingston upon Thames, UK
| | - Alexandra I. F. Blakemore
- Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK
- Section of Investigative Medicine, Division of Diabetes, Endocrinology and Metabolism, Imperial College London, London, UK
| | - Ken K. Ong
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge Biomedical Campus, Cambridge, UK
| | - Vincent W. V. Jaddoe
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
| | - Struan F. A. Grant
- Division of Human Genetics, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Center for Applied Genomics, Abramson Research Center, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Institute of Diabetes, Obesity and Metabolism, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
| | - Sylvain Sebert
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
| | - Mark I. McCarthy
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Headington, Oxford, UK
- Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
| | - Marjo-Riitta Järvelin
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Medical Research Center and Oulu University Hospital, University of Oulu, Oulu, Finland
- Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK
- Unit of Primary Care, Oulu University Hospital, Oulu, Finland
| | - Early Growth Genetics (EGG) Consortium
- Department of Epidemiology and Biostatistics, MRC-PHE Centre for Environment and Health, School of Public Health, Imperial College London, London, UK
- School of Biosciences and Medicine, Faculty of Health and Medical Sciences, University of Surrey, Surrey, UK
- Department of Obstetrics and Gynaecology, University of Cambridge, Cambridge, UK
- NIHR Cambridge Biomedical Research Centre, Cambridge, UK
- MRC Unit for Lifelong Health and Ageing at UCL, University College London, London, UK
- The Generation R Study Group, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Paediatrics, Erasmus MC, Sophia Children’s Hospital, Rotterdam, Netherlands
- Division of Obstetrics and Gynaecology, The University of Western Australia, Perth, Western Australia, Australia
- The University of Queensland Diamantina Institute, The University of Queensland, Woolloongabba, Queensland, Australia
- Department of Medical Statistics, London School of Hygiene and Tropical Medicine, London, UK
- Department of Genomics of Common Disease, School of Public Health, Imperial College London, Hammersmith Hospital, London, UK
- Centre for Pharmacology and Therapeutics, Division of Experimental Medicine, Department of Medicine, Imperial College London, Hammersmith Hospital, London, UK
- Department of Clinical and Experimental Medicine, School of Biosciences and Medicine, University of Surrey, Surrey, UK
- Division of Human Genetics, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Institute of Biomedicine, Department of Physiology, University of Eastern Finland, Kuopio, Finland
- Institute for Molecular Medicine Finland, University of Helsinki, Helsinki, Finland
- Institute of Epidemiology I, Helmholtz Zentrum München, German Research Center for Environmental Health, Munich Neuherberg, Germany
- Division of Metabolic Diseases and Nutritional Medicine, Dr von Hauner Children’s Hospital, Ludwig-Maximilians University Munich, Munich, Germany
- MRC Integrative Epidemiology Unit at the University of Bristol and NIHR Bristol Biomedical Research Center, Bristol, UK
- Population Health Science, Bristol Medical School, University of Bristol, Bristol, UK
- Center for Life Course Health Research, Faculty of Medicine, University of Oulu, Oulu, Finland
- Department of Genetics and Institute for Biomedical Informatics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Genomics and Disease Group, Bioinformatics and Genomics Programme, Centre for Genomic Regulation (CRG), Barcelona, Catalonia, Spain
- Pompeu Fabra University (UPF), Barcelona, Catalonia, Spain
- Hospital del Mar Medical Research Institute (IMIM), Barcelona, Catalonia, Spain
- Spanish Consortium for Research on Epidemiology and Public Health (CIBERESP), Madrid, Spain
- Sidra Medical and Research Center, Doha, Qatar
- Center for Applied Genomics, Abramson Research Center, The Children’s Hospital of Philadelphia, Philadelphia, PA, USA
- Department of Epidemiology Research, Statens Serum Institut, Copenhagen, Denmark
- Aarhus Institute of Advanced Studies (AIAS), Aarhus University, Aarhus, Denmark
- UCL Genetics Institute, University College London, London, UK
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Queensland, Australia
- Translational Laboratory in Genetic Medicine (TLGM), Agency for Science, Technology and Research (A*STAR) Singapore, Singapore
- MRC Lifecourse Epidemiology Unit, University of Southampton, Southampton General Hospital, Southampton, UK
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
- Medical School, Royal Perth Hospital, University of Western Australia, Perth, Western Australia, Australia
- COPSAC, The Copenhagen Prospective Studies on Asthma in Childhood, Faculty of Health Sciences, University of Copenhagen, Copenhagen, Denmark
- CRNH Ile de France, Hôpital Pitié-Salpêtrière, Paris, France
- Obesity, Metabolism, and Nutrition Institute and Gastrointestinal Unit, Massachusetts General Hospital, Boston, MA, USA
- Department of Medicine, Harvard Medical School, Boston, MA, USA
- Institute of Human Genetics, Helmholtz Center Munich, German Research Center for Environmental Health, Neuherberg, Germany
- Institute of Human Genetics, Technische Universität München, München, Germany
- Institute for Reproductive and Developmental Biology, Imperial College London, London, UK
- Inserm, UMR 1153 (CRESS), Paris Descartes University, Villejuif, Paris, France
- University Medical Centre Groningen, Department of Genetics, Antonius Deusinglaan 1, 9713 AV Groningen, Netherlands
- Department of Epidemiology, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Department of Mathematics and Statistics, University of Canterbury, Christchurch, New Zealand
- Department of General Practice and Primary Health Care, University of Helsinki, and Helsinki University Hospital, Helsinki, Finland
- Department of Chronic Disease Prevention, National Institute for Health and Welfare, Helsinki, Finland
- Folkhalsan Research Center, Helsinki, Finland
- MRC Epidemiology Unit, University of Cambridge School of Clinical Medicine, Institute of Metabolic Science, Cambridge Biomedical Campus, Cambridge, UK
- Institute of Biomedical and Clinical Science, University of Exeter Medical School, University of Exeter, Royal Devon and Exeter Hospital, Exeter, UK
- National Institute for Health Research, Imperial College Biomedical Research Centre, London, UK
- Health Data Research UK London, Imperial College London, London, UK
- Department of Pediatrics, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- Institute of Diabetes, Obesity and Metabolism, Perelman School of Medicine, University of Pennsylvania, Philadelphia, PA, USA
- School of Psychology, College of Social Science, University of Lincoln Brayford Pool Lincoln, Lincolnshire, UK
- Human Genetics and Medical Genomics, Faculty of Medicine, University of Southampton, Southampton, UK
- South Australian Health and Medical Research Institute, Adelaide, South Australia, Australia
- Great Ormond Street Hospital Institute of Child Health, University College London, London, UK
- Australian Centre for Precision Health, University of South Australia Cancer Research Institute, North Terrace, Adelaide, South Australia, Australia
- NIHR Southampton Biomedical Research Centre, University of Southampton and University Hospital Southampton NHS Foundation Trust, Southampton, UK
- Wellcome Centre for Human Genetics, University of Oxford, Oxford, UK
- Cardiovascular Medicine Unit, Department of Medicine, Karolinska Institute, Stockholm, Sweden
- Research Unit of Mathematical Sciences, University of Oulu, Oulu, Finland
- Research Unit of Molecular Epidemiology, Helmholtz Zentrum München, German Research Center for Environmental Health, Neuherberg, Germany
- German Center for Diabetes Research (DZD), Neuherberg, Germany
- Kuopio Research Institute of Exercise Medicine, Kuopio, Finland
- Department of Clinical Physiology and Nuclear Medicine, Kuopio University Hospital, Kuopio, Finland
- Department of Clinical Medicine, University of Copenhagen, Copenhagen, Denmark
- Department of Medicine, Stanford University Medical School, Stanford, CA, USA
- ISGlobal, Centre for Research in Environmental Epidemiology (CREAL), Barcelona, Spain
- Clinical Trial Service Unit and Epidemiological Studies Unit (CTSU), University of Oxford, Old Road Campus, Oxford, UK
- Medical Research Council Population Health Research Unit (MRC PHRU) at the University of Oxford, Oxford, UK
- School of Public Health and Robinson Research Institute, University of Adelaide, Adelaide, Australia
- Avon Longitudinal Study of Parents and Children, School of Social and Community Medicine, University of Bristol, Bristol, UK
- Division of Internal Medicine, and Biocenter of Oulu, Faculty of Medicine, Oulu University, Oulu, Finland
- Department of Internal Medicine, Erasmus MC, University Medical Center Rotterdam, Rotterdam, Netherlands
- Liggins Institute, University of Auckland, Auckland, New Zealand
- A Better Start—National Science, Challenge, University of Auckland, Auckland, New Zealand
- Oxford Centre for Diabetes, Endocrinology and Metabolism, University of Oxford, Churchill Hospital, Headington, Oxford, UK
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Research Unit of Biomedicine, University Oulu, Oulu, Finland
- Medical Research Center and Oulu University Hospital, University of Oulu, Oulu, Finland
- Department of Gastroenterology and Metabolism, Poznan University of Medical Sciences, Poznan, Poland
- MRC Social, Genetic and Developmental Psychiatry Centre, Institute of Psychiatry, King’s College London, De Crespigny Park, London, UK
- School of Life Sciences, Pharmacy and Chemistry, Kingston University, Kingston upon Thames, UK
- Department of Life Sciences, College of Health and Life Sciences, Brunel University London, London, UK
- Section of Investigative Medicine, Division of Diabetes, Endocrinology and Metabolism, Imperial College London, London, UK
- Oxford NIHR Biomedical Research Centre, Oxford University Hospitals NHS Foundation Trust, John Radcliffe Hospital, Oxford, UK
- Unit of Primary Care, Oulu University Hospital, Oulu, Finland
| |
Collapse
|
29
|
Shimizu C, Kim J, Eleftherohorinou H, Wright VJ, Hoang LT, Tremoulet AH, Franco A, Hibberd ML, Takahashi A, Kubo M, Ito K, Tanaka T, Onouchi Y, Coin LJM, Levin M, Burns JC, Shike H. HLA-C variants associated with amino acid substitutions in the peptide binding groove influence susceptibility to Kawasaki disease. Hum Immunol 2019; 80:731-738. [PMID: 31122742 PMCID: PMC10793643 DOI: 10.1016/j.humimm.2019.04.020] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/04/2018] [Revised: 04/26/2019] [Accepted: 04/27/2019] [Indexed: 10/26/2022]
Abstract
Kawasaki disease (KD) is a pediatric vasculitis caused by an unknown trigger in genetically susceptible children. The incidence varies widely across genetically diverse populations. Several associations with HLA Class I alleles have been reported in single cohort studies. Using a genetic approach, from the nine single nucleotide variants (SNVs) associated with KD susceptibility in children of European descent, we identified SNVs near the HLA-C (rs6906846) and HLA-B genes (rs2254556) whose association was replicated in a Japanese descent cohort (rs6906846 p = 0.01, rs2254556 p = 0.005). The risk allele (A at rs6906846) was also associated with HLA-C*07:02 and HLA-C*04:01 in both US multi-ethnic and Japanese cohorts and HLA-C*12:02 only in the Japanese cohort. The risk A-allele was associated with eight non-conservative amino acid substitutions (amino acid positions); Asp or Ser (9), Arg (14), Ala (49), Ala (73), Ala (90), Arg (97), Phe or Ser (99), and Phe or Ser (116) in the HLA-C peptide binding groove that binds peptides for presentation to cytotoxic T cells (CTL). This raises the possibility of increased affinity to a "KD peptide" that contributes to the vasculitis of KD in genetically susceptible children.
Collapse
Affiliation(s)
- Chisato Shimizu
- Department of Pediatrics, University California San Diego, La Jolla, CA, USA.
| | - Jihoon Kim
- Division of Biomedical Informatics, Department of Medicine, University California San Diego, La Jolla, CA, USA
| | - Hariklia Eleftherohorinou
- Section of Paediatrics, Division of Infectious Diseases, Department of Medicine, Imperial College London, London, UK
| | - Victoria J Wright
- Section of Paediatrics, Division of Infectious Diseases, Department of Medicine, Imperial College London, London, UK
| | | | - Adriana H Tremoulet
- Department of Pediatrics, University California San Diego, La Jolla, CA, USA; Department of Cardiology, Rady Childrens' Hospital San Diego, San Diego, CA, USA
| | - Alessandra Franco
- Department of Pediatrics, University California San Diego, La Jolla, CA, USA
| | | | - Atsushi Takahashi
- Laboratory for Statistical Analysis, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan; Department of Genomic Medicine, Research Institute, National Cerebral and Cardiovascular Center, Suita, Osaka, Japan
| | - Michiaki Kubo
- RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Kaoru Ito
- Laboratory for Cardiovascular Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan
| | - Toshihiro Tanaka
- Department of Human Genetics and Disease Diversity, Tokyo Medical and Dental University Graduate School of Medical and Dental Sciences, Bunkyo-ku, Tokyo, Japan
| | - Yoshihiro Onouchi
- Laboratory for Cardiovascular Diseases, RIKEN Center for Integrative Medical Sciences, Yokohama, Kanagawa, Japan; Department of Public Health, Chiba University Graduate School of Medicine, Chiba, Japan
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Australia
| | - Michael Levin
- Section of Paediatrics, Division of Infectious Diseases, Department of Medicine, Imperial College London, London, UK
| | - Jane C Burns
- Department of Pediatrics, University California San Diego, La Jolla, CA, USA; Department of Cardiology, Rady Childrens' Hospital San Diego, San Diego, CA, USA
| | - Hiroko Shike
- Department of Pathology, HLA Laboratory, Penn State Hershey Medical Center, Hershey, PA, USA
| |
Collapse
|
30
|
Bainomugisa A, Pandey S, Donnan E, Simpson G, Foster J, Lavu E, Hiasihri S, McBryde ES, Moke R, Vincent S, Sintchenko V, Marais BJ, Coin LJM, Coulter C. Cross-Border Movement of Highly Drug-Resistant Mycobacterium tuberculosis from Papua New Guinea to Australia through Torres Strait Protected Zone, 2010-2015. Emerg Infect Dis 2019; 25:406-415. [PMID: 30789135 DOI: 10.3201/eid2503.181003] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/22/2023] Open
Abstract
In this retrospective study, we used whole-genome sequencing (WGS) to delineate transmission dynamics, characterize drug-resistance markers, and identify risk factors of transmission among Papua New Guinea residents of the Torres Strait Protected Zone (TSPZ) who had tuberculosis diagnoses during 2010-2015. Of 117 isolates collected, we could acquire WGS data for 100; 79 were Beijing sublineage 2.2.1.1, which was associated with active transmission (odds ratio 6.190, 95% CI 2.221-18.077). Strains were distributed widely throughout the TSPZ. Clustering occurred more often within than between villages (p = 0.0013). Including 4 multidrug-resistant tuberculosis isolates from Australia citizens epidemiologically linked to the TSPZ into the transmission network analysis revealed 2 probable cross-border transmission events. All multidrug-resistant isolates (33/104) belonged to Beijing sublineage 2.2.1.1 and had high-level isoniazid and ethionamide co-resistance; 2 isolates were extensively drug resistant. Including WGS in regional surveillance could improve tuberculosis transmission tracking and control strategies within the TSPZ.
Collapse
|
31
|
Bialasiewicz S, Duarte TPS, Nguyen SH, Sukumaran V, Stewart A, Appleton S, Pitt ME, Bainomugisa A, Jennison AV, Graham R, Coin LJM, Hajkowicz K. Rapid diagnosis of Capnocytophaga canimorsus septic shock in an immunocompetent individual using real-time Nanopore sequencing: a case report. BMC Infect Dis 2019; 19:660. [PMID: 31340776 PMCID: PMC6657077 DOI: 10.1186/s12879-019-4173-2] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/11/2018] [Accepted: 06/07/2019] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Rapid diagnosis and appropriate treatment is imperative in bacterial sepsis due increasing risk of mortality with every hour without appropriate antibiotic therapy. Atypical infections with fastidious organisms may take more than 4 days to diagnose leading to calls for improved methods for rapidly diagnosing sepsis. Capnocytophaga canimorsus is a slow-growing, fastidious gram-negative bacillus which is a common commensal within the mouths of dogs, but rarely cause infections in humans. C. canimorsus sepsis risk factors include immunosuppression, alcoholism and elderly age. Here we report on the application of emerging nanopore sequencing methods to rapidly diagnose an atypical case of C. canimorsus septic shock. CASE PRESENTATION A 62 year-old female patient was admitted to an intensive care unit with septic shock and multi-organ failure six days after a reported dog bite. Blood cultures were unable to detect a pathogen after 3 days despite observed intracellular bacilli on blood smears. Real-time nanopore sequencing was subsequently employed on whole blood to detect Capnocytophaga canimorsus in 19 h. The patient was not immunocompromised and did not have any other known risk factors. Whole-genome sequencing of clinical sample and of the offending dog's oral swabs showed near-identical C. canimorsus genomes. The patient responded to antibiotic treatment and was discharged from hospital 31 days after admission. CONCLUSIONS Use of real-time nanopore sequencing reduced the time-to-diagnosis of Capnocytophaga canimorsus in this case from 6.25 days to 19 h. Capnocytophaga canimorsus should be considered in cases of suspected sepsis involving cat or dog contact, irrespective of the patient's known risk factors.
Collapse
Affiliation(s)
- Seweryn Bialasiewicz
- Centre for Children's Health Research, Children's Health Queensland, 62 Graham St., South Brisbane, QLD, 4101, Australia. .,Child Health Research Centre, The University of Queensland, 62 Graham St., South Brisbane, QLD, 4101, Australia.
| | - Tania P S Duarte
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Rd, St Lucia, QLD, 4072, Australia
| | - Son H Nguyen
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Rd, St Lucia, QLD, 4072, Australia
| | - Vichitra Sukumaran
- Infectious Diseases Unit Royal Brisbane and Women's Hospital, Level 6, Joyce Tweddell Building, Royal Brisbane and Women's Hospital, Brisbane, QLD, 4029, Australia
| | - Alexandra Stewart
- Infectious Diseases Unit Royal Brisbane and Women's Hospital, Level 6, Joyce Tweddell Building, Royal Brisbane and Women's Hospital, Brisbane, QLD, 4029, Australia
| | - Sally Appleton
- QML Pathology, PO Box 2280, Mansfield, QLD, 4122, Australia
| | - Miranda E Pitt
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Rd, St Lucia, QLD, 4072, Australia
| | - Arnold Bainomugisa
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Rd, St Lucia, QLD, 4072, Australia
| | - Amy V Jennison
- Forensic and Scientific Services, Queensland Department of Health, 39 Kessels Rd, Coopers Plains, QLD, 4108, Australia
| | - Rikki Graham
- Forensic and Scientific Services, Queensland Department of Health, 39 Kessels Rd, Coopers Plains, QLD, 4108, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, 306 Carmody Rd, St Lucia, QLD, 4072, Australia
| | - Krispin Hajkowicz
- Infectious Diseases Unit Royal Brisbane and Women's Hospital, Level 6, Joyce Tweddell Building, Royal Brisbane and Women's Hospital, Brisbane, QLD, 4029, Australia.,School of Clinical Medicine, University of Queensland Level 6, Oral Health Centre, (883) 288 Herston Road, Herston, QLD, 4006, Australia
| |
Collapse
|
32
|
Teng H, Cao MD, Hall MB, Duarte T, Wang S, Coin LJM. Correction to: Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning. Gigascience 2019; 8:5488102. [PMID: 31077312 PMCID: PMC6511066 DOI: 10.1093/gigascience/giz049] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022] Open
Affiliation(s)
- Haotian Teng
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Michael B Hall
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Tania Duarte
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Sheng Wang
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| |
Collapse
|
33
|
Cao MD, Ganesamoorthy D, Zhou C, Coin LJM. Simulating the dynamics of targeted capture sequencing with CapSim. Bioinformatics 2018; 34:873-874. [PMID: 29092025 PMCID: PMC6192212 DOI: 10.1093/bioinformatics/btx691] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/05/2017] [Accepted: 10/27/2017] [Indexed: 11/13/2022] Open
Abstract
Motivation Targeted sequencing using capture probes has become increasingly popular in clinical
applications due to its scalability and cost-effectiveness. The approach also allows for
higher sequencing coverage of the targeted regions resulting in better analysis
statistical power. However, because of the dynamics of the hybridization process, it is
difficult to evaluate the efficiency of the probe design prior to the experiments which
are time consuming and costly. Results We developed CapSim, a software package for simulation of targeted sequencing. Given a
genome sequence and a set of probes, CapSim simulates the fragmentation, the dynamics of
probe hybridization and the sequencing of the captured fragments on Illumina and PacBio
sequencing platforms. The simulated data can be used for evaluating the performance of
the analysis pipeline, as well as the efficiency of the probe design. Parameters of the
various stages in the sequencing process can also be evaluated in order to optimize the
experiments. Availability and implementation CapSim is publicly available under BSD license at https://github.com/Devika1/capsim. Supplementary information Supplementary data are
available at Bioinformatics online.
Collapse
Affiliation(s)
- Minh Duc Cao
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, St Lucia, QLD 4072, Australia
| | - Devika Ganesamoorthy
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, St Lucia, QLD 4072, Australia
| | - Chenxi Zhou
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, St Lucia, QLD 4072, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, St Lucia, QLD 4072, Australia.,Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK
| |
Collapse
|
34
|
Shao H, Zhou C, Cao MD, Coin LJM. Ongoing human chromosome end extension revealed by analysis of BioNano and nanopore data. Sci Rep 2018; 8:16616. [PMID: 30413723 PMCID: PMC6226469 DOI: 10.1038/s41598-018-34774-0] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/04/2018] [Accepted: 10/22/2018] [Indexed: 11/08/2022] Open
Abstract
The majority of human chromosome ends remain incompletely assembled due to their highly repetitive structure. In this study, we use BioNano data to anchor and extend chromosome ends from two European trios as well as two unrelated Asian genomes. At least 11 BioNano assembled chromosome ends are structurally divergent from the reference genome, including both missing sequence and extensions. These extensions are heritable and in some cases divergent between Asian and European samples. Six out of nine predicted extension sequences from NA12878 can be confirmed and filled by nanopore data. We identify two multi-kilobase sequence families both enriched more than 100-fold in extension sequence (p-values < 1e-5) whose origins can be traced to interstitial sequence on ancestral primate chromosome 7. Extensive sub-telomeric duplication of these families has occurred in the human lineage subsequent to divergence from chimpanzees.
Collapse
Affiliation(s)
- Haojing Shao
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | - Chenxi Zhou
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | - Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia.
| |
Collapse
|
35
|
Wu S, Lau KH, Cao Q, Hamilton JP, Sun H, Zhou C, Eserman L, Gemenet DC, Olukolu BA, Wang H, Crisovan E, Godden GT, Jiao C, Wang X, Kitavi M, Manrique-Carpintero N, Vaillancourt B, Wiegert-Rininger K, Yang X, Bao K, Schaff J, Kreuze J, Gruneberg W, Khan A, Ghislain M, Ma D, Jiang J, Mwanga ROM, Leebens-Mack J, Coin LJM, Yencho GC, Buell CR, Fei Z. Genome sequences of two diploid wild relatives of cultivated sweetpotato reveal targets for genetic improvement. Nat Commun 2018; 9:4580. [PMID: 30389915 PMCID: PMC6214957 DOI: 10.1038/s41467-018-06983-8] [Citation(s) in RCA: 130] [Impact Index Per Article: 21.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/20/2018] [Accepted: 10/05/2018] [Indexed: 02/08/2023] Open
Abstract
Sweetpotato [Ipomoea batatas (L.) Lam.] is a globally important staple food crop, especially for sub-Saharan Africa. Agronomic improvement of sweetpotato has lagged behind other major food crops due to a lack of genomic and genetic resources and inherent challenges in breeding a heterozygous, clonally propagated polyploid. Here, we report the genome sequences of its two diploid relatives, I. trifida and I. triloba, and show that these high-quality genome assemblies are robust references for hexaploid sweetpotato. Comparative and phylogenetic analyses reveal insights into the ancient whole-genome triplication history of Ipomoea and evolutionary relationships within the Batatas complex. Using resequencing data from 16 genotypes widely used in African breeding programs, genes and alleles associated with carotenoid biosynthesis in storage roots are identified, which may enable efficient breeding of varieties with high provitamin A content. These resources will facilitate genome-enabled breeding in this important food security crop.
Collapse
Affiliation(s)
- Shan Wu
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
| | - Kin H Lau
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Qinghe Cao
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
- Jiangsu Xuzhou Sweetpotato Research Center, Xuzhou, Jiangsu, 221131, China
| | - John P Hamilton
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Honghe Sun
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
- National Engineering Research Center for Vegetables, Beijing Academy of Agriculture and Forestry Sciences, Beijing, 100097, China
| | - Chenxi Zhou
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | - Lauren Eserman
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
- Department of Conservation and Research, Atlanta Botanical Garden, Atlanta, GA, 30309, USA
| | | | - Bode A Olukolu
- Department of Horticultural Science, North Carolina State University, Raleigh, NC, 27695, USA
- Department of Entomology and Plant Pathology, University of Tennessee, Knoxville, TN, 37996, USA
| | - Haiyan Wang
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | - Emily Crisovan
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Grant T Godden
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | - Chen Jiao
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
| | - Xin Wang
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
| | - Mercy Kitavi
- International Potato Center, Nairobi, 00603, Kenya
| | | | - Brieanne Vaillancourt
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
| | | | - Xinsun Yang
- Food Crops Institute, Hubei Academy of Agricultural Sciences, Wuhan, 430064, China
| | - Kan Bao
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
| | - Jennifer Schaff
- Genomic Sciences Laboratory, North Carolina State University, Raleigh, NC, 27695, USA
| | - Jan Kreuze
- International Potato Center, Lima 12, Peru
| | | | - Awais Khan
- International Potato Center, Lima 12, Peru
- Plant Pathology and Plant-Microbe Biology Section, Cornell University, Geneva, NY, 14456, USA
| | | | - Daifu Ma
- Jiangsu Xuzhou Sweetpotato Research Center, Xuzhou, Jiangsu, 221131, China
| | - Jiming Jiang
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA
- Department of Horticulture, Michigan State University, East Lansing, MI, 48824, USA
| | | | - Jim Leebens-Mack
- Department of Plant Biology, University of Georgia, Athens, GA, 30602, USA
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | - G Craig Yencho
- Department of Horticultural Science, North Carolina State University, Raleigh, NC, 27695, USA
| | - C Robin Buell
- Department of Plant Biology, Michigan State University, East Lansing, MI, 48824, USA.
- Plant Resilience Institute, Michigan State University, East Lansing, MI, 48824, USA.
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA.
- USDA-ARS Robert W. Holley Center for Agriculture and Health, Ithaca, NY, 14853, USA.
| |
Collapse
|
36
|
Wright VJ, Herberg JA, Kaforou M, Shimizu C, Eleftherohorinou H, Shailes H, Barendregt AM, Menikou S, Gormley S, Berk M, Hoang LT, Tremoulet AH, Kanegaye JT, Coin LJM, Glodé MP, Hibberd M, Kuijpers TW, Hoggart CJ, Burns JC, Levin M. Diagnosis of Kawasaki Disease Using a Minimal Whole-Blood Gene Expression Signature. JAMA Pediatr 2018; 172:e182293. [PMID: 30083721 PMCID: PMC6233768 DOI: 10.1001/jamapediatrics.2018.2293] [Citation(s) in RCA: 74] [Impact Index Per Article: 12.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Figures] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
IMPORTANCE To date, there is no diagnostic test for Kawasaki disease (KD). Diagnosis is based on clinical features shared with other febrile conditions, frequently resulting in delayed or missed treatment and an increased risk of coronary artery aneurysms. OBJECTIVE To identify a whole-blood gene expression signature that distinguishes children with KD in the first week of illness from other febrile conditions. DESIGN, SETTING, AND PARTICIPANTS The case-control study comprised a discovery group that included a training and test set and a validation group of children with KD or comparator febrile illness. The setting was pediatric centers in the United Kingdom, Spain, the Netherlands, and the United States. The training and test discovery group comprised 404 children with infectious and inflammatory conditions (78 KD, 84 other inflammatory diseases, and 242 bacterial or viral infections) and 55 healthy controls. The independent validation group comprised 102 patients with KD, including 72 in the first 7 days of illness, and 130 febrile controls. The study dates were March 1, 2009, to November 14, 2013, and data analysis took place from January 1, 2015, to December 31, 2017. MAIN OUTCOMES AND MEASURES Whole-blood gene expression was evaluated using microarrays, and minimal transcript sets distinguishing KD were identified using a novel variable selection method (parallel regularized regression model search). The ability of transcript signatures (implemented as disease risk scores) to discriminate KD cases from controls was assessed by area under the curve (AUC), sensitivity, and specificity at the optimal cut point according to the Youden index. RESULTS Among 404 patients in the discovery set, there were 78 with KD (median age, 27 months; 55.1% male) and 326 febrile controls (median age, 37 months; 56.4% male). Among 202 patients in the validation set, there were 72 with KD (median age, 34 months; 62.5% male) and 130 febrile controls (median age, 17 months; 56.9% male). A 13-transcript signature identified in the discovery training set distinguished KD from other infectious and inflammatory conditions in the discovery test set, with AUC of 96.2% (95% CI, 92.5%-99.9%), sensitivity of 81.7% (95% CI, 60.0%-94.8%), and specificity of 92.1% (95% CI, 84.0%-97.0%). In the validation set, the signature distinguished KD from febrile controls, with AUC of 94.6% (95% CI, 91.3%-98.0%), sensitivity of 85.9% (95% CI, 76.8%-92.6%), and specificity of 89.1% (95% CI, 83.0%-93.7%). The signature was applied to clinically defined categories of definite, highly probable, and possible KD, resulting in AUCs of 98.1% (95% CI, 94.5%-100%), 96.3% (95% CI, 93.3%-99.4%), and 70.0% (95% CI, 53.4%-86.6%), respectively, mirroring certainty of clinical diagnosis. CONCLUSIONS AND RELEVANCE In this study, a 13-transcript blood gene expression signature distinguished KD from other febrile conditions. Diagnostic accuracy increased with certainty of clinical diagnosis. A test incorporating the 13-transcript disease risk score may enable earlier diagnosis and treatment of KD and reduce inappropriate treatment in those with other diagnoses.
Collapse
Affiliation(s)
| | - Jethro A. Herberg
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | - Myrsini Kaforou
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | - Chisato Shimizu
- Department of Pediatrics, University of California San Diego, La Jolla,Rady Children’s Hospital–San Diego, San Diego, California
| | | | - Hannah Shailes
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | - Anouk M. Barendregt
- Department of Pediatric Hematology, Immunology & Infectious Diseases, Emma Children’s Hospital, Academic Medical Centre, University of Amsterdam, Amsterdam, the Netherlands
| | - Stephanie Menikou
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | - Stuart Gormley
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | - Maurice Berk
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | | | - Adriana H. Tremoulet
- Department of Pediatrics, University of California San Diego, La Jolla,Rady Children’s Hospital–San Diego, San Diego, California
| | - John T. Kanegaye
- Department of Pediatrics, University of California San Diego, La Jolla,Rady Children’s Hospital–San Diego, San Diego, California
| | - Lachlan J. M. Coin
- Department of Genomics of Common Disease, School of Public Health, Imperial College London, London, United Kingdom,Institute for Molecular Bioscience, The University of Queensland, St Lucia, Australia
| | - Mary P. Glodé
- Section of Infectious Diseases, Department of Pediatrics, University of Colorado Denver School of Medicine Anschutz Medical Campus, Aurora,Children’s Hospital Colorado, Aurora
| | - Martin Hibberd
- Infectious Diseases, Genome Institute of Singapore, Singapore
| | - Taco W. Kuijpers
- Department of Pediatric Hematology, Immunology & Infectious Diseases, Emma Children’s Hospital, Academic Medical Centre, University of Amsterdam, Amsterdam, the Netherlands,Sanquin Research and Landsteiner Laboratory, Department of Blood Cell Research, Academic Medical Centre, University of Amsterdam, Amsterdam, the Netherlands
| | - Clive J. Hoggart
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | - Jane C. Burns
- Department of Pediatrics, University of California San Diego, La Jolla,Rady Children’s Hospital–San Diego, San Diego, California
| | - Michael Levin
- Section of Paediatrics, Imperial College London, London, United Kingdom
| | | |
Collapse
|
37
|
Zhou C, Duarte T, Silvestre R, Rossel G, Mwanga ROM, Khan A, George AW, Fei Z, Yencho GC, Ellis D, Coin LJM. Insights into population structure of East African sweetpotato cultivars from hybrid assembly of chloroplast genomes. Gates Open Res 2018; 2:41. [PMID: 33062940 PMCID: PMC7536352 DOI: 10.12688/gatesopenres.12856.1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 08/30/2018] [Indexed: 03/31/2024] Open
Abstract
Background: The chloroplast (cp) genome is an important resource for studying plant diversity and phylogeny. Assembly of the cp genomes from next-generation sequencing data is complicated by the presence of two large inverted repeats contained in the cp DNA. Methods: We constructed a complete circular cp genome assembly for the hexaploid sweetpotato using extremely low coverage (<1×) Oxford Nanopore whole-genome sequencing (WGS) data coupled with Illumina sequencing data for polishing. Results: The sweetpotato cp genome of 161,274 bp contains 152 genes, of which there are 96 protein coding genes, 8 rRNA genes and 48 tRNA genes. Using the cp genome assembly as a reference, we constructed complete cp genome assemblies for a further 17 sweetpotato cultivars from East Africa and an I. triloba line using Illumina WGS data. Analysis of the sweetpotato cp genomes demonstrated the presence of two distinct subpopulations in East Africa. Phylogenetic analysis of the cp genomes of the species from the Convolvulaceae Ipomoea section Batatas revealed that the most closely related diploid wild species of the hexaploid sweetpotato is I. trifida. Conclusions: Nanopore long reads are helpful in construction of cp genome assemblies, especially in solving the two long inverted repeats. We are generally able to extract cp sequences from WGS data of sufficiently high coverage for assembly of cp genomes. The cp genomes can be used to investigate the population structure and the phylogenetic relationship for the sweetpotato.
Collapse
Affiliation(s)
- Chenxi Zhou
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | - Tania Duarte
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| | | | | | | | - Awais Khan
- International Potato Center, P.O. Box 1558, Lima 12, Peru
- Plant Pathology and Plant-Microbe Biology Section, School of Integrative Plant Science, Cornell University, Geneva, NY, 14456, USA
| | - Andrew W. George
- Data61, CSIRO, Ecosciences Precinct, Brisbane, QLD, 4102, Australia
| | - Zhangjun Fei
- Boyce Thompson Institute, Cornell University, Ithaca, NY, 14853, USA
| | - G. Craig Yencho
- Department of Horticulture, North Carolina State University, Raleigh, North Carolina, 27695, USA
| | - David Ellis
- International Potato Center, P.O. Box 1558, Lima 12, Peru
| | - Lachlan J. M. Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD, 4072, Australia
| |
Collapse
|
38
|
Lu J, Jin M, Nguyen SH, Mao L, Li J, Coin LJM, Yuan Z, Guo J. Non-antibiotic antimicrobial triclosan induces multiple antibiotic resistance through genetic mutation. Environ Int 2018; 118:257-265. [PMID: 29902774 DOI: 10.1016/j.envint.2018.06.004] [Citation(s) in RCA: 101] [Impact Index Per Article: 16.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/26/2018] [Revised: 06/04/2018] [Accepted: 06/05/2018] [Indexed: 05/07/2023]
Abstract
Antibiotic resistance poses a major threat to public health. Overuse and misuse of antibiotics are generally recognized as the key factors contributing to antibiotic resistance. However, whether non-antibiotic, anti-microbial (NAAM) chemicals can directly induce antibiotic resistance is unclear. We aim to investigate whether the exposure to a NAAM chemical triclosan (TCS) has an impact on inducing antibiotic resistance on Escherichia coli. Here, we report that at a concentration of 0.2 mg/L TCS induces multi-drug resistance in wild-type Escherichia coli after 30-day TCS exposure. The oxidative stress induced by TCS caused genetic mutations in genes such as fabI, frdD, marR, acrR and soxR, and subsequent up-regulation of the transcription of genes encoding beta-lactamases and multi-drug efflux pumps, together with down-regulation of genes related to membrane permeability. The findings advance our understanding of the potential role of NAAM chemicals in the dissemination of antibiotic resistance in microbes, and highlight the need for controlling biocide applications.
Collapse
Affiliation(s)
- Ji Lu
- Advanced Water Management Centre (AWMC), The University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Min Jin
- Advanced Water Management Centre (AWMC), The University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Son Hoang Nguyen
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Likai Mao
- Advanced Water Management Centre (AWMC), The University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Jie Li
- Advanced Water Management Centre (AWMC), The University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Zhiguo Yuan
- Advanced Water Management Centre (AWMC), The University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Jianhua Guo
- Advanced Water Management Centre (AWMC), The University of Queensland, St Lucia, Brisbane, QLD 4072, Australia.
| |
Collapse
|
39
|
Nguyen SH, Duarte TPS, Coin LJM, Cao MD. Real-time demultiplexing Nanopore barcoded sequencing data with npBarcode. Bioinformatics 2018; 33:3988-3990. [PMID: 28961965 DOI: 10.1093/bioinformatics/btx537] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2017] [Accepted: 08/23/2017] [Indexed: 11/14/2022] Open
Abstract
Motivation The recent introduction of a barcoding protocol for Oxford Nanopore sequencing has increased the versatility of the technology. Several bioinformatics tools have been developed to demultiplex barcoded reads, but none of them supports streaming analysis. This limits the use of multiplexed sequencing in real-time applications, which is one of the main advantages of the technology. Results We introduced npBarcode, an open source and cross-platform tool for barcode demultiplexing in streaming fashion that can be used to pipe data to further real-time analyses. The tool also provides a friendly graphical user interface by integrating the module into npReader, making possible to monitor the progress concurrently when the sequencing is still in progress. We show that our algorithm achieves accuracies at least as good as competing tools. Availability and implementation npBarcode is bundled in Japsa-a Java tools kit for genome analysis, and is freely available at https://github.com/mdcao/japsa. Contact s.nguyen@uq.edu.au or l.coin@imb.uq.edu.au. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Son Hoang Nguyen
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Tania P S Duarte
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| | - Minh Duc Cao
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD 4072, Australia
| |
Collapse
|
40
|
Shao H, Ganesamoorthy D, Duarte T, Cao MD, Hoggart CJ, Coin LJM. npInv: accurate detection and genotyping of inversions using long read sub-alignment. BMC Bioinformatics 2018; 19:261. [PMID: 30001702 PMCID: PMC6044046 DOI: 10.1186/s12859-018-2252-9] [Citation(s) in RCA: 23] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2017] [Accepted: 06/18/2018] [Indexed: 11/21/2022] Open
Abstract
Background Detection of genomic inversions remains challenging. Many existing methods primarily target inzversions with a non repetitive breakpoint, leaving inverted repeat (IR) mediated non-allelic homologous recombination (NAHR) inversions largely unexplored. Result We present npInv, a novel tool specifically for detecting and genotyping NAHR inversion using long read sub-alignment of long read sequencing data. We benchmark npInv with other tools in both simulation and real data. We use npInv to generate a whole-genome inversion map for NA12878 consisting of 30 NAHR inversions (of which 15 are novel), including all previously known NAHR mediated inversions in NA12878 with flanking IR less than 7kb. Our genotyping accuracy on this dataset was 94%. We used PCR to confirm the presence of two of these novel inversions. We show that there is a near linear relationship between the length of flanking IR and the minimum inversion size, without inverted repeats. Conclusion The application of npInv shows high accuracy in both simulation and real data. The results give deeper insight into understanding inversion. Electronic supplementary material The online version of this article (10.1186/s12859-018-2252-9) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Haojing Shao
- Genomics of Development and Disease Division, Institute for Molecular Bioscience, University of Queensland, 306 Carmody Rd, St Lucia, Brisbane, 4067, Australia
| | - Devika Ganesamoorthy
- Genomics of Development and Disease Division, Institute for Molecular Bioscience, University of Queensland, 306 Carmody Rd, St Lucia, Brisbane, 4067, Australia
| | - Tania Duarte
- Genomics of Development and Disease Division, Institute for Molecular Bioscience, University of Queensland, 306 Carmody Rd, St Lucia, Brisbane, 4067, Australia
| | - Minh Duc Cao
- Genomics of Development and Disease Division, Institute for Molecular Bioscience, University of Queensland, 306 Carmody Rd, St Lucia, Brisbane, 4067, Australia
| | - Clive J Hoggart
- Department of Medicine, Imperial College London, Level 2, Faculty Building South Kensington Campus, London, SW7 2AZ, United Kingdom
| | - Lachlan J M Coin
- Genomics of Development and Disease Division, Institute for Molecular Bioscience, University of Queensland, 306 Carmody Rd, St Lucia, Brisbane, 4067, Australia.
| |
Collapse
|
41
|
Teng H, Cao MD, Hall MB, Duarte T, Wang S, Coin LJM. Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning. Gigascience 2018; 7:4966989. [PMID: 29648610 PMCID: PMC5946831 DOI: 10.1093/gigascience/giy037] [Citation(s) in RCA: 83] [Impact Index Per Article: 13.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2017] [Revised: 02/07/2018] [Indexed: 11/13/2022] Open
Abstract
Sequencing by translocating DNA fragments through an array of nanopores is a rapidly maturing technology that offers faster and cheaper sequencing than other approaches. However, accurately deciphering the DNA sequence from the noisy and complex electrical signal is challenging. Here, we report Chiron, the first deep learning model to achieve end-to-end basecalling and directly translate the raw signal to DNA sequence without the error-prone segmentation step. Trained with only a small set of 4,000 reads, we show that our model provides state-of-the-art basecalling accuracy, even on previously unseen species. Chiron achieves basecalling speeds of more than 2,000 bases per second using desktop computer graphics processing units.
Collapse
Affiliation(s)
- Haotian Teng
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Michael B Hall
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Tania Duarte
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| | - Sheng Wang
- Computational Bioscience Research Center (CBRC), King Abdullah University of Science and Technology (KAUST), Thuwal, 23955, Saudi Arabia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, QLD 4072, Australia
| |
Collapse
|
42
|
Pitt ME, Elliott AG, Cao MD, Ganesamoorthy D, Karaiskos I, Giamarellou H, Abboud CS, Blaskovich MAT, Cooper MA, Coin LJM. Multifactorial chromosomal variants regulate polymyxin resistance in extensively drug-resistant Klebsiella pneumoniae. Microb Genom 2018; 4:e000158. [PMID: 29431605 PMCID: PMC5885010 DOI: 10.1099/mgen.0.000158] [Citation(s) in RCA: 33] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/13/2017] [Accepted: 01/21/2018] [Indexed: 12/05/2022] Open
Abstract
Extensively drug-resistant Klebsiella pneumoniae (XDR-KP) infections cause high mortality and are disseminating globally. Identifying the genetic basis underpinning resistance allows for rapid diagnosis and treatment. XDR isolates sourced from Greece and Brazil, including 19 polymyxin-resistant and five polymyxin-susceptible strains, were subjected to whole genome sequencing. Seventeen of the 19 polymyxin-resistant isolates harboured variations upstream or within mgrB. The most common mutation identified was an insertion at nucleotide position 75 in mgrB via an ISKpn26-like element in the ST258 lineage and ISKpn13 in one ST11 isolate. Three strains acquired an IS1 element upstream of mgrB and another strain had an ISKpn25 insertion at 133 bp. Other isolates had truncations (C28STOP, Q30STOP) or a missense mutation (D29E) affecting mgrB. Complementation assays revealed all mgrB perturbations contributed to resistance. Missense mutations in phoQ (T281M, G385C) were also found to facilitate resistance. Several variants in phoPQ co-segregating with the ISKpn26-like insertion were identified as potential partial suppressor mutations. Three ST258 samples were found to contain subpopulations with different resistance-conferring mutations, including the ISKpn26-like insertion colonizing with a novel mutation in pmrB (P158R), both confirmed via complementation assays. These findings highlight the broad spectrum of chromosomal modifications which can facilitate and regulate resistance against polymyxins in K. pneumoniae.
Collapse
Affiliation(s)
- Miranda E. Pitt
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Alysha G. Elliott
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Minh Duc Cao
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Devika Ganesamoorthy
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Ilias Karaiskos
- 6th Department of Internal Medicine, Hygeia General Hospital, Athens, Greece
| | - Helen Giamarellou
- 6th Department of Internal Medicine, Hygeia General Hospital, Athens, Greece
| | - Cely S. Abboud
- Instituto Dante Pazzanese de Cardiologia, São Paulo, Brazil
| | - Mark A. T. Blaskovich
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Matthew A. Cooper
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| | - Lachlan J. M. Coin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, Australia
| |
Collapse
|
43
|
Chen W, Robertson AJ, Ganesamoorthy D, Coin LJM. sCNAphase: using haplotype resolved read depth to genotype somatic copy number alterations from low cellularity aneuploid tumors. Nucleic Acids Res 2017; 45:e34. [PMID: 27903916 PMCID: PMC5389684 DOI: 10.1093/nar/gkw1086] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/18/2016] [Accepted: 10/26/2016] [Indexed: 02/03/2023] Open
Abstract
Accurate identification of copy number alterations is an essential step in understanding the events driving tumor progression. While a variety of algorithms have been developed to use high-throughput sequencing data to profile copy number changes, no tool is able to reliably characterize ploidy and genotype absolute copy number from tumor samples that contain less than 40% tumor cells. To increase our power to resolve the copy number profile from low-cellularity tumor samples, we developed a novel approach that pre-phases heterozygote germline single nucleotide polymorphisms (SNPs) in order to replace the commonly used ‘B-allele frequency’ with a more powerful ‘parental-haplotype frequency’. We apply our tool—sCNAphase—to characterize the copy number and loss-of-heterozygosity profiles of four publicly available breast cancer cell-lines. Comparisons to previous spectral karyotyping and microarray studies revealed that sCNAphase reliably identified overall ploidy as well as the individual copy number mutations from each cell-line. Analysis of artificial cell-line mixtures demonstrated the capacity of this method to determine the level of tumor cellularity, consistently identify sCNAs and characterize ploidy in samples with as little as 10% tumor cells. This novel methodology has the potential to bring sCNA profiling to low-cellularity tumors, a form of cancer unable to be accurately studied by current methods.
Collapse
Affiliation(s)
- Wenhan Chen
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland, 4072, Australia
| | - Alan J Robertson
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland, 4072, Australia
| | - Devika Ganesamoorthy
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland, 4072, Australia
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, St Lucia, Queensland, 4072, Australia
| |
Collapse
|
44
|
Kaforou M, Herberg JA, Wright VJ, Coin LJM, Levin M. Diagnosis of Bacterial Infection Using a 2-Transcript Host RNA Signature in Febrile Infants 60 Days or Younger. JAMA 2017; 317:1577-1578. [PMID: 28418473 DOI: 10.1001/jama.2017.1365] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 11/14/2022]
Affiliation(s)
- Myrsini Kaforou
- Section of Pediatrics, Imperial College London, London, United Kingdom
| | - Jethro A Herberg
- Section of Pediatrics, Imperial College London, London, United Kingdom
| | - Victoria J Wright
- Section of Pediatrics, Imperial College London, London, United Kingdom
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Queensland, Australia
| | - Michael Levin
- Section of Pediatrics, Imperial College London, London, United Kingdom
| |
Collapse
|
45
|
Cao MD, Nguyen SH, Ganesamoorthy D, Elliott AG, Cooper MA, Coin LJM. Scaffolding and completing genome assemblies in real-time with nanopore sequencing. Nat Commun 2017; 8:14515. [PMID: 28218240 PMCID: PMC5321748 DOI: 10.1038/ncomms14515] [Citation(s) in RCA: 82] [Impact Index Per Article: 11.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/11/2016] [Accepted: 01/06/2017] [Indexed: 01/10/2023] Open
Abstract
Third generation sequencing technologies provide the opportunity to improve genome assemblies by generating long reads spanning most repeat sequences. However, current analysis methods require substantial amounts of sequence data and computational resources to overcome the high error rates. Furthermore, they can only perform analysis after sequencing has completed, resulting in either over-sequencing, or in a low quality assembly due to under-sequencing. Here we present npScarf, which can scaffold and complete short read assemblies while the long read sequencing run is in progress. It reports assembly metrics in real-time so the sequencing run can be terminated once an assembly of sufficient quality is obtained. In assembling four bacterial and one eukaryotic genomes, we show that npScarf can construct more complete and accurate assemblies while requiring less sequencing data and computational resources than existing methods. Our approach offers a time- and resource-effective strategy for completing short read assemblies.
Collapse
Affiliation(s)
- Minh Duc Cao
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, Queensland 4072 Australia
| | - Son Hoang Nguyen
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, Queensland 4072 Australia
| | - Devika Ganesamoorthy
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, Queensland 4072 Australia
| | - Alysha G. Elliott
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, Queensland 4072 Australia
| | - Matthew A. Cooper
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, Queensland 4072 Australia
| | - Lachlan J. M. Coin
- Institute for Molecular Bioscience, University of Queensland, St Lucia, Brisbane, Queensland 4072 Australia
| |
Collapse
|
46
|
Cao MD, Ganesamoorthy D, Cooper MA, Coin LJM. Realtime analysis and visualization of MinION sequencing data with npReader. Bioinformatics 2015; 32:764-6. [PMID: 26556383 DOI: 10.1093/bioinformatics/btv658] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/25/2015] [Accepted: 11/04/2015] [Indexed: 11/13/2022] Open
Abstract
MOTIVATION The recently released Oxford Nanopore MinION sequencing platform presents many innovative features opening up potential for a range of applications not previously possible. Among these features, the ability to sequence in real-time provides a unique opportunity for many time-critical applications. While many software packages have been developed to analyze its data, there is still a lack of toolkits that support the streaming and real-time analysis of MinION sequencing data. RESULTS We developed npReader, an open-source software package to facilitate real-time analysis of MinION sequencing data. npReader can simultaneously extract sequence reads and stream them to downstream analysis pipelines while the samples are being sequenced on the MinION device. It provides a command line interface for easy integration into a bioinformatics work flow, as well as a graphical user interface which concurrently displays the statistics of the run. It also provides an application programming interface for development of streaming algorithms in order to fully utilize the extent of nanopore sequencing potential. AVAILABILITY AND IMPLEMENTATION npReader is written in Java and is freely available at https://github.com/mdcao/npReader CONTACT m.cao1@uq.edu.au or l.coin@imb.uq.edu.au.
Collapse
Affiliation(s)
- Minh Duc Cao
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia and
| | - Devika Ganesamoorthy
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia and
| | - Matthew A Cooper
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia and
| | - Lachlan J M Coin
- Institute for Molecular Bioscience, The University of Queensland, Brisbane, QLD, Australia and Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK
| |
Collapse
|
47
|
Abstract
MOTIVATION Exome sequencing technologies have transformed the field of Mendelian genetics and allowed for efficient detection of genomic variants in protein-coding regions. The target enrichment process that is intrinsic to exome sequencing is inherently imperfect, generating large amounts of unintended off-target sequence. Off-target data are characterized by very low and highly heterogeneous coverage and are usually discarded by exome analysis pipelines. We posit that off-target read depth is a rich, but overlooked, source of information that could be mined to detect intergenic copy number variation (CNV). We propose cnvOffseq, a novel normalization framework for off-target read depth that is based on local adaptive singular value decomposition (SVD). This method is designed to address the heterogeneity of the underlying data and allows for accurate and precise CNV detection and genotyping in off-target regions. RESULTS cnvOffSeq was benchmarked on whole-exome sequencing samples from the 1000 Genomes Project. In a set of 104 gold standard intergenic deletions, our method achieved a sensitivity of 57.5% and a specificity of 99.2%, while maintaining a low FDR of 5%. For gold standard deletions longer than 5 kb, cnvOffSeq achieves a sensitivity of 90.4% without increasing the FDR. cnvOffSeq outperforms both whole-genome and whole-exome CNV detection methods considerably and is shown to offer a substantial improvement over naïve local SVD. AVAILABILITY AND IMPLEMENTATION cnvOffSeq is available at http://sourceforge.net/p/cnvoffseq/. SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Evangelos Bellos
- Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK and Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia
| | - Lachlan J M Coin
- Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK and Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia Department of Genomics of Common Disease, Imperial College London, London W12 0NN, UK and Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia
| |
Collapse
|
48
|
Bellos E, Kumar V, Lin C, Maggi J, Phua ZY, Cheng CY, Cheung CMG, Hibberd ML, Wong TY, Coin LJM, Davila S. cnvCapSeq: detecting copy number variation in long-range targeted resequencing data. Nucleic Acids Res 2014; 42:e158. [PMID: 25228465 PMCID: PMC4227763 DOI: 10.1093/nar/gku849] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022] Open
Abstract
Targeted resequencing technologies have allowed for efficient and cost-effective detection of genomic variants in specific regions of interest. Although capture sequencing has been primarily used for investigating single nucleotide variants and indels, it has the potential to elucidate a broader spectrum of genetic variation, including copy number variants (CNVs). Various methods exist for detecting CNV in whole-genome and exome sequencing datasets. However, no algorithms have been specifically designed for contiguous target sequencing, despite its increasing importance in clinical and research applications. We have developed cnvCapSeq, a novel method for accurate and sensitive CNV discovery and genotyping in long-range targeted resequencing. cnvCapSeq was benchmarked using a simulated contiguous capture sequencing dataset comprising 21 genomic loci of various lengths. cnvCapSeq was shown to outperform the best existing exome CNV method by a wide margin both in terms of sensitivity (92.0 versus 48.3%) and specificity (99.8 versus 70.5%). We also applied cnvCapSeq to a real capture sequencing cohort comprising a contiguous 358 kb region that contains the Complement Factor H gene cluster. In this dataset, cnvCapSeq identified 41 samples with CNV, including two with duplications, with a genotyping accuracy of 99%, as ascertained by quantitative real-time PCR.
Collapse
Affiliation(s)
- Evangelos Bellos
- Department of Genomics of Common Disease, School of Public Health, Imperial College London, London W12 0NN, UK
| | - Vikrant Kumar
- Genome Institute of Singapore, 60 Biopolis St., 138672, Singapore
| | - Clarabelle Lin
- Genome Institute of Singapore, 60 Biopolis St., 138672, Singapore
| | - Jordi Maggi
- Institute of Medical Molecular Genetics, University of Zurich, Wagistrasse 12, 8952 Schlieren, Switzerland
| | - Zai Yang Phua
- Genome Institute of Singapore, 60 Biopolis St., 138672, Singapore
| | - Ching-Yu Cheng
- Singapore Eye Research Institute, Singapore National Eye Center, 11 Third Hospital Avenue, 168751, Singapore Department of Ophthalmology, National University of Singapore, 1E Kent Ridge Road, 119228, Singapore
| | - Chui Ming Gemmy Cheung
- Singapore Eye Research Institute, Singapore National Eye Center, 11 Third Hospital Avenue, 168751, Singapore Department of Ophthalmology, National University of Singapore, 1E Kent Ridge Road, 119228, Singapore
| | - Martin L Hibberd
- Genome Institute of Singapore, 60 Biopolis St., 138672, Singapore Faculty of Infectious and Tropical Diseases, London School of Hygiene and Tropical Medicine, London WC1E 7HT, UK
| | - Tien Yin Wong
- Singapore Eye Research Institute, Singapore National Eye Center, 11 Third Hospital Avenue, 168751, Singapore Department of Ophthalmology, National University of Singapore, 1E Kent Ridge Road, 119228, Singapore
| | - Lachlan J M Coin
- Department of Genomics of Common Disease, School of Public Health, Imperial College London, London W12 0NN, UK Institute for Molecular Bioscience, University of Queensland, St Lucia, QLD 4072, Australia
| | - Sonia Davila
- Genome Institute of Singapore, 60 Biopolis St., 138672, Singapore
| |
Collapse
|
49
|
Zhang F, Chen R, Liu D, Yao X, Li G, Jin Y, Yu C, Li Y, Coin LJM. YHap: a population model for probabilistic assignment of Y haplogroups from re-sequencing data. BMC Bioinformatics 2013; 14:331. [PMID: 24252171 PMCID: PMC4225519 DOI: 10.1186/1471-2105-14-331] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/22/2013] [Accepted: 11/12/2013] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Y haplogroup analyses are an important component of genealogical reconstruction, population genetic analyses, medical genetics and forensics. These fields are increasingly moving towards use of low-coverage, high throughput sequencing. While there have been methods recently proposed for assignment of Y haplogroups on the basis of high-coverage sequence data, assignment on the basis of low-coverage data remains challenging. RESULTS We developed a new algorithm, YHap, which uses an imputation framework to jointly predict Y chromosome genotypes and assign Y haplogroups using low coverage population sequence data. We use data from the 1000 genomes project to demonstrate that YHap provides accurate Y haplogroup assignment with less than 2x coverage. CONCLUSIONS Borrowing information across multiple samples within a population using an imputation framework enables accurate Y haplogroup assignment.
Collapse
|
50
|
Couto Alves A, Bruhn S, Ramasamy A, Wang H, Holloway JW, Hartikainen AL, Jarvelin MR, Benson M, Balding DJ, Coin LJM. Dysregulation of complement system and CD4+ T cell activation pathways implicated in allergic response. PLoS One 2013; 8:e74821. [PMID: 24116013 PMCID: PMC3792967 DOI: 10.1371/journal.pone.0074821] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Accepted: 08/06/2013] [Indexed: 11/18/2022] Open
Abstract
Allergy is a complex disease that is likely to involve dysregulated CD4+ T cell activation. Here we propose a novel methodology to gain insight into how coordinated behaviour emerges between disease-dysregulated pathways in response to pathophysiological stimuli. Using peripheral blood mononuclear cells of allergic rhinitis patients and controls cultured with and without pollen allergens, we integrate CD4+ T cell gene expression from microarray data and genetic markers of allergic sensitisation from GWAS data at the pathway level using enrichment analysis; implicating the complement system in both cellular and systemic response to pollen allergens. We delineate a novel disease network linking T cell activation to the complement system that is significantly enriched for genes exhibiting correlated gene expression and protein-protein interactions, suggesting a tight biological coordination that is dysregulated in the disease state in response to pollen allergen but not to diluent. This novel disease network has high predictive power for the gene and protein expression of the Th2 cytokine profile (IL-4, IL-5, IL-10, IL-13) and of the Th2 master regulator (GATA3), suggesting its involvement in the early stages of CD4+ T cell differentiation. Dissection of the complement system gene expression identifies 7 genes specifically associated with atopic response to pollen, including C1QR1, CFD, CFP, ITGB2, ITGAX and confirms the role of C3AR1 and C5AR1. Two of these genes (ITGB2 and C3AR1) are also implicated in the network linking complement system to T cell activation, which comprises 6 differentially expressed genes. C3AR1 is also significantly associated with allergic sensitisation in GWAS data.
Collapse
MESH Headings
- Allergens/pharmacology
- CD4-Positive T-Lymphocytes/drug effects
- CD4-Positive T-Lymphocytes/immunology
- CD4-Positive T-Lymphocytes/metabolism
- Cell Differentiation/drug effects
- Cell Differentiation/genetics
- Cytokines/genetics
- Cytokines/metabolism
- GATA3 Transcription Factor/genetics
- GATA3 Transcription Factor/metabolism
- Gene Expression Profiling
- Humans
- Leukocytes, Mononuclear/drug effects
- Leukocytes, Mononuclear/immunology
- Leukocytes, Mononuclear/metabolism
- Lymphocyte Activation/drug effects
- Lymphocyte Activation/genetics
- Lymphocyte Activation/immunology
- Pollen
- Receptors, Complement/genetics
- Receptors, Complement/metabolism
- Rhinitis, Allergic, Seasonal/genetics
- Rhinitis, Allergic, Seasonal/immunology
- Rhinitis, Allergic, Seasonal/metabolism
Collapse
Affiliation(s)
- Alexessander Couto Alves
- Department of Epidemiology and Biostatistics, Imperial College London, MRC-HPA Centre for Environment and Health, Imperial College London, London, United Kingdom
| | - Sören Bruhn
- Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
| | - Adaikalavan Ramasamy
- Department of Epidemiology and Biostatistics, Imperial College London, MRC-HPA Centre for Environment and Health, Imperial College London, London, United Kingdom
- Department of Medical and Molecular Genetics, King's College London, London, United Kingdom
| | - Hui Wang
- Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
- Dept of Paediatrics, Gothenburg University, Gothenburg, Sweden
| | - John W. Holloway
- Human Development and Health, Faculty of Medicine, University of Southampton, Southampton, United Kingdom
| | - Anna-Liisa Hartikainen
- Department of Clinical Sciences, Obstetrics and Gynecology, Institute of Clinical Medicine, University of Oulu, Oulu, Finland
| | - Marjo-Riitta Jarvelin
- Department of Epidemiology and Biostatistics, Imperial College London, MRC-HPA Centre for Environment and Health, Imperial College London, London, United Kingdom
- Institute of Health Sciences, University of Oulu, and Unit of General Practice, University Hospital of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
- National Institute of Health and Welfare, Oulu, Finland
| | - Mikael Benson
- Department of Clinical and Experimental Medicine, Linköping University, Linköping, Sweden
| | - David J. Balding
- Department of Epidemiology and Biostatistics, Imperial College London, MRC-HPA Centre for Environment and Health, Imperial College London, London, United Kingdom
- Genetics Institute, University College London, United Kingdom
| | - Lachlan J. M. Coin
- Department of Genomics of Common Diseases, School of Public Health, Imperial College London, London, United Kingdom
- Institute for Molecular Bioscience, University of Queensland, Brisbane, Australia
| |
Collapse
|