1. Kruasuwan W, Sawatwong P, Jenjaroenpun P, Wankaew N, Arigul T, Yongkiettrakul S, Lunha K, Sudjai A, Siludjai D, Skaggs B, Wongsurawat T. Comparative evaluation of commercial DNA isolation approaches for nanopore-only bacterial genome assembly and plasmid recovery. Sci Rep 2024; 14:27672. [PMID: 39532954 PMCID: PMC11557978 DOI: 10.1038/s41598-024-78066-2] [Received: 01/20/2024] [Accepted: 10/28/2024] [Indexed: 11/16/2024] Open
Abstract
Oxford Nanopore Technologies sequencing has improved significantly in cost, accuracy, and read length, making it a cost-effective and readily accessible approach for analyzing microbial genomes. A major challenge for bacterial whole genome sequencing by Nanopore technology is the requirement for a higher quality and quantity of high molecular weight DNA compared to short-read sequencing platforms. In this study, using eight pathogenic bacteria, we evaluated the quality, quantity, and fragment size distribution of extracted DNA obtained from three different commercial DNA extraction kits and one automated robotic platform. Our results demonstrated significant variation in DNA yield and purity among the extraction kits. The ZymoBIOMICS DNA Miniprep Kit (ZM) provided a higher purity of DNA compared to other kit-based extractions. All kit-based DNA extractions were successfully performed on all twenty-four samples using a single MinION flow cell, with the Nanobind CBB Big DNA kit (NB) yielding the longest raw reads. The Fire Monkey HMW-DNA Extraction Kit (FM) and the automated Roche MagNaPure 96 platform (RO) outperformed the other methods in genome assembly, particularly in gram-negative bacteria. Based on our findings, we recommend a minimum read coverage and raw read N50, obtained from the appropriate DNA extraction kit for each bacterial species, to optimize genome assembly and plasmid recovery. This approach will assist end-users in selecting the most effective kit-based extraction method for bacterial whole-genome assembly using only long-read nanopore sequences.
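The two quantities the abstract recommends thresholds for, raw read N50 and read coverage, can be computed from a list of read lengths. The following is a minimal sketch under stated assumptions: the function names `read_n50` and `mean_coverage`, the toy read lengths, and the toy genome size are all hypothetical illustrations and are not taken from the paper, which does not state its thresholds in the abstract.

```python
def read_n50(lengths):
    """Return the read N50: the length L such that reads of length >= L
    together contain at least half of all sequenced bases."""
    if not lengths:
        raise ValueError("at least one read length is required")
    total = sum(lengths)
    running = 0
    # Walk from the longest read down until half of the bases are covered.
    for length in sorted(lengths, reverse=True):
        running += length
        if running * 2 >= total:
            return length


def mean_coverage(lengths, genome_size):
    """Return mean read coverage: total sequenced bases / genome size."""
    if genome_size <= 0:
        raise ValueError("genome_size must be positive")
    return sum(lengths) / genome_size


# Toy values for illustration only (hypothetical, not data from the study).
toy_reads = [8000, 12000, 3000, 20000, 7000]
toy_genome_size = 5000

print(read_n50(toy_reads))                        # 12000
print(mean_coverage(toy_reads, toy_genome_size))  # 10.0
```

In practice these values would be computed from the read lengths in a FASTQ file and compared against whatever species-specific minimums the full paper reports.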
Affiliation(s)
- Worarat Kruasuwan
  - Division of Medical Bioinformatics, Research Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Siriraj Long-read Lab (Si-LoL), Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
- Pongpun Sawatwong
  - Division of Global Health Protection, Ministry of Public Health-U.S. Center of Diseases Control and Prevention, Nonthaburi, Thailand
- Piroon Jenjaroenpun
  - Division of Medical Bioinformatics, Research Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Siriraj Long-read Lab (Si-LoL), Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA
- Natnicha Wankaew
  - Division of Medical Bioinformatics, Research Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Siriraj Long-read Lab (Si-LoL), Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
- Tantip Arigul
  - Division of Medical Bioinformatics, Research Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Siriraj Long-read Lab (Si-LoL), Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
- Suganya Yongkiettrakul
  - National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
- Kamonwan Lunha
  - National Center for Genetic Engineering and Biotechnology (BIOTEC), National Science and Technology Development Agency (NSTDA), Pathum Thani, Thailand
- Aunthikarn Sudjai
  - Division of Medical Bioinformatics, Research Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
- Duangkamon Siludjai
  - Division of Global Health Protection, Ministry of Public Health-U.S. Center of Diseases Control and Prevention, Nonthaburi, Thailand
- Beth Skaggs
  - Division of Global Health Protection, Ministry of Public Health-U.S. Center of Diseases Control and Prevention, Nonthaburi, Thailand
- Thidathip Wongsurawat
  - Division of Medical Bioinformatics, Research Department, Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Siriraj Long-read Lab (Si-LoL), Faculty of Medicine Siriraj Hospital, Mahidol University, Bangkok, Thailand
  - Department of Biomedical Informatics, University of Arkansas for Medical Sciences, Little Rock, AR, USA
2. Luo Y, Payne M, Kaur S, Octavia S, Lan R. Genomic evidence of two-staged transmission of the early seventh cholera pandemic. Nat Commun 2024; 15:8504. [PMID: 39353924 PMCID: PMC11445481 DOI: 10.1038/s41467-024-52800-w] [Received: 02/25/2024] [Accepted: 09/19/2024] [Indexed: 10/03/2024] Open
Abstract
The seventh cholera pandemic started in 1961 in Indonesia and spread across the world in three waves in the decades that followed. Here, we utilised genomic evidence to detail the first wave of the seventh pandemic. Genomes of 22 seventh pandemic Vibrio cholerae isolates from 1961 to 1979 were completely sequenced. Together with 152 publicly available genomes from the same period, they fell into seven phylogenetic clusters (CL1-CL7). By multilevel genome typing (MGT), all were assigned to MGT2 ST1 (Wave 1) except three isolates in CL7 which were typed as MGT2 ST2 (Wave 2). The Wave 1 seventh pandemic expanded in two stages, with Stage 1 (CL1-CL5) spread across Asia and Stage 2 (CL6 and CL7) spread to the Middle East and Africa. Three non-synonymous mutations, one each, in three regulatory genes, csrD (global regulator), acfB (chemotaxis), and luxO (quorum sensing) may have critically contributed to its pandemicity. The three MGT2 ST2 isolates in CL7 were the progenitors of Wave 2 and evolved from within Wave 1 with acquisition of a novel IncA/C plasmid. Our findings provide new insight into the evolution and transmission of the early seventh pandemic, which may aid future cholera prevention and control.
Affiliation(s)
- Yun Luo
  - School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- Michael Payne
  - School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- Sandeep Kaur
  - School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- Sophie Octavia
  - School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
- Ruiting Lan
  - School of Biotechnology and Biomolecular Sciences, University of New South Wales, Sydney, NSW, Australia
3. Waters EV, Lee WWY, Ismail Ahmed A, Chattaway MA, Langridge GC. From acute to persistent infection: revealing phylogenomic variations in Salmonella Agona. PLoS Pathog 2024; 20:e1012679. [PMID: 39480892 PMCID: PMC11556752 DOI: 10.1371/journal.ppat.1012679] [Received: 07/22/2024] [Revised: 11/12/2024] [Accepted: 10/17/2024] [Indexed: 11/02/2024] Open
Abstract
Salmonella enterica serovar Agona (S. Agona) has been increasingly recognised as a prominent cause of gastroenteritis. This serovar is a strong biofilm former that can undergo genome rearrangement and enter a viable but non-culturable state whilst remaining metabolically active. Similar strategies are employed by S. Typhi, the cause of typhoid fever, during human infection, which are believed to assist with the transition from acute infection to chronic carriage. Here we report S. Agona's ability to persist in people and examine factors that might be contributing to chronic carriage. A review of 2233 S. Agona isolates from UK infections (2004-2020) and associated carriage was undertaken, of which 1155 had short-read sequencing data available. A subset of 207 isolates was selected from different stages of acute and persistent infections within individual patients. The subset underwent long-read sequencing and genome structure (GS) analysis, as well as phenotyping assays including carbon source utilisation and biofilm formation. Associations between genotypes and phenotypes were investigated to compare acute infections to those which progress to chronic. GS analysis revealed the conserved arrangement GS1.0 in 195 isolates, and 8 additional GSs in 12 isolates. These rearranged isolates were typically associated with early, convalescent carriage (3 weeks to 3 months). We also identified an increase in SNP variation during this period of infection. We believe this increase in genome-scale and SNP variation reflects a population expansion after acute S. Agona infection, potentially reflecting an immune evasion mechanism which enables persistent infection to become established.
Affiliation(s)
- Emma V. Waters
  - Microbes and Food Safety, Quadram Institute Bioscience, Norwich, United Kingdom
  - Centre for Microbial Interactions, Norwich Research Park, Norwich, United Kingdom
- Winnie W. Y. Lee
  - Microbes and Food Safety, Quadram Institute Bioscience, Norwich, United Kingdom
  - School of Cellular and Molecular Medicine, University of Bristol, Bristol, United Kingdom
- Amina Ismail Ahmed
  - Gastrointestinal Bacteria Reference Unit, United Kingdom Health Security Agency, London, United Kingdom
- Marie-Anne Chattaway
  - Gastrointestinal Bacteria Reference Unit, United Kingdom Health Security Agency, London, United Kingdom
  - Genomic and Enabling Data Health Protection Research Unit, University of Warwick, Coventry, United Kingdom
- Gemma C. Langridge
  - Microbes and Food Safety, Quadram Institute Bioscience, Norwich, United Kingdom
  - Centre for Microbial Interactions, Norwich Research Park, Norwich, United Kingdom
4. Tam YL, Cameron S, Preston A, Cowley L. GWarrange: a pre- and post- genome-wide association studies pipeline for detecting phenotype-associated genome rearrangement events. Microb Genom 2024; 10:001268. [PMID: 38980151 PMCID: PMC11316554 DOI: 10.1099/mgen.0.001268] [Received: 01/18/2024] [Accepted: 06/17/2024] [Indexed: 07/10/2024] Open
Abstract
The use of k-mers to capture genetic variation in bacterial genome-wide association studies (bGWAS) has demonstrated its effectiveness in overcoming the plasticity of bacterial genomes by providing a comprehensive array of genetic variants in a genome set that is not confined to a single reference genome. However, little attempt has been made to interpret k-mers in the context of genome rearrangements, partly due to challenges in the exhaustive and high-throughput identification of genome structure and individual rearrangement events. Here, we present GWarrange, a pre- and post-bGWAS processing methodology that leverages the unique properties of k-mers to facilitate bGWAS for genome rearrangements. Repeat sequences are common instigators of genome rearrangements through intragenomic homologous recombination, and they are commonly found at rearrangement boundaries. Using whole-genome sequences, repeat sequences are replaced by short placeholder sequences, allowing the regions flanking repeats to be incorporated into relatively short k-mers. Then, locations of flanking regions in significant k-mers are mapped back to complete genome sequences to visualise genome rearrangements. Four case studies based on two bacterial species (Bordetella pertussis and Enterococcus faecium) and a simulated genome set are presented to demonstrate the ability to identify phenotype-associated rearrangements. GWarrange is available at https://github.com/DorothyTamYiLing/GWarrange.
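The placeholder idea described in this abstract can be illustrated with a toy example. This is a minimal sketch and not the GWarrange implementation: the function names `mask_repeats` and `kmers`, the placeholder string `NNNNN`, the toy sequences, and the choice of k = 21 are all assumptions made for illustration. It shows that once a long repeat is collapsed to a short placeholder, a short k-mer can contain sequence from both flanks of the repeat.

```python
def mask_repeats(genome, repeats, placeholder="NNNNN"):
    """Replace each repeat interval (0-based start, end-exclusive) with a
    short placeholder string. Intervals must not overlap."""
    pieces = []
    pos = 0
    for start, end in sorted(repeats):
        if start < pos:
            raise ValueError("repeat intervals must not overlap")
        pieces.append(genome[pos:start])  # sequence before this repeat
        pieces.append(placeholder)        # stand-in for the repeat itself
        pos = end
    pieces.append(genome[pos:])           # sequence after the last repeat
    return "".join(pieces)


def kmers(seq, k):
    """Return all overlapping substrings of length k."""
    return [seq[i:i + k] for i in range(len(seq) - k + 1)]


# Toy genome (hypothetical): two 11 bp flanks around a 300 bp repeat.
left = "ACGTACGGTCA"
repeat = "TTAGGC" * 50
right = "GGATCCATGCA"
genome = left + repeat + right

masked = mask_repeats(genome, [(len(left), len(left) + len(repeat))])

# k-mers that contain the whole placeholder plus bases from both flanks.
k = 21
spanning = [
    km for km in kmers(masked, k)
    if "NNNNN" in km and not km.startswith("N") and not km.endswith("N")
]
print(masked)
print(len(spanning))
```

In the unmasked toy genome no 21-mer can touch both flanks, because the 300 bp repeat is longer than k; after masking, every 21-mer of the 27-character masked sequence does. Mapping such k-mers back to complete genomes, as the abstract describes, is outside the scope of this sketch.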
Affiliation(s)
- Yi Ling Tam
  - The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
- Sarah Cameron
  - The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
- Andrew Preston
  - The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
- Lauren Cowley
  - The Milner Centre for Evolution and Department of Life Sciences, University of Bath, Claverton Down, Bath, BA2 7AY, UK
5. Li Z, Liu X, Ning N, Li T, Wang H. Diversity, Distribution, and Chromosomal Rearrangements of TRIP1 Repeat Sequences in Escherichia coli. Genes (Basel) 2024; 15:236. [PMID: 38397225 PMCID: PMC10888264 DOI: 10.3390/genes15020236] [Received: 01/17/2024] [Revised: 02/07/2024] [Accepted: 02/10/2024] [Indexed: 02/25/2024] Open
Abstract
The bacterial genome contains numerous repeated sequences that greatly affect its genomic plasticity. The Escherichia coli K-12 genome contains three copies of the TRIP1 repeat sequence (TRIP1a, TRIP1b, and TRIP1c). However, the diversity, distribution, and role of the TRIP1 repeat sequence in the E. coli genome are still unclear. In this study, after screening 6725 E. coli genomes, the TRIP1 repeat was found in the majority of E. coli strains (96%: 6454/6725). The copy number and direction of the TRIP1 repeat sequence varied in each genome. Overall, 2449 genomes (36%: 2449/6725) had three copies of TRIP1 (TRIP1a, TRIP1b, and TRIP1c), which is the same as E. coli K-12. Five types of TRIP1 repeats, including two new types (TRIP1d and TRIP1e), are identified in E. coli genomes, located in 4703, 3529, 5741, 1565, and 232 genomes, respectively. Each type of TRIP1 repeat is localized to a specific locus on the chromosome. TRIP1 repeats can cause intra-chromosomal rearrangements. A total of 156 rearrangement events were identified, of which 88% (137/156) were between TRIP1a and TRIP1c. These findings have important implications for future research on TRIP1 repeats.
Affiliation(s)
- Zhan Li
  - State Key Laboratory of Pathogens and Biosecurity, Beijing Institute of Microbiology and Epidemiology, No. 20 Dongda Street, Fengtai District, Beijing 100071, China
- Xiong Liu
  - Chinese PLA Center for Disease Control and Prevention, Dongda Street 20#, Fengtai District, Beijing 100071, China
- Nianzhi Ning
  - State Key Laboratory of Pathogens and Biosecurity, Beijing Institute of Microbiology and Epidemiology, No. 20 Dongda Street, Fengtai District, Beijing 100071, China
- Tao Li
  - State Key Laboratory of Pathogens and Biosecurity, Beijing Institute of Microbiology and Epidemiology, No. 20 Dongda Street, Fengtai District, Beijing 100071, China
- Hui Wang
  - State Key Laboratory of Pathogens and Biosecurity, Beijing Institute of Microbiology and Epidemiology, No. 20 Dongda Street, Fengtai District, Beijing 100071, China
6. Burciaga S, Trachsel JM, Sockett D, Aulik N, Monson MS, Anderson CL, Bearson SMD. Genomic and phenotypic comparison of two variants of multidrug-resistant Salmonella enterica serovar Heidelberg isolated during the 2015-2017 multi-state outbreak in cattle. Front Microbiol 2023; 14:1282832. [PMID: 37928690 PMCID: PMC10623430 DOI: 10.3389/fmicb.2023.1282832] [Received: 08/24/2023] [Accepted: 10/02/2023] [Indexed: 11/07/2023] Open
Abstract
Salmonella enterica subspecies enterica serovar Heidelberg (Salmonella Heidelberg) has caused several multistate foodborne outbreaks in the United States, largely associated with the consumption of poultry. However, a 2015-2017 multidrug-resistant (MDR) Salmonella Heidelberg outbreak was linked to contact with dairy beef calves. Traceback investigations revealed calves infected with outbreak strains of Salmonella Heidelberg exhibited symptoms of disease frequently followed by death from septicemia. To investigate virulence characteristics of Salmonella Heidelberg as a pathogen in cattle, two variants with distinct pulsed-field gel electrophoresis (PFGE) patterns that differed in morbidity and mortality during the multistate outbreak were genotypically and phenotypically characterized and compared. Strain SX 245 with PFGE pattern JF6X01.0523 was identified as a dominant and highly pathogenic variant causing high morbidity and mortality in affected calves, whereas strain SX 244 with PFGE pattern JF6X01.0590 was classified as a low pathogenic variant causing less morbidity and mortality. Comparison of whole-genome sequences determined that SX 245 lacked ~200 genes present in SX 244, including genes associated with the IncI1 plasmid and phages; SX 244 lacked eight genes present in SX 245 including a second YdiV Anti-FlhC(2)FlhD(4) factor, a lysin motif domain containing protein, and a pentapeptide repeat protein. RNA-sequencing revealed fimbriae-related, flagella-related, and chemotaxis genes had increased expression in SX 245 compared to SX 244. Furthermore, SX 245 displayed higher invasion of human and bovine epithelial cells than SX 244. These data suggest that the presence and up-regulation of genes involved in type 1 fimbriae production, flagellar regulation and biogenesis, and chemotaxis may play a role in the increased pathogenicity and host range expansion of the Salmonella Heidelberg isolates involved in the bovine-related outbreak.
Affiliation(s)
- Selma Burciaga
  - United States Department of Agriculture, Agriculture Research Services, National Animal Disease Center, Ames, IA, United States
  - Oak Ridge Institute for Science and Education (ORISE), ARS Research Participation Program, Oak Ridge, TN, United States
- Julian M. Trachsel
  - United States Department of Agriculture, Agriculture Research Services, National Animal Disease Center, Ames, IA, United States
- Donald Sockett
  - Wisconsin Veterinary Diagnostic Laboratory, University of Wisconsin, Madison, WI, United States
- Nicole Aulik
  - Wisconsin Veterinary Diagnostic Laboratory, University of Wisconsin, Madison, WI, United States
- Melissa S. Monson
  - United States Department of Agriculture, Agriculture Research Services, National Animal Disease Center, Ames, IA, United States
- Christopher L. Anderson
  - United States Department of Agriculture, Agriculture Research Services, National Animal Disease Center, Ames, IA, United States
- Shawn M. D. Bearson
  - United States Department of Agriculture, Agriculture Research Services, National Animal Disease Center, Ames, IA, United States