1
|
García-Bayona L, Said N, Coyne MJ, Flores K, Elmekki NM, Sheahan ML, Camacho AG, Hutt K, Yildiz FH, Kovács ÁT, Waldor MK, Comstock LE. A pervasive large conjugative plasmid mediates multispecies biofilm formation in the intestinal microbiota increasing resilience to perturbations. bioRxiv 2024:2024.04.29.590671. [PMID: 38746121 PMCID: PMC11092513 DOI: 10.1101/2024.04.29.590671] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
Although horizontal gene transfer is pervasive in the intestinal microbiota, we understand only superficially the roles of most exchanged genes and how the mobile repertoire affects community dynamics. Similarly, little is known about the mechanisms underlying the ability of a community to recover after a perturbation. Here, we identified and functionally characterized a large conjugative plasmid that is one of the most frequently transferred elements among Bacteroidales species and is ubiquitous in diverse human populations. This plasmid encodes both an extracellular polysaccharide and fimbriae, which promote the formation of multispecies biofilms in the mammalian gut. We use a hybridization-based approach to visualize biofilms in clarified whole colon tissue with unprecedented 3D spatial resolution. These biofilms increase bacterial survival to common stressors encountered in the gut, increasing strain resiliency, and providing a rationale for the plasmid's recent spread and high worldwide prevalence.
Collapse
|
2
|
Mirarab S, Rivas-González I, Feng S, Stiller J, Fang Q, Mai U, Hickey G, Chen G, Brajuka N, Fedrigo O, Formenti G, Wolf JBW, Howe K, Antunes A, Schierup MH, Paten B, Jarvis ED, Zhang G, Braun EL. A region of suppressed recombination misleads neoavian phylogenomics. Proc Natl Acad Sci U S A 2024; 121:e2319506121. [PMID: 38557186 PMCID: PMC11009670 DOI: 10.1073/pnas.2319506121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/14/2023] [Accepted: 02/07/2024] [Indexed: 04/04/2024] Open
Abstract
Genomes are typically mosaics of regions with different evolutionary histories. When speciation events are closely spaced in time, recombination makes the regions sharing the same history small, and the evolutionary history changes rapidly as we move along the genome. When examining rapid radiations such as the early diversification of Neoaves 66 Mya, typically no consistent history is observed across segments exceeding kilobases of the genome. Here, we report an exception. We found that a 21-Mb region in avian genomes, mapped to chicken chromosome 4, shows an extremely strong and discordance-free signal for a history different from that of the inferred species tree. Such a strong discordance-free signal, indicative of suppressed recombination across many millions of base pairs, is not observed elsewhere in the genome for any deep avian relationships. Although long regions with suppressed recombination have been documented in recently diverged species, our results pertain to relationships dating circa 65 Mya. We provide evidence that this strong signal may be due to an ancient rearrangement that blocked recombination and remained polymorphic for several million years prior to fixation. We show that the presence of this region has misled previous phylogenomic efforts with lower taxon sampling, showing the interplay between taxon and locus sampling. We predict that similar ancient rearrangements may confound phylogenetic analyses in other clades, pointing to a need for new analytical models that incorporate the possibility of such events.
Collapse
Affiliation(s)
- Siavash Mirarab
- Electrical and Computer Engineering Department, University of California, San Diego, CA95032
| | | | - Shaohong Feng
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou310058, China
- Liangzhu Laboratory, Zhejiang University, Hangzhou311121, China
| | - Josefin Stiller
- Section for Ecology & Evolution, Department of Biology, University of Copenhagen, København2100, Denmark
| | - Qi Fang
- BGI-Research, Shenzhen518083, China
| | - Uyen Mai
- Electrical and Computer Engineering Department, University of California, San Diego, CA95032
| | - Glenn Hickey
- Genomics Institute, University of California, Santa Cruz, CA96064
| | - Guangji Chen
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou310058, China
- Liangzhu Laboratory, Zhejiang University, Hangzhou311121, China
| | - Nadolina Brajuka
- Vertebrate Genome Lab, Rockefeller University, New York, NY10065
| | - Olivier Fedrigo
- Vertebrate Genome Lab, Rockefeller University, New York, NY10065
| | - Giulio Formenti
- Vertebrate Genome Lab, Rockefeller University, New York, NY10065
| | - Jochen B. W. Wolf
- Division of Evolutionary Biology, Faculty of Biology, Ludwig-Maximillians-Universität, Munich82152, Germany
| | - Kerstin Howe
- Tree of Life Division, Wellcome Sanger Institute, CambridgeCB10 1RQ, United Kingdom
| | - Agostinho Antunes
- Interdisciplinary Centre of Marine and Environmental Research, University of Porto, Porto4099-002, Portugal
- Department of Biology, Faculty of Sciences, University of Porto, Porto4099-002, Portugal
| | | | - Benedict Paten
- Genomics Institute, University of California, Santa Cruz, CA96064
| | - Erich D. Jarvis
- Vertebrate Genome Lab, Rockefeller University, New York, NY10065
| | - Guojie Zhang
- Center for Evolutionary & Organismal Biology, Zhejiang University School of Medicine, Hangzhou310058, China
| | - Edward L. Braun
- Department of Biology, University of Florida, Gainesville, FL32611
| |
Collapse
|
3
|
Legeai F, Romain S, Capblancq T, Doniol-Valcroze P, Joron M, Lemaitre C, Després L. Chromosome-Level Assembly and Annotation of the Pearly Heath Coenonympha arcania Butterfly Genome. Genome Biol Evol 2024; 16:evae055. [PMID: 38491969 PMCID: PMC10980516 DOI: 10.1093/gbe/evae055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Revised: 03/07/2024] [Accepted: 03/13/2024] [Indexed: 03/18/2024] Open
Abstract
We present the first chromosome-level genome assembly and annotation of the pearly heath Coenonympha arcania, generated with a PacBio HiFi sequencing approach and complemented with Hi-C data. We additionally compare synteny, gene, and repeat content between C. arcania and other Lepidopteran genomes. This reference genome will enable future population genomics studies with Coenonympha butterflies, a species-rich genus that encompasses some of the most highly endangered butterfly taxa in Europe.
Collapse
Affiliation(s)
- Fabrice Legeai
- Inria, CNRS, IRISA, University of Rennes, 35000 Rennes, France
- IGEPP, INRAE, Institut Agro, University of Rennes, 35653 Le Rheu, France
| | - Sandra Romain
- Inria, CNRS, IRISA, University of Rennes, 35000 Rennes, France
| | - Thibaut Capblancq
- LECA, CNRS, Université Grenoble-Alpes, Université Savoie Mont Blanc, Grenoble, France
| | | | - Mathieu Joron
- CEFE, CNRS, EPHE, IRD, Université de Montpellier, Montpellier, France
| | - Claire Lemaitre
- Inria, CNRS, IRISA, University of Rennes, 35000 Rennes, France
| | - Laurence Després
- LECA, CNRS, Université Grenoble-Alpes, Université Savoie Mont Blanc, Grenoble, France
| |
Collapse
|
4
|
Do V, Nguyen S, Le D, Nguyen T, Nguyen C, Ho T, Vo N, Nguyen T, Nguyen H, Cao M. Pasa: leveraging population pangenome graph to scaffold prokaryote genome assemblies. Nucleic Acids Res 2024; 52:e15. [PMID: 38084888 PMCID: PMC10853769 DOI: 10.1093/nar/gkad1170] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2023] [Revised: 11/07/2023] [Accepted: 11/22/2023] [Indexed: 02/10/2024] Open
Abstract
Whole genome sequencing has increasingly become the essential method for studying the genetic mechanisms of antimicrobial resistance and for surveillance of drug-resistant bacterial pathogens. The majority of bacterial genomes sequenced to date have been sequenced with Illumina sequencing technology, owing to its high-throughput, excellent sequence accuracy, and low cost. However, because of the short-read nature of the technology, these assemblies are fragmented into large numbers of contigs, hindering the obtaining of full information of the genome. We develop Pasa, a graph-based algorithm that utilizes the pangenome graph and the assembly graph information to improve scaffolding quality. By leveraging the population information of the bacteria species, Pasa is able to utilize the linkage information of the gene families of the species to resolve the contig graph of the assembly. We show that our method outperforms the current state of the arts in terms of accuracy, and at the same time, is computationally efficient to be applied to a large number of existing draft assemblies.
Collapse
Affiliation(s)
- Van Hoan Do
- Center for Applied Mathematics and Informatics, Le Quy Don Technical University, Hanoi, Vietnam
| | | | - Duc Quang Le
- Faculty of IT, Hanoi University of Civil Engineering, Hanoi, Vietnam
| | - Tam Thi Nguyen
- Oxford University Clinical Research Unit, Hanoi, Vietnam
| | - Canh Hao Nguyen
- Bioinformatics Center, Institute for Chemical Research, Kyoto University, Japan
| | - Tho Huu Ho
- Department of Medical Microbiology, The 103 Military Hospital, Vietnam Military Medical University, Hanoi, Vietnam
- Department of Genomics & Cytogenetics, Institute of Biomedicine & Pharmacy, Vietnam Military Medical University, Hanoi, Vietnam
| | - Nam S Vo
- Center for Biomedical Informatics, Vingroup Big Data Institute, Hanoi, Vietnam
| | | | | | | |
Collapse
|
5
|
Calamari ZT, Song A, Cohen E, Akter M, Roy RD, Hallikas O, Christensen MM, Li P, Marangoni P, Jernvall J, Klein OD. Conserved and derived expression patterns and positive selection on dental genes reveal complex evolutionary context of ever-growing rodent molars. bioRxiv 2023:2023.12.18.572015. [PMID: 38187646 PMCID: PMC10769287 DOI: 10.1101/2023.12.18.572015] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 01/09/2024]
Abstract
Background Continuously growing teeth are an important innovation in mammalian evolution, yet genetic regulation of continuous growth by stem cells remains incompletely understood. Dental stem cells are lost at the onset of tooth root formation, but this loss of continuous crown growth is difficult to study in the mouse because regulatory signaling overlaps with signals that pattern tooth size and shape. Within the voles (Cricetidae, Rodentia, Glires), species have evolved both rooted and unrooted molars that have similar size and shape. We assembled a de novo genome of Myodes glareolus, a vole with high-crowned, rooted molars, and performed genomic and transcriptomic analyses in a broad phylogenetic context of Glires (rodents and lagomorphs) to assess differential selection and evolution in tooth forming genes. Results Our de novo genome recovered 91% of single-copy orthologs for Euarchontoglires and had a total length of 2.44 Gigabases, enabling genomic and transcriptomic analyses. We identified six dental genes undergoing positive selection across Glires and two genes undergoing positive selection in species with unrooted molars, Dspp and Aqp1. Transcriptomics analyses demonstrated conserved patterns of dental gene expression with species-specific variation likely related to developmental timing and morphological differences between mouse and vole molars. Conclusions Our results support ongoing dental gene evolution in rodents with unrooted molars. We identify candidate genes for further functional analyses, particularly Dspp, which plays an important role in mineralizing tissues. Our expression results support conservation of dental genes between voles and model species like mice, while revealing significant effects of overall tooth morphology on gene expression.
Collapse
Affiliation(s)
- Zachary T. Calamari
- Baruch College, City University of New York, One Bernard Baruch Way, New York, NY 10010, USA
- The Graduate Center, City University of New York, 365 Fifth Ave, New York, NY 10016, USA
- Program in Craniofacial Biology and Department of Orofacial Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Division of Paleontology, American Museum of Natural History, Central Park West at 79 Street, New York, NY, 10024, USA
| | - Andrew Song
- Baruch College, City University of New York, One Bernard Baruch Way, New York, NY 10010, USA
- Cornell University, 616 Thurston Ave, Ithaca, NY 14853, USA
| | - Emily Cohen
- Baruch College, City University of New York, One Bernard Baruch Way, New York, NY 10010, USA
- New York University College of Dentistry, 345 E 34 St, New York, NY 10010
| | - Muspika Akter
- Baruch College, City University of New York, One Bernard Baruch Way, New York, NY 10010, USA
| | - Rishi Das Roy
- Institute of Biotechnology, University of Helsinki, FI-00014 Helsinki, Finland
| | - Outi Hallikas
- Institute of Biotechnology, University of Helsinki, FI-00014 Helsinki, Finland
| | - Mona M. Christensen
- Institute of Biotechnology, University of Helsinki, FI-00014 Helsinki, Finland
| | - Pengyang Li
- Program in Craniofacial Biology and Department of Orofacial Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Pediatrics, Cedars-Sinai Guerin Children’s, 8700 Beverly Blvd., Suite 2416, Los Angeles, CA 90048, USA
| | - Pauline Marangoni
- Program in Craniofacial Biology and Department of Orofacial Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Pediatrics, Cedars-Sinai Guerin Children’s, 8700 Beverly Blvd., Suite 2416, Los Angeles, CA 90048, USA
| | - Jukka Jernvall
- Institute of Biotechnology, University of Helsinki, FI-00014 Helsinki, Finland
- Department of Geosciences and Geography, University of Helsinki, FI-00014 Helsinki, Finland
| | - Ophir D. Klein
- Program in Craniofacial Biology and Department of Orofacial Sciences, University of California, San Francisco, San Francisco, CA 94158, USA
- Department of Pediatrics, Cedars-Sinai Guerin Children’s, 8700 Beverly Blvd., Suite 2416, Los Angeles, CA 90048, USA
| |
Collapse
|
6
|
Liu K, Xie N, Wang Y, Liu X. The Utilization of Reference-Guided Assembly and In Silico Libraries Improves the Draft Genome of Clarias batrachus and Culter alburnus. Mar Biotechnol (NY) 2023; 25:907-917. [PMID: 37661218 DOI: 10.1007/s10126-023-10248-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/19/2023] [Accepted: 08/28/2023] [Indexed: 09/05/2023]
Abstract
Long-read sequencing technologies can generate highly contiguous genome assemblies compared to short-read methods. However, their higher cost often poses a significant barrier. To address this, we explore the utilization of mapping-based genome assembly and reference-guided assembly as cost-effective alternative approaches. We assess the efficacy of these approaches in improving the contiguity of Clarias batrachus and Culter alburnus draft genomes. Our findings demonstrate that employing an iterative mapping strategy leads to a reduction in assembly errors. Specifically, after three iterations, the Mismatches per 100 kbp value for the C. batrachus genome decreased from 2447.20 to 2432.67, reaching a minimum of 2422.67 after two iterations. Additionally, the N50 value for the C. batrachus genome increased from 362,143 to 1,315,126 bp, with a maximum of 1,315,403 bp after two iterations. Furthermore, we achieved Mismatches per 100 kbp values of 3.70 for the reference-guided assembly of C. batrachus and 0.34 for C. alburnus. Correspondingly, the N50 value for the C. batrachus and C. alburnus genomes increased from 362,143 bp and 3,686,385 bp to 2,026,888 bp and 43,735,735 bp, respectively. Finally, we successfully utilized the improved C. batrachus and C. alburnus genomes to compare genome studies using the combined approach of Ragout and Ragtag. Through a comprehensive comparative analysis of mapping-based and reference-guided genome assembly methods, we shed light on the specific contributions of reference-guided assembly in reducing assembly errors and improving assembly continuity and integrity. These advancements establish reference-guided assembly and the utilization of in silico libraries as a promising and suitable approach for comparative genomics studies.
Collapse
Affiliation(s)
- Kai Liu
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, 310024, China.
| | - Nan Xie
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, 310024, China
| | - Yuxi Wang
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, 310024, China
| | - Xinyi Liu
- Institute of Fishery Science, Hangzhou Academy of Agricultural Sciences, Hangzhou, 310024, China
| |
Collapse
|
7
|
Didik T, Yau APY, Cheung HL, Lee SY, Chan NH, Wah YT, Luk HKH, Choi GKY, Cheng NHY, Tse H, Li Y, Wong SCY, Lung DC. Long-range air dispersion of Candida auris in a cardiothoracic unit outbreak in Hong Kong. J Hosp Infect 2023; 142:105-114. [PMID: 37806452 DOI: 10.1016/j.jhin.2023.09.019] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2023] [Revised: 09/21/2023] [Accepted: 09/24/2023] [Indexed: 10/10/2023]
Abstract
BACKGROUND Nosocomial outbreaks of Candida auris, a multidrug-resistant fungus, are increasingly reported worldwide; the mode of transmission has usually been reported to be via direct contact. Some studies previously suggested potential short-distance air dispersal during high-turbulence activities, but evidence on long-range air dispersal remains scarce. AIM To describe a C. auris nosocomial outbreak involving two wards (H7, 5E) in two local hospitals. METHODS Samples were taken from patients, ward surfaces (frequently touched items and non-reachable surfaces) while settle plates were used for passive air sampling to investigate possible contributions by direct contact and air dispersal. Epidemiological and phylogenetic analyses were also performed on the C. auris isolates from this outbreak. FINDINGS Eighteen patients were confirmed to have asymptomatic C. auris skin colonization. C. auris was expectedly identified in samplings from frequently touched ward items but was also isolated in two samples from ceiling supply air grilles which were 2.4 m high and inaccessible by patients. Moreover, one sample from a corridor return air grille as far as 9.8 m away from the C. auris cohort area was also positive. Two passive air samplings were positive, including one from a cubicle with no confirmed cases for four days, suggesting possible air dispersal of C. auris. Whole-genome sequencing confirmed clonality of air, environment, and patients' isolates. CONCLUSION This is the first study to demonstrate potential long-range air dispersal of C. auris in an open-cubicle ward setting. Ventilation precautions and decontamination of out-of-reach high-level surfaces should be considered in C. auris outbreak management.
Collapse
Affiliation(s)
- T Didik
- Department of Pathology, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China; Department of Pathology, Hong Kong Children's Hospital, Hong Kong Special Administrative Region, China
| | - A P-Y Yau
- Department of Respiratory Medicine, Kowloon Hospital, Hong Kong Special Administrative Region, China
| | - H L Cheung
- Department of Cardiothoracic Surgery, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China
| | - S-Y Lee
- Infection Control Team, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China
| | - N-H Chan
- Infection Control Team, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China
| | - Y-T Wah
- Infection Control Team, Kowloon Hospital, Hong Kong Special Administrative Region, China
| | - H K-H Luk
- Department of Pathology, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China
| | - G K-Y Choi
- Department of Pathology, Hong Kong Children's Hospital, Hong Kong Special Administrative Region, China
| | - N H-Y Cheng
- Department of Pathology, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China
| | - H Tse
- Department of Laboratory Medicine, Khoo Teck Puat Hospital, Singapore
| | - Y Li
- Department of Mechanical Engineering, Faculty of Engineering, The University of Hong Kong, Hong Kong Special Administrative Region, China
| | - S C Y Wong
- Department of Pathology, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China; Department of Pathology, Hong Kong Children's Hospital, Hong Kong Special Administrative Region, China
| | - D C Lung
- Department of Pathology, Queen Elizabeth Hospital, Hong Kong Special Administrative Region, China; Department of Pathology, Hong Kong Children's Hospital, Hong Kong Special Administrative Region, China.
| |
Collapse
|
8
|
Gellman RH, Olm MR, Terrapon N, Enam F, Higginbottom SK, Sonnenburg JL, Sonnenburg ED. Hadza Prevotella require diet-derived microbiota-accessible carbohydrates to persist in mice. Cell Rep 2023; 42:113233. [PMID: 38510311 PMCID: PMC10954246 DOI: 10.1016/j.celrep.2023.113233] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/22/2024] Open
Abstract
Industrialization has transformed the gut microbiota, reducing the prevalence of Prevotella relative to Bacteroides. Here, we isolate Bacteroides and Prevotella strains from the microbiota of Hadza hunter-gatherers in Tanzania, a population with high levels of Prevotella. We demonstrate that plant-derived microbiota-accessible carbohydrates (MACs) are required for persistence of Prevotella copri but not Bacteroides thetaiotaomicron in vivo. Differences in carbohydrate metabolism gene content, expression, and in vitro growth reveal that Hadza Prevotella strains specialize in degrading plant carbohydrates, while Hadza Bacteroides isolates use both plant and host-derived carbohydrates, a difference mirrored in Bacteroides from non-Hadza populations. When competing directly, P. copri requires plant-derived MACs to maintain colonization in the presence of B. thetaiotaomicron, as a no-MAC diet eliminates P. copri colonization. Prevotella's reliance on plant-derived MACs and Bacteroides' ability to use host mucus carbohydrates could explain the reduced prevalence of Prevotella in populations consuming a low-MAC, industrialized diet.
Collapse
Affiliation(s)
- Rebecca H. Gellman
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Matthew R. Olm
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Nicolas Terrapon
- Architecture et Fonction des Macromolé cules Biologiques, INRAE, CNRS, Aix-Marseille Université, Marseille, France
| | - Fatima Enam
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Steven K. Higginbottom
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Justin L. Sonnenburg
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
- Center for Human Microbiome Studies, Stanford University School of Medicine, Stanford, CA, USA
| | - Erica D. Sonnenburg
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
- Center for Human Microbiome Studies, Stanford University School of Medicine, Stanford, CA, USA
- Lead contact
| |
Collapse
|
9
|
Signor S, Vedanayagam J, Kim BY, Wierzbicki F, Kofler R, Lai EC. Rapid evolutionary diversification of the flamenco locus across simulans clade Drosophila species. PLoS Genet 2023; 19:e1010914. [PMID: 37643184 PMCID: PMC10495008 DOI: 10.1371/journal.pgen.1010914] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2023] [Revised: 09/11/2023] [Accepted: 08/09/2023] [Indexed: 08/31/2023] Open
Abstract
Suppression of transposable elements (TEs) is paramount to maintain genomic integrity and organismal fitness. In D. melanogaster, the flamenco locus is a master suppressor of TEs, preventing the mobilization of certain endogenous retrovirus-like TEs from somatic ovarian support cells to the germline. It is transcribed by Pol II as a long (100s of kb), single-stranded, primary transcript, and metabolized into ~24-32 nt Piwi-interacting RNAs (piRNAs) that target active TEs via antisense complementarity. flamenco is thought to operate as a trap, owing to its high content of recent horizontally transferred TEs that are enriched in antisense orientation. Using newly-generated long read genome data, which is critical for accurate assembly of repetitive sequences, we find that flamenco has undergone radical transformations in sequence content and even copy number across simulans clade Drosophilid species. Drosophila simulans flamenco has duplicated and diverged, and neither copy exhibits synteny with D. melanogaster beyond the core promoter. Moreover, flamenco organization is highly variable across D. simulans individuals. Next, we find that D. simulans and D. mauritiana flamenco display signatures of a dual-stranded cluster, with ping-pong signals in the testis and/or embryo. This is accompanied by increased copy numbers of germline TEs, consistent with these regions operating as functional dual-stranded clusters. Overall, the physical and functional diversity of flamenco orthologs is testament to the extremely dynamic consequences of TE arms races on genome organization, not only amongst highly related species, but even amongst individuals.
Collapse
Affiliation(s)
- Sarah Signor
- Biological Sciences, North Dakota State University, Fargo, North Dakota, United States of America
| | - Jeffrey Vedanayagam
- Developmental Biology Program, Sloan-Kettering Institute, New York, New York, United States of America
- Department of Neuroscience, Developmental and Regenerative Biology, University of Texas at San Antonio, Texas, United States of America
| | - Bernard Y. Kim
- Department of Biology, Stanford University, Stanford, California, United States of America
| | - Filip Wierzbicki
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
- Vienna Graduate School of Population Genetics, Vienna, Austria
| | - Robert Kofler
- Institut für Populationsgenetik, Vetmeduni Vienna, Vienna, Austria
| | - Eric C. Lai
- Developmental Biology Program, Sloan-Kettering Institute, New York, New York, United States of America
| |
Collapse
|
10
|
Luo J, Guan T, Chen G, Yu Z, Zhai H, Yan C, Luo H. SLHSD: hybrid scaffolding method based on short and long reads. Brief Bioinform 2023; 24:7152317. [PMID: 37141142 DOI: 10.1093/bib/bbad169] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/03/2022] [Revised: 01/08/2023] [Accepted: 04/12/2023] [Indexed: 05/05/2023] Open
Abstract
In genome assembly, scaffolding can obtain more complete and continuous scaffolds. Current scaffolding methods usually adopt one type of read to construct a scaffold graph and then orient and order contigs. However, scaffolding with the strengths of two or more types of reads seems to be a better solution to some tricky problems. Combining the advantages of different types of data is significant for scaffolding. Here, a hybrid scaffolding method (SLHSD) is present that simultaneously leverages the precision of short reads and the length advantage of long reads. Building an optimal scaffold graph is an important foundation for getting scaffolds. SLHSD uses a new algorithm that combines long and short read alignment information to determine whether to add an edge and how to calculate the edge weight in a scaffold graph. In addition, SLHSD develops a strategy to ensure that edges with high confidence can be added to the graph with priority. Then, a linear programming model is used to detect and remove remaining false edges in the graph. We compared SLHSD with other scaffolding methods on five datasets. Experimental results show that SLHSD outperforms other methods. The open-source code of SLHSD is available at https://github.com/luojunwei/SLHSD.
Collapse
Affiliation(s)
- Junwei Luo
- School of Software, Henan Polytechnic University, Jiaozuo 454003, China
| | - Ting Guan
- School of Software, Henan Polytechnic University, Jiaozuo 454003, China
| | - Guolin Chen
- School of Software, Henan Polytechnic University, Jiaozuo 454003, China
| | - Zhonghua Yu
- School of Software, Henan Polytechnic University, Jiaozuo 454003, China
| | - Haixia Zhai
- School of Software, Henan Polytechnic University, Jiaozuo 454003, China
| | - Chaokun Yan
- School of Computer and Information Engineering, Henan University, Kaifeng 475001, China
| | - Huimin Luo
- School of Computer and Information Engineering, Henan University, Kaifeng 475001, China
| |
Collapse
|
11
|
Gauthier J, Meier J, Legeai F, McClure M, Whibley A, Bretaudeau A, Boulain H, Parrinello H, Mugford ST, Durbin R, Zhou C, McCarthy S, Wheat CW, Piron-Prunier F, Monsempes C, François MC, Jay P, Noûs C, Persyn E, Jacquin-Joly E, Meslin C, Montagné N, Lemaitre C, Elias M. First chromosome scale genomes of ithomiine butterflies (Nymphalidae: Ithomiini): Comparative models for mimicry genetic studies. Mol Ecol Resour 2023; 23:872-885. [PMID: 36533297 DOI: 10.1111/1755-0998.13749] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/20/2022] [Revised: 11/30/2022] [Accepted: 12/05/2022] [Indexed: 12/23/2022]
Abstract
The ithomiine butterflies (Nymphalidae: Danainae) represent the largest known radiation of Müllerian mimetic butterflies. They dominate by number the mimetic butterfly communities, which include species such as the iconic neotropical Heliconius genus. Recent studies on the ecology and genetics of speciation in Ithomiini have suggested that sexual pheromones, colour pattern and perhaps hostplant could drive reproductive isolation. However, no reference genome was available for Ithomiini, which has hindered further exploration on the genetic architecture of these candidate traits, and more generally on the genomic patterns of divergence. Here, we generated high-quality, chromosome-scale genome assemblies for two Melinaea species, M. marsaeus and M. menophilus, and a draft genome of the species Ithomia salapia. We obtained genomes with a size ranging from 396 to 503 Mb across the three species and scaffold N50 of 40.5 and 23.2 Mb for the two chromosome-scale assemblies. Using collinearity analyses we identified massive rearrangements between the two closely related Melinaea species. An annotation of transposable elements and gene content was performed, as well as a specialist annotation to target chemosensory genes, which is crucial for host plant detection and mate recognition in mimetic species. A comparative genomic approach revealed independent gene expansions in ithomiines and particularly in gustatory receptor genes. These first three genomes of ithomiine mimetic butterflies constitute a valuable addition and a welcome comparison to existing biological models such as Heliconius, and will enable further understanding of the mechanisms of adaptation in butterflies.
Collapse
Affiliation(s)
| | - Joana Meier
- Department of Zoology, University of Cambridge, Cambridge, UK
| | - Fabrice Legeai
- BIPAA, IGEPP, INRAE, Institut Agro, Univ Rennes, Rennes, France
- Univ Rennes, Inria, CNRS, IRISA, Rennes, France
| | - Melanie McClure
- Institut Systématique Évolution Biodiversité (ISYEB), Centre National de la Recherche Scientifique, MNHN, EPHE, Sorbonne Université, Université des Antilles, Paris, France
- Laboratoire Écologie, Évolution, Interactions des Systèmes Amazoniens (LEEISA), Université de Guyane, CNRS, IFREMER, Cayenne, France
| | - Annabel Whibley
- School of Biological Sciences, University of Auckland, Auckland, New Zealand
| | - Anthony Bretaudeau
- BIPAA, IGEPP, INRAE, Institut Agro, Univ Rennes, Rennes, France
- Univ Rennes, Inria, CNRS, IRISA, Rennes, France
| | - Hélène Boulain
- Department of Ecology and Evolution, University of Lausanne, Lausanne, Switzerland
| | - Hugues Parrinello
- MGX-Montpellier GenomiX, Univ. Montpellier, CNRS, INSERM, Montpellier, France
| | - Sam T Mugford
- Department of Crop Genetics, John Innes Centre, Norwich Research Park, Norwich, UK
| | - Richard Durbin
- Department of Genetics, University of Cambridge, Cambridge, UK
- Tree of Life Programme, Wellcome Sanger Institute, Hinxton, UK
| | - Chenxi Zhou
- Department of Genetics, University of Cambridge, Cambridge, UK
- Tree of Life Programme, Wellcome Sanger Institute, Hinxton, UK
| | - Shane McCarthy
- Department of Genetics, University of Cambridge, Cambridge, UK
- Tree of Life Programme, Wellcome Sanger Institute, Hinxton, UK
| | | | - Florence Piron-Prunier
- Institut Systématique Évolution Biodiversité (ISYEB), Centre National de la Recherche Scientifique, MNHN, EPHE, Sorbonne Université, Université des Antilles, Paris, France
| | - Christelle Monsempes
- Institute of Ecology and Environmental Sciences of Paris, Sorbonne Université, INRAE, CNRS, IRD, UPEC, Université de Paris, Paris, France
| | - Marie-Christine François
- Institute of Ecology and Environmental Sciences of Paris, Sorbonne Université, INRAE, CNRS, IRD, UPEC, Université de Paris, Paris, France
| | - Paul Jay
- Ecologie Systématique Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay, Orsay, France
| | | | - Emma Persyn
- Institute of Ecology and Environmental Sciences of Paris, Sorbonne Université, INRAE, CNRS, IRD, UPEC, Université de Paris, Paris, France
- CIRAD, UMR PVBMT, St Pierre, France
| | - Emmanuelle Jacquin-Joly
- Institute of Ecology and Environmental Sciences of Paris, Sorbonne Université, INRAE, CNRS, IRD, UPEC, Université de Paris, Paris, France
| | - Camille Meslin
- Institute of Ecology and Environmental Sciences of Paris, Sorbonne Université, INRAE, CNRS, IRD, UPEC, Université de Paris, Paris, France
| | - Nicolas Montagné
- Institute of Ecology and Environmental Sciences of Paris, Sorbonne Université, INRAE, CNRS, IRD, UPEC, Université de Paris, Paris, France
| | | | - Marianne Elias
- Institut Systématique Évolution Biodiversité (ISYEB), Centre National de la Recherche Scientifique, MNHN, EPHE, Sorbonne Université, Université des Antilles, Paris, France
| |
Collapse
|
12
|
Dimos B, Phelps M. A homology guide for Pacific salmon genus Oncorhynchus resolves patterns of ohnolog retention, resolution and local adaptation following the salmonid-specific whole-genome duplication event. Ecol Evol 2023; 13:e9994. [PMID: 37091557 PMCID: PMC10119027 DOI: 10.1002/ece3.9994] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/06/2022] [Revised: 03/17/2023] [Accepted: 03/21/2023] [Indexed: 04/25/2023] Open
Abstract
Salmonid fishes have emerged as a tractable model to study whole-genome duplications (WGDs) as this group has undergone four rounds of WGDs. While most of the salmonid genome has returned to a diploid state, a significant proportion of genes are maintained as duplicates and are referred to as ohnologs. The fact that much of the modern salmonid gene repertoire is comprised of ohnologs, while other genes have returned to their singleton state creates complications for genetic studies by obscuring homology relationships. The difficulty this creates is particularly prominent in Pacific salmonids belonging to genus Oncorhynchus who are the focus of intense genetics-based conservation and management efforts owing to the important ecological and cultural roles these fish play. To address this gap, we generated a homology guide for six species of Oncorhynchus with available genomes and used this guide to describe patterns of ohnolog retention and resolution. Overall, we find that ohnologs comprise approximately half of each species modern gene repertoires, which are functionally enriched for genes involved in DNA binding, while the less numerous singleton genes are heavily enriched in dosage-sensitive processes such as mitochondrial metabolism. Additionally, by reanalyzing published expression data from locally adapted strains of O. mykiss, we show that numerous ohnologs exhibit adaptive expression profiles; however, ohnologs are not more likely to display adaptive signatures than either paralogs or singletons. Finally, we demonstrate the utility of our homology guide by investigating the evolutionary relationship among genes highlighted as playing a role in salmonid life-history traits or gene editing targets.
Collapse
Affiliation(s)
- Bradford Dimos
- Department of Animal SciencesWashington State UniversityPullmanWashingtonUSA
| | - Michael Phelps
- Department of Animal SciencesWashington State UniversityPullmanWashingtonUSA
| |
Collapse
|
13
|
Dabernig-Heinz J, Wagner GE, Leitner E, Ruppitsch W, Steinmetz I, Högenauer C, Zechner EL, Kienesberger S. Complete Genome Sequence of Klebsiella oxytoca Strain AHC-6, Isolated from a Patient during Acute Antibiotic-Associated Hemorrhagic Colitis. Microbiol Resour Announc 2023;:e0135022. [PMID: 36926996 DOI: 10.1128/mra.01350-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/18/2023] Open
Abstract
Klebsiella oxytoca is a ubiquitous bacterium that is increasingly associated with inflammatory diseases. Here, we report the hybrid assembled genome for cytotoxic K. oxytoca strain AHC-6. The genome comprises a total of 5.7 Mbp, with a GC content of 55.2% and 5,258 coding sequences after assembly and annotation.
Collapse
|
14
|
Gellman RH, Olm MR, Terrapon N, Enam F, Higginbottom SK, Sonnenburg JL, Sonnenburg ED. Hadza Prevotella Require Diet-derived Microbiota Accessible Carbohydrates to Persist in Mice. bioRxiv 2023:2023.03.08.531063. [PMID: 36945614 PMCID: PMC10028851 DOI: 10.1101/2023.03.08.531063] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/12/2023]
Abstract
Industrialization has transformed the gut microbiota, reducing the prevalence of Prevotella relative to Bacteroides. Here, we isolate Bacteroides and Prevotella strains from the microbiota of Hadza hunter-gatherers of Tanzania, a population with high levels of Prevotella. We demonstrate that plant-derived microbiota-accessible carbohydrates (MACs) are required for persistence of Prevotella copri but not Bacteroides thetaiotaomicron in vivo. Differences in carbohydrate metabolism gene content, expression, and in vitro growth reveal that Hadza Prevotella strains specialize in degrading plant carbohydrates, while Hadza Bacteroides isolates use both plant and host-derived carbohydrates, a difference mirrored in Bacteroides from non-Hadza populations. When competing directly, P. copri requires plant-derived MACs to maintain colonization in the presence of B. thetaiotaomicron, as a no MAC diet eliminates P. copri colonization. Prevotella's reliance on plant-derived MACs and Bacteroides' ability to use host mucus carbohydrates could explain the reduced prevalence of Prevotella in populations consuming a low-MAC, industrialized diet.
Collapse
Affiliation(s)
- Rebecca H Gellman
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Matthew R Olm
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Nicolas Terrapon
- Architecture et Fonction des Macromolécules Biologiques, INRAE, CNRS, Aix-Marseille Université, Marseille, France
| | - Fatima Enam
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Steven K Higginbottom
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| | - Justin L Sonnenburg
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
- Chan Zuckerberg Biohub, San Francisco, CA, USA
- Center for Human Microbiome Studies, Stanford University School of Medicine, Stanford, CA, USA
| | - Erica D Sonnenburg
- Department of Microbiology and Immunology, Stanford University School of Medicine, Stanford, CA, USA
| |
Collapse
|
15
|
Peykov S, Strateva T. Whole-Genome Sequencing-Based Resistome Analysis of Nosocomial Multidrug-Resistant Non-Fermenting Gram-Negative Pathogens from the Balkans. Microorganisms 2023; 11:microorganisms11030651. [PMID: 36985224 PMCID: PMC10051916 DOI: 10.3390/microorganisms11030651] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2023] [Revised: 02/28/2023] [Accepted: 03/01/2023] [Indexed: 03/06/2023] Open
Abstract
Non-fermenting Gram-negative bacilli (NFGNB), such as Pseudomonas aeruginosa and Acinetobacter baumannii, are among the major opportunistic pathogens involved in the global antibiotic resistance epidemic. They are designated as urgent/serious threats by the Centers for Disease Control and Prevention and are part of the World Health Organization’s list of critical priority pathogens. Also, Stenotrophomonas maltophilia is increasingly recognized as an emerging cause for healthcare-associated infections in intensive care units, life-threatening diseases in immunocompromised patients, and severe pulmonary infections in cystic fibrosis and COVID-19 individuals. The last annual report of the ECDC showed drastic differences in the proportions of NFGNB with resistance towards key antibiotics in different European Union/European Economic Area countries. The data for the Balkans are of particular concern, indicating more than 80% and 30% of invasive Acinetobacter spp. and P. aeruginosa isolates, respectively, to be carbapenem-resistant. Moreover, multidrug-resistant and extensively drug-resistant S. maltophilia from the region have been recently reported. The current situation in the Balkans includes a migrant crisis and reshaping of the Schengen Area border. This results in collision of diverse human populations subjected to different protocols for antimicrobial stewardship and infection control. The present review article summarizes the findings of whole-genome sequencing-based resistome analyses of nosocomial multidrug-resistant NFGNBs in the Balkan countries.
Collapse
Affiliation(s)
- Slavil Peykov
- Department of Genetics, Faculty of Biology, Sofia University “St. Kliment Ohridski”, 8, Dragan Tzankov Blvd., 1164 Sofia, Bulgaria
- Department of Medical Microbiology, Faculty of Medicine, Medical University of Sofia, 2, Zdrave Str., 1431 Sofia, Bulgaria
- BioInfoTech Laboratory, Sofia Tech Park, 111, Tsarigradsko Shosse Blvd., 1784 Sofia, Bulgaria
- Correspondence: (S.P.); (T.S.); Tel.: +359-87-6454492 (S.P.); +359-2-9172750 (T.S.)
| | - Tanya Strateva
- Department of Medical Microbiology, Faculty of Medicine, Medical University of Sofia, 2, Zdrave Str., 1431 Sofia, Bulgaria
- Correspondence: (S.P.); (T.S.); Tel.: +359-87-6454492 (S.P.); +359-2-9172750 (T.S.)
| |
Collapse
|
16
|
Kumaraswamy M, Coady A, Szubin R, Martin TCS, Palsson B, Nizet V, Monk JM. Comprehensive whole genome sequencing with hybrid assembly of multi-drug resistant Candida albicans isolate causing cerebral abscess. Curr Res Microb Sci 2023; 4:100180. [PMID: 36685102 PMCID: PMC9852921 DOI: 10.1016/j.crmicr.2023.100180] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Comprehensive whole genome sequencing (WGS) with hybrid assembly of a multi-drug resistant (MDR) Candida albicans (CA) isolate causing cerebral abscess was performed using Illumina paired end and Oxford Nanopore long read technologies. The innovative technologies utilized here enabled us to resolve fragmented assemblies, and implement comprehensive and detailed genomic analyses involved in antifungal resistance of Candida spp. Functionally important genes (MDR1, CDR2 and SQN2) involved in antifungal resistance were identified and a phylogenetic analysis of the clinical isolate was performed. Additionally, our clinical isolate was found to share 4 single nucleotide polymorphisms with two other sequenced strains of MDR C. auris (381 and 386) including translation elongation factor EF1α and EF3, ATPase activity associated proteins, and the lysine tRNA ligase.
Collapse
Affiliation(s)
- Monika Kumaraswamy
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), University of California San Diego, La Jolla, CA 92093, USA,Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA,Infectious Diseases Section, VA San Diego Healthcare System, San Diego, CA 92161, USA,Corresponding author at: Division of Infectious Diseases, Department of Medicine, University of California San Diego, 9500 Gilman Drive, Mail Code 0711, La Jolla, CA 92093-0711, USA.
| | - Alison Coady
- Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA
| | - Richard Szubin
- Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Thomas CS Martin
- Department of Medicine, University of California San Diego, La Jolla, CA 92093, USA
| | - Bernhard Palsson
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), University of California San Diego, La Jolla, CA 92093, USA,Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA
| | - Victor Nizet
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), University of California San Diego, La Jolla, CA 92093, USA,Department of Pediatrics, University of California San Diego, La Jolla, CA 92093, USA,Skaggs School of Pharmacy and Pharmaceutical Sciences, University of California San Diego, La Jolla, CA 92093, USA
| | - Jonathan M. Monk
- Collaborative to Halt Antibiotic-Resistant Microbes (CHARM), University of California San Diego, La Jolla, CA 92093, USA,Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA,Corresponding author at: Department of Bioengineering, University of California San Diego, La Jolla, CA 92093, USA.
| |
Collapse
|
17
|
Chernogor L, Eliseikina M, Petrushin I, Chernogor E, Khanaev I, Belikov SI. Janthinobacterium sp. Strain SLB01 as Pathogenic Bacteria for Sponge Lubomirskia baikalensis. Pathogens 2022; 12:pathogens12010008. [PMID: 36678355 PMCID: PMC9860564 DOI: 10.3390/pathogens12010008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2022] [Revised: 12/14/2022] [Accepted: 12/16/2022] [Indexed: 12/24/2022] Open
Abstract
Sponges (phylum Porifera) are ancient, marine and inland water, filter feeding metazoans. In recent years, diseased sponges have been increasingly occurring in marine and freshwater environments. Endemic freshwater sponges of the Lubomirskiidae family are widely distributed in the coastal zone of Lake Baikal. The strain Janthinobacterium sp. SLB01 was isolated previously from the diseased sponge Lubomirskia baikalensis (Pallas, 1776), although its pathogenicity is still unknown. The aim of this study was to confirm whether the Janthinobacterium sp. strain SLB01 is the pathogen found in Baikal sponge. To address this aim, we infected the cell culture of primmorphs of the sponge L. baikalensis with strain SLB01 and subsequently reisolated and sequenced the strain Janthinobacterium sp. PLB02. The results showed that the isolated strain has more than 99% homology with strain SLB01. The genomes of both strains contain genes vioABCDE of violacein biosynthesis and floc formation, for strong biofilm, in addition to the type VI secretion system (T6SS) as the main virulence factor. Based on a comparison of complete genomes, we showed the similarity of the studied bacterial strains of Janthinobacterium spp. with the described strain of Janthinobacterium lividum MTR. This study will help expand our understanding of microbial interactions and determine one of the causes in the development of diseases and death in Baikal sponges.
Collapse
Affiliation(s)
- Lubov Chernogor
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, 664033 Irkutsk, Russia
- Correspondence: (L.C.); (S.I.B.)
| | - Marina Eliseikina
- A.V. Zhirmunsky National Scientific Center of Marine Biology, Far Eastern Branch, Russian Academy of Sciences, 690041 Vladivostok, Russia
| | - Ivan Petrushin
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, 664033 Irkutsk, Russia
| | - Ekaterina Chernogor
- Faculty of Business Communication and Informatics, Irkutsk State University, 664033 Irkutsk, Russia
| | - Igor Khanaev
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, 664033 Irkutsk, Russia
| | - Sergei I. Belikov
- Limnological Institute, Siberian Branch of the Russian Academy of Sciences, 664033 Irkutsk, Russia
- Correspondence: (L.C.); (S.I.B.)
| |
Collapse
|
18
|
Álvarez-González L, Arias-Sardá C, Montes-Espuña L, Marín-Gual L, Vara C, Lister NC, Cuartero Y, Garcia F, Deakin J, Renfree MB, Robinson TJ, Martí-Renom MA, Waters PD, Farré M, Ruiz-Herrera A. Principles of 3D chromosome folding and evolutionary genome reshuffling in mammals. Cell Rep 2022; 41:111839. [PMID: 36543130 DOI: 10.1016/j.celrep.2022.111839] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/18/2022] [Revised: 10/01/2022] [Accepted: 11/24/2022] [Indexed: 12/24/2022] Open
Abstract
Studying the similarities and differences in genomic interactions between species provides fertile grounds for determining the evolutionary dynamics underpinning genome function and speciation. Here, we describe the principles of 3D genome folding in vertebrates and show how lineage-specific patterns of genome reshuffling can result in different chromatin configurations. We (1) identified different patterns of chromosome folding in across vertebrate species (centromere clustering versus chromosomal territories); (2) reconstructed ancestral marsupial and afrotherian genomes analyzing whole-genome sequences of species representative of the major therian phylogroups; (3) detected lineage-specific chromosome rearrangements; and (4) identified the dynamics of the structural properties of genome reshuffling through therian evolution. We present evidence of chromatin configurational changes that result from ancestral inversions and fusions/fissions. We catalog the close interplay between chromatin higher-order organization and therian genome evolution and introduce an interpretative hypothesis that explains how chromatin folding influences evolutionary patterns of genome reshuffling.
Collapse
Affiliation(s)
- Lucía Álvarez-González
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain; Genome Integrity and Instability Group, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain
| | | | - Laia Montes-Espuña
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain; Genome Integrity and Instability Group, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain
| | - Laia Marín-Gual
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain; Genome Integrity and Instability Group, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain
| | - Covadonga Vara
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain; Genome Integrity and Instability Group, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain
| | - Nicholas C Lister
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, UNSW Sydney, Sydney, NSW 2052, Australia
| | - Yasmina Cuartero
- CNAG-CRG, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Baldiri Reixac 4, 08028 Barcelona, Spain
| | - Francisca Garcia
- Servei de Cultius Cel.lulars-SCAC, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain
| | - Janine Deakin
- Institute for Applied Ecology, University of Canberra, Bruce, ACT 2617, Australia
| | - Marilyn B Renfree
- School of Biosciences, The University of Melbourne, Victoria, VIC 3010, Australia
| | - Terence J Robinson
- Evolutionary Genomics Group, Department of Botany and Zoology, Faculty of Science, Stellenbosch University, Private Bag X1, Stellenbosch 7602, South Africa
| | - Marc A Martí-Renom
- CNAG-CRG, Centre for Genomic Regulation, The Barcelona Institute of Science and Technology, Baldiri Reixac 4, 08028 Barcelona, Spain; Centre for Genomic Regulation, The Barcelona Institute for Science and Technology, Carrer del Doctor Aiguader 88, 08003 Barcelona, Spain; ICREA, Pg. Lluís Companys 23, 08010 Barcelona, Spain; Universitat Pompeu Fabra (UPF), 08002 Barcelona, Spain
| | - Paul D Waters
- School of Biotechnology and Biomolecular Sciences, Faculty of Science, UNSW Sydney, Sydney, NSW 2052, Australia
| | - Marta Farré
- School of Biosciences, University of Kent, Canterbury, Kent CT2 7NJ, UK
| | - Aurora Ruiz-Herrera
- Departament de Biologia Cel·lular, Fisiologia i Immunologia, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain; Genome Integrity and Instability Group, Institut de Biotecnologia i Biomedicina, Universitat Autònoma de Barcelona, 08193 Cerdanyola del Vallès, Spain.
| |
Collapse
|
19
|
Bursell MG, Dikow RB, Figueiró HV, Dudchenko O, Flanagan JP, Aiden EL, Goossens B, Nathan SK, Johnson WE, Koepfli KP, Frandsen PB. Whole genome analysis of clouded leopard species reveals an ancient divergence and distinct demographic histories. iScience 2022; 25:105647. [PMID: 36590460 PMCID: PMC9801239 DOI: 10.1016/j.isci.2022.105647] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Revised: 08/08/2022] [Accepted: 11/18/2022] [Indexed: 12/14/2022] Open
Abstract
Similar to other apex predator species, populations of mainland (Neofelis nebulosa) and Sunda (Neofelis diardi) clouded leopards are declining. Understanding their patterns of genetic variation can provide critical insights on past genetic erosion and a baseline for understanding their long-term conservation needs. As a step toward this goal, we present draft genome assemblies for the two clouded leopard species to quantify their phylogenetic divergence, genome-wide diversity, and historical population trends. We estimate that the two species diverged 5.1 Mya, much earlier than previous estimates of 1.41 Mya and 2.86 Mya, suggesting they separated when Sundaland was becoming increasingly isolated from mainland Southeast Asia. The Sunda clouded leopard displays a distinct and reduced effective population size trajectory, consistent with a lower genome-wide heterozygosity and SNP density, relative to the mainland clouded leopard. Our results provide new insights into the evolutionary history and genetic health of this unique lineage of felids.
Collapse
Affiliation(s)
- Madeline G. Bursell
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602, USA,Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20560, USA
| | - Rebecca B. Dikow
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20560, USA
| | - Henrique V. Figueiró
- Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, VA 22630, USA
| | - Olga Dudchenko
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA,Center for Theoretical Biological Physics, Rice University, Houston, TX, USA
| | | | - Erez Lieberman Aiden
- The Center for Genome Architecture, Department of Molecular and Human Genetics, Baylor College of Medicine, Houston, TX, USA,Center for Theoretical Biological Physics, Rice University, Houston, TX, USA,UWA School of Agriculture and Environment, The University of Western Australia, Crawley, WA 6009, Australia,Departments of Computer Science and Computational and Applied Mathematics, Rice University,Houston, TX, USA,Broad Institute of MIT and Harvard, Cambridge, MA, USA,Shanghai Institute for Advanced Immunochemical Studies, Shanghai Tech University, Shanghai, China
| | - Benoit Goossens
- Sabah Wildlife Department, Kota Kinabalu, Sabah, Malaysia,Organisms and Environment Division, Cardiff School of Biosciences, Cardiff, UK,Danau Girang Field Centre, c/o Sabah Wildlife Department, Kota Kinabalu, Sabah, Malaysia
| | | | - Warren E. Johnson
- Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, VA 22630, USA,The Walter Reed Biosystematics Unit, Museum Support Center MRC-534, Smithsonian Institution, Suitland, MD, USA,Walter Reed Army Institute of Research, Silver Spring, MD, USA,Loyola University Maryland, Baltimore, MD, USA
| | - Klaus-Peter Koepfli
- Center for Species Survival, Smithsonian Conservation Biology Institute, National Zoological Park, Front Royal, VA 22630, USA,Smithsonian-Mason School of Conservation, George Mason University, Front Royal, VA 22630, USA,Corresponding author
| | - Paul B. Frandsen
- Department of Plant and Wildlife Sciences, Brigham Young University, Provo, UT 84602, USA,Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC 20560, USA,Corresponding author
| |
Collapse
|
20
|
Fernandes Santos CA, Rodrigues da Costa S, Silva Boiteux L, Grattapaglia D, Silva-Junior OB. Genetic associations with resistance to Meloidogyne enterolobii in guava (Psidium sp.) using cross-genera SNPs and comparative genomics to Eucalyptus highlight evolutionary conservation across the Myrtaceae. PLoS One 2022; 17:e0273959. [PMID: 36322533 PMCID: PMC9629644 DOI: 10.1371/journal.pone.0273959] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/17/2022] [Accepted: 10/14/2022] [Indexed: 11/07/2022] Open
Abstract
Tropical fruit tree species constitute a yet untapped supply of outstanding diversity of taste and nutritional value, barely developed from the genetics standpoint, with scarce or no genomic resources to tackle the challenges arising in modern breeding practice. We generated a de novo genome assembly of the Psidium guajava, the super fruit “apple of the tropics”, and successfully transferred 14,268 SNP probesets from Eucalyptus to Psidium at the nucleotide level, to detect genomic loci linked to resistance to the root knot nematode (RKN) Meloidogyne enterolobii derived from the wild relative P. guineense. Significantly associated loci with resistance across alternative analytical frameworks, were detected at two SNPs on chromosome 3 in a pseudo-assembly of Psidium guajava genome built using a syntenic path approach with the Eucalyptus grandis genome to determine the order and orientation of the contigs. The P. guineense-derived resistance response to RKN and disease onset is conceivably triggered by mineral nutrients and phytohormone homeostasis or signaling with the involvement of the miRNA pathway. Hotspots of mapped resistance quantitative trait loci and functional annotation in the same genomic region of Eucalyptus provide further indirect support to our results, highlighting the evolutionary conservation of genomes across genera of Myrtaceae in the adaptation to pathogens. Marker assisted introgression of the resistance loci mapped should accelerate the development of improved guava cultivars and hybrid rootstocks.
Collapse
Affiliation(s)
| | - Soniane Rodrigues da Costa
- Graduate program in Genetic Resources, Universidade Estadual de Feira de Santana, Feira de Santana, Bahia, Brazil
| | | | - Dario Grattapaglia
- Embrapa Genetic Resources and Biotechnology (CENARGEN), Brasília, Distrito Federal, Brazil
- * E-mail:
| | | |
Collapse
|
21
|
Guo R, Papanicolaou A, Fritz ML. Validation of reference-assisted assembly using existing and novel Heliothine genomes. Genomics 2022; 114:110441. [PMID: 35931274 DOI: 10.1016/j.ygeno.2022.110441] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/29/2021] [Revised: 07/19/2022] [Accepted: 07/29/2022] [Indexed: 11/16/2022]
Abstract
Chloridea subflexa and Chloridea virescens are a pair of closely related noctuid species exhibiting pheromone-based sexual isolation and divergent host plant preferences. We produced a novel Illumina short read C. subflexa genome assembly and an improved C. virescens genome assembly, which offer opportunities to study the genomic basis for evolutionarily important traits in this lepidopteran family with few genomic resources. We then examined the feasibility of reference-assisted assembly, an approach that leverages existing high quality genomic resources for genome improvement in closely related taxa and applied it to our Heliothine genomes. Our work demonstrates that reference-assisted assembly has the potential to enhance contiguity and completeness of existing insect genomic resources with minimal additional laboratory costs. We conclude by discussing both the potential and pitfalls of reference-assisted assembly according to the intended downstream assembly application.
Collapse
Affiliation(s)
- Rong Guo
- Department of Entomology, University of Maryland, College Park, MD 20742, USA; Computational Biology, Bioinformatics and Genomics Program, Department of Biological Sciences, University of Maryland, College Park, MD 20742, USA
| | - Alexie Papanicolaou
- Hawkesbury Institute for the Environment, Western Sydney University, Richmond, NSW 2753, Australia.
| | - Megan L Fritz
- Department of Entomology, University of Maryland, College Park, MD 20742, USA; Computational Biology, Bioinformatics and Genomics Program, Department of Biological Sciences, University of Maryland, College Park, MD 20742, USA.
| |
Collapse
|
22
|
Lok S, Lau TNH, Trost B, Tong AHY, Wintle RF, Engstrom MD, Stacy E, Waits LP, Scrafford M, Scherer SW. Chromosomal-level reference genome assembly of the North American wolverine ( Gulo gulo luscus): a resource for conservation genomics. G3 Genes|Genomes|Genetics 2022; 12:6604289. [PMID: 35674384 PMCID: PMC9339297 DOI: 10.1093/g3journal/jkac138] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/01/2022] [Accepted: 05/19/2022] [Indexed: 11/21/2022]
Abstract
We report a chromosomal-level genome assembly of a male North American wolverine (Gulo gulo luscus) from the Kugluktuk region of Nunavut, Canada. The genome was assembled directly from long-reads, comprising: 758 contigs with a contig N50 of 36.6 Mb; contig L50 of 20; base count of 2.39 Gb; and a near complete representation (99.98%) of the BUSCO 5.2.2 set of 9,226 genes. A presumptive chromosomal-level assembly was generated by scaffolding against two chromosomal-level Mustelidae reference genomes, the ermine and the Eurasian river otter, to derive a final scaffold N50 of 144.0 Mb and a scaffold L50 of 7. We annotated a comprehensive set of genes that have been associated with models of aggressive behavior, a trait which the wolverine is purported to have in the popular literature. To support an integrated, genomics-based wildlife management strategy at a time of environmental disruption from climate change, we annotated the principal genes of the innate immune system to provide a resource to study the wolverine’s susceptibility to new infectious and parasitic diseases. As a resource, we annotated genes involved in the modality of infection by the coronaviruses, an important class of viral pathogens of growing concern as shown by the recent spillover infections by severe acute respiratory syndrome coronavirus-2 to naïve wildlife. Tabulation of heterozygous single nucleotide variants in our specimen revealed a heterozygosity level of 0.065%, indicating a relatively diverse genetic pool that would serve as a baseline for the genomics-based conservation of the wolverine, a rare cold-adapted carnivore now under threat.
Collapse
Affiliation(s)
- Si Lok
- The Centre for Applied Genomics, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
| | - Timothy N H Lau
- The Centre for Applied Genomics, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
| | - Brett Trost
- The Centre for Applied Genomics, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
| | - Amy H Y Tong
- Donnelly Centre for Cellular and Biomolecular Research, University of Toronto , ON M5S 3E1, Canada
| | - Richard F Wintle
- The Centre for Applied Genomics, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
| | - Mark D Engstrom
- Department of Natural History, Royal Ontario Museum , Toronto, ON M5S 2C6, Canada
| | - Elise Stacy
- Environmental Science Program, University of Idaho , Moscow, ID 83844, USA
- Wildlife Conservation Society, Arctic Beringia , Fairbanks, AK 99709, USA
| | - Lisette P Waits
- Department of Fish and Wildlife, University of Idaho , Moscow, ID 83844, USA
| | - Matthew Scrafford
- Wildlife Conservation Society Canada , Thunder Bay, ON P7A 4K9, Canada
| | - Stephen W Scherer
- The Centre for Applied Genomics, Peter Gilgan Centre for Research and Learning, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
- Program in Genetics and Genome Biology, The Hospital for Sick Children , Toronto, ON M5G 0A4, Canada
- McLaughlin Centre, University of Toronto , Toronto, ON M5G 0A4, Canada
- Department of Molecular Genetics, Faculty of Medicine, University of Toronto , ON M5S 1A8, Canada
| |
Collapse
|
23
|
Liu SC, Ju YR, Lu CL. Multi-CSAR: a web server for scaffolding contigs using multiple reference genomes. Nucleic Acids Res 2022; 50:W500-W509. [PMID: 35524553 PMCID: PMC9252826 DOI: 10.1093/nar/gkac301] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/22/2022] [Revised: 04/09/2022] [Accepted: 04/15/2022] [Indexed: 11/12/2022] Open
Abstract
Multi-CSAR is a web server that can efficiently and more accurately order and orient the contigs in the assembly of a target genome into larger scaffolds based on multiple reference genomes. Given a target genome and multiple reference genomes, Multi-CSAR first identifies sequence markers shared between the target genome and each reference genome, then utilizes these sequence markers to compute a scaffold for the target genome based on each single reference genome, and finally combines all the single reference-derived scaffolds into a multiple reference-derived scaffold. To run Multi-CSAR, the users need to upload a target genome to be scaffolded and one or more reference genomes in multi-FASTA format. The users can also choose to use the ‘weighting scheme of reference genomes’ for Multi-CSAR to automatically calculate different weights for the reference genomes and choose either ‘NUCmer on nucleotides’ or ‘PROmer on translated amino acids’ for Multi-CSAR to identify sequence markers. In the output page, Multi-CSAR displays its multiple reference-derived scaffold in two graphical representations (i.e. Circos plot and dotplot) for the users to visually validate the correctness of scaffolded contigs and in a tabular representation to further validate the scaffold in detail. Multi-CSAR is available online at http://genome.cs.nthu.edu.tw/Multi-CSAR/.
Collapse
Affiliation(s)
- Shu-Cheng Liu
- Department of Computer Science, National Tsing Hua University, Hsinchu 30013, Taiwan
| | - Yan-Ru Ju
- Department of Computer Science, National Tsing Hua University, Hsinchu 30013, Taiwan
| | - Chin Lung Lu
- Department of Computer Science, National Tsing Hua University, Hsinchu 30013, Taiwan
| |
Collapse
|
24
|
Li J, Llorente B, Liti G, Yue JX. RecombineX: A generalized computational framework for automatic high-throughput gamete genotyping and tetrad-based recombination analysis. PLoS Genet 2022; 18:e1010047. [PMID: 35533184 PMCID: PMC9119626 DOI: 10.1371/journal.pgen.1010047] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2022] [Revised: 05/19/2022] [Accepted: 04/14/2022] [Indexed: 01/09/2023] Open
Abstract
Meiotic recombination is an essential biological process that ensures faithful chromosome segregation and promotes parental allele shuffling. Tetrad analysis is a powerful approach to quantify the genetic makeups and recombination landscapes of meiotic products. Here we present RecombineX (https://github.com/yjx1217/RecombineX), a generalized computational framework that automates the full workflow of marker identification, gamete genotyping, and tetrad-based recombination profiling based on any organism or genetic background with batch processing capability. Aside from conventional reference-based analysis, RecombineX can also perform analysis based on parental genome assemblies, which facilitates analyzing meiotic recombination landscapes in their native genomic contexts. Additional features such as copy number variation profiling and missing genotype inference further enhance downstream analysis. RecombineX also includes a dedicate module for simulating the genomes and reads of recombinant tetrads, which enables fine-tuned simulation-based hypothesis testing. This simulation module revealed the power and accuracy of RecombineX even when analyzing tetrads with very low sequencing depths (e.g., 1-2X). Tetrad sequencing data from the budding yeast Saccharomyces cerevisiae and green alga Chlamydomonas reinhardtii were further used to demonstrate the accuracy and robustness of RecombineX for organisms with both small and large genomes, manifesting RecombineX as an all-around one stop solution for future tetrad analysis. Interestingly, our re-analysis of the budding yeast tetrad sequencing data with RecombineX and Oxford Nanopore sequencing revealed two unusual structural rearrangement events that were not noticed before, which exemplify the occasional genome instability triggered by meiosis.
Collapse
Affiliation(s)
- Jing Li
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
- Université Côte d’Azur, CNRS, INSERM, IRCAN, Nice, France
| | - Bertrand Llorente
- Aix-Marseille Université, CNRS, INSERM, CRCM, Institut Paoli-Calmettes, Marseille, France
| | - Gianni Liti
- Université Côte d’Azur, CNRS, INSERM, IRCAN, Nice, France
- * E-mail: (GL); (JXY)
| | - Jia-Xing Yue
- State Key Laboratory of Oncology in South China, Collaborative Innovation Center for Cancer Medicine, Guangdong Key Laboratory of Nasopharyngeal Carcinoma Diagnosis and Therapy, Sun Yat-sen University Cancer Center, Guangzhou, China
- Université Côte d’Azur, CNRS, INSERM, IRCAN, Nice, France
- * E-mail: (GL); (JXY)
| |
Collapse
|
25
|
Talenti A, Powell J, Hemmink JD, Cook EAJ, Wragg D, Jayaraman S, Paxton E, Ezeasor C, Obishakin ET, Agusi ER, Tijjani A, Amanyire W, Muhanguzi D, Marshall K, Fisch A, Ferreira BR, Qasim A, Chaudhry U, Wiener P, Toye P, Morrison LJ, Connelley T, Prendergast JGD. A cattle graph genome incorporating global breed diversity. Nat Commun 2022; 13:910. [PMID: 35177600 DOI: 10.1038/s41467-022-28605-0] [Citation(s) in RCA: 9] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2020] [Accepted: 01/20/2022] [Indexed: 11/28/2022] Open
Abstract
Despite only 8% of cattle being found in Europe, European breeds dominate current genetic resources. This adversely impacts cattle research in other important global cattle breeds, especially those from Africa for which genomic resources are particularly limited, despite their disproportionate importance to the continent’s economies. To mitigate this issue, we have generated assemblies of African breeds, which have been integrated with genomic data for 294 diverse cattle into a graph genome that incorporates global cattle diversity. We illustrate how this more representative reference assembly contains an extra 116.1 Mb (4.2%) of sequence absent from the current Hereford sequence and consequently inaccessible to current studies. We further demonstrate how using this graph genome increases read mapping rates, reduces allelic biases and improves the agreement of structural variant calling with independent optical mapping data. Consequently, we present an improved, more representative, reference assembly that will improve global cattle research. Cattle reference genomes are valuable resources but are currently heavily biased towards European breeds. Here the authors integrate assemblies for African breeds into a more representative cattle graph genome capturing global breed diversity.
Collapse
|
26
|
Suvorov A, Kim BY, Wang J, Armstrong EE, Peede D, D'Agostino ERR, Price DK, Waddell P, Lang M, Courtier-Orgogozo V, David JR, Petrov D, Matute DR, Schrider DR, Comeault AA. Widespread introgression across a phylogeny of 155 Drosophila genomes. Curr Biol 2022; 32:111-123.e5. [PMID: 34788634 PMCID: PMC8752469 DOI: 10.1016/j.cub.2021.10.052] [Citation(s) in RCA: 70] [Impact Index Per Article: 35.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/30/2021] [Revised: 09/29/2021] [Accepted: 10/22/2021] [Indexed: 01/12/2023]
Abstract
Genome-scale sequence data have invigorated the study of hybridization and introgression, particularly in animals. However, outside of a few notable cases, we lack systematic tests for introgression at a larger phylogenetic scale across entire clades. Here, we leverage 155 genome assemblies from 149 species to generate a fossil-calibrated phylogeny and conduct multilocus tests for introgression across 9 monophyletic radiations within the genus Drosophila. Using complementary phylogenomic approaches, we identify widespread introgression across the evolutionary history of Drosophila. Mapping gene-tree discordance onto the phylogeny revealed that both ancient and recent introgression has occurred across most of the 9 clades that we examined. Our results provide the first evidence of introgression occurring across the evolutionary history of Drosophila and highlight the need to continue to study the evolutionary consequences of hybridization and introgression in this genus and across the tree of life.
Collapse
Affiliation(s)
- Anton Suvorov
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA.
| | - Bernard Y Kim
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Jeremy Wang
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
| | | | - David Peede
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | | | - Donald K Price
- School of Life Sciences, University of Nevada, Las Vegas, NV 89119, USA
| | - Peter Waddell
- School of Fundamental Sciences, Massey University, Palmerston North 4442, New Zealand
| | - Michael Lang
- CNRS, Institut Jacques Monod, Université de Paris, Paris 75013, France
| | | | - Jean R David
- Laboratoire Evolution, Génomes, Comportement, Ecologie (EGCE) CNRS, IRD, Univ. Paris-sud, Université Paris-Saclay, Gif sur Yvette 91190, France; Institut de Systématique, Evolution, Biodiversité, CNRS, MNHN, UPMC, EPHE, Muséum National d'Histoire Naturelle, Sorbonne Universités, Paris 75005, France
| | - Dmitri Petrov
- Department of Biology, Stanford University, Stanford, CA, USA
| | - Daniel R Matute
- Department of Biology, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Daniel R Schrider
- Department of Genetics, University of North Carolina, Chapel Hill, NC 27599, USA
| | - Aaron A Comeault
- Molecular Ecology & Evolution Group, School of Natural Sciences, Bangor University, Bangor, Gwynedd LL57 2DGA, UK.
| |
Collapse
|
27
|
Belikov SI, Petrushin IS, Chernogor LI. Genome Analysis of the Janthinobacterium sp. Strain SLB01 from the Diseased Sponge of the Lubomirskia baicalensis. Curr Issues Mol Biol 2021; 43:2220-2237. [PMID: 34940130 PMCID: PMC8929069 DOI: 10.3390/cimb43030156] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/27/2021] [Revised: 12/06/2021] [Accepted: 12/07/2021] [Indexed: 12/22/2022] Open
Abstract
The strain Janthinobacterium sp. SLB01 was isolated from the diseased freshwater sponge Lubomirskia baicalensis (Pallas, 1776) and the draft genome was published previously. The aim of this work is to analyze the genome of the Janthinobacterium sp. SLB01 to search for pathogenicity factors for Baikal sponges. We performed genomic analysis to determine virulence factors, comparing the genome of the strain SLB01 with genomes of other related J. lividum strains from the environment. The strain Janthinobacterium sp. SLB01 contained genes encoding violacein, alpha-amylases, phospholipases, chitinases, collagenases, hemolysin, and a type VI secretion system. In addition, the presence of conservative clusters of genes for the biosynthesis of secondary metabolites of tropodithietic acid and marinocine was found. We present genes for antibiotic resistance, including five genes encoding various lactamases and eight genes for penicillin-binding proteins, which are conserved in all analyzed strains. Major differences were found between the Janthinobacterium sp. SLB01 and J. lividum strains in the spectra of genes for glycosyltransferases and glycoside hydrolases, serine hydrolases, and trypsin-like peptidase, as well as some TonB-dependent siderophore receptors. Thus, the study of the analysis of the genome of the strain SLB01 allows us to conclude that the strain may be one of the pathogens of freshwater sponges.
Collapse
|
28
|
Powell DL, Moran B, Kim B, Banerjee SM, Aguillon SM, Fascinetto-Zago P, Langdon Q, Schumer M. Two new hybrid populations expand the swordtail hybridization model system. Evolution 2021; 75:2524-2539. [PMID: 34460102 PMCID: PMC8659863 DOI: 10.1111/evo.14337] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/08/2021] [Revised: 06/11/2021] [Accepted: 06/22/2021] [Indexed: 12/25/2022]
Abstract
Natural hybridization events provide unique windows into the barriers that keep species apart as well as the consequences of their breakdown. Here, we characterize hybrid populations formed between the northern swordtail fish Xiphophorus cortezi and Xiphophorus birchmanni from collection sites on two rivers. We use simulations and new genetic reference panels to develop sensitive and accurate local ancestry calling in this novel system. Strikingly, we find that hybrid populations on both rivers consist of two genetically distinct subpopulations: a cluster of pure X. birchmanni individuals and one of phenotypically intermediate hybrids that derive ∼85-90% of their genome from X. cortezi. Simulations suggest that initial hybridization occurred ∼150 generations ago at both sites, with little evidence for contemporary gene flow between subpopulations. This population structure is consistent with strong assortative mating between individuals of similar ancestry. The patterns of population structure uncovered here mirror those seen in hybridization between X. birchmanni and its sister species, Xiphophorus malinche, indicating an important role for assortative mating in the evolution of hybrid populations. Future comparisons will provide a window into the shared mechanisms driving the outcomes of hybridization not only among independent hybridization events between the same species but also across distinct species pairs.
Collapse
Affiliation(s)
- Daniel L. Powell
- Department of Biology, Stanford University,Centro de Investigaciones Científicas de las Huastecas “Aguazarca”, A.C.,Correspondence to: and
| | - Ben Moran
- Department of Biology, Stanford University,Centro de Investigaciones Científicas de las Huastecas “Aguazarca”, A.C
| | | | - Shreya M. Banerjee
- Department of Biology, Stanford University,Centro de Investigaciones Científicas de las Huastecas “Aguazarca”, A.C
| | - Stepfanie M. Aguillon
- Department of Biology, Stanford University,Department of Ecology and Evolutionary Biology, Cornell University
| | - Paola Fascinetto-Zago
- Centro de Investigaciones Científicas de las Huastecas “Aguazarca”, A.C.,Department of Biology, Texas A&M University
| | - Quinn Langdon
- Department of Biology, Stanford University,Centro de Investigaciones Científicas de las Huastecas “Aguazarca”, A.C
| | - Molly Schumer
- Department of Biology, Stanford University,Centro de Investigaciones Científicas de las Huastecas “Aguazarca”, A.C.,Hanna H. Gray Fellow, Howard Hughes Medical Institutes,Correspondence to: and
| |
Collapse
|
29
|
Charlesworth D, Graham C, Trivedi U, Gardner J, Bergero R. PromethION sequencing and assembly of the genome of Micropoecilia picta, a fish with a highly Degenerated Y chromosome. Genome Biol Evol 2021; 13:6326803. [PMID: 34297069 PMCID: PMC8449826 DOI: 10.1093/gbe/evab171] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/19/2021] [Indexed: 11/13/2022] Open
Abstract
We here describe sequencing and assembly of both the autosomes and the sex chromosome in M. picta, the closest related species to the guppy, Poecilia reticulata. Poecilia ()Micropoecilia) picta is a close outgroup for studying the guppy, an important organism for studies in evolutionary ecology and in sex chromosome evolution. The guppy XY pair (LG12) has long been studied as a test case for the importance of sexually antagonistic variants in selection for suppressed recombination between Y and X chromosomes. The guppy Y chromosome is not degenerated, but appears to carry functional copies of all genes that are present on its X counterpart. The X chromosomes of M. picta (and its relative M. parae) are homologous to the guppy XY pair, but their Y chromosomes are highly degenerated, and no genes can be identified in the fully Y-linked region. A complete genome sequence of a M. picta male may therefore contribute to understanding how the guppy Y evolved. These fish species' genomes are estimated to be about 750 Mb, with high densities of repetitive sequences, suggesting that long-read sequencing is needed. We evaluated several assembly approaches, and used our results to investigate the extent of Y chromosome degeneration in this species.
Collapse
Affiliation(s)
- Deborah Charlesworth
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Charlotte Auerbach Road, EH9 3LF, UK
| | - Chay Graham
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Charlotte Auerbach Road, EH9 3LF, UK.,University of Cambridge, Department of Biochemistry, Sanger Building, 80 Tennis Ct Rd, Cambridge, CB2 1GA, UK
| | - Urmi Trivedi
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Charlotte Auerbach Road, EH9 3LF, UK
| | - Jim Gardner
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Charlotte Auerbach Road, EH9 3LF, UK
| | - Roberta Bergero
- Institute of Evolutionary Biology, School of Biological Sciences, University of Edinburgh, Charlotte Auerbach Road, EH9 3LF, UK
| |
Collapse
|
30
|
Tvedte ES, Gasser M, Sparklin BC, Michalski J, Hjelmen CE, Johnston JS, Zhao X, Bromley R, Tallon LJ, Sadzewicz L, Rasko DA, Dunning Hotopp JC. Comparison of long-read sequencing technologies in interrogating bacteria and fly genomes. G3 (Bethesda) 2021; 11:jkab083. [PMID: 33768248 PMCID: PMC8495745 DOI: 10.1093/g3journal/jkab083] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/09/2021] [Accepted: 03/07/2021] [Indexed: 12/14/2022]
Abstract
The newest generation of DNA sequencing technology is highlighted by the ability to generate sequence reads hundreds of kilobases in length. Pacific Biosciences (PacBio) and Oxford Nanopore Technologies (ONT) have pioneered competitive long read platforms, with more recent work focused on improving sequencing throughput and per-base accuracy. We used whole-genome sequencing data produced by three PacBio protocols (Sequel II CLR, Sequel II HiFi, RS II) and two ONT protocols (Rapid Sequencing and Ligation Sequencing) to compare assemblies of the bacteria Escherichia coli and the fruit fly Drosophila ananassae. In both organisms tested, Sequel II assemblies had the highest consensus accuracy, even after accounting for differences in sequencing throughput. ONT and PacBio CLR had the longest reads sequenced compared to PacBio RS II and HiFi, and genome contiguity was highest when assembling these datasets. ONT Rapid Sequencing libraries had the fewest chimeric reads in addition to superior quantification of E. coli plasmids versus ligation-based libraries. The quality of assemblies can be enhanced by adopting hybrid approaches using Illumina libraries for bacterial genome assembly or polishing eukaryotic genome assemblies, and an ONT-Illumina hybrid approach would be more cost-effective for many users. Genome-wide DNA methylation could be detected using both technologies, however ONT libraries enabled the identification of a broader range of known E. coli methyltransferase recognition motifs in addition to undocumented D. ananassae motifs. The ideal choice of long read technology may depend on several factors including the question or hypothesis under examination. No single technology outperformed others in all metrics examined.
Collapse
Affiliation(s)
- Eric S Tvedte
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Mark Gasser
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Benjamin C Sparklin
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Jane Michalski
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Carl E Hjelmen
- Department of Biology, Texas A&M University, College Station, TX 77843, USA
| | - J Spencer Johnston
- Department of Entomology, Texas A&M University, College Station, TX 77843, USA
| | - Xuechu Zhao
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Robin Bromley
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Luke J Tallon
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Lisa Sadzewicz
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - David A Rasko
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| | - Julie C Dunning Hotopp
- Institute for Genome Sciences, University of Maryland School of Medicine, Baltimore, MD 21201, USA
- Department of Microbiology and Immunology, University of Maryland School of Medicine, Baltimore, MD 21201, USA
- Greenebaum Cancer Center, University of Maryland School of Medicine, Baltimore, MD 21201, USA
| |
Collapse
|
31
|
Anjanappa RB, Gruissem W. Current progress and challenges in crop genetic transformation. J Plant Physiol 2021; 261:153411. [PMID: 33872932 DOI: 10.1016/j.jplph.2021.153411] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/31/2020] [Revised: 03/29/2021] [Accepted: 03/29/2021] [Indexed: 05/14/2023]
Abstract
Plant transformation remains the most sought-after technology for functional genomics and crop genetic improvement, especially for introducing specific new traits and to modify or recombine already existing traits. Along with many other agricultural technologies, the global production of genetically engineered crops has steadily grown since they were first introduced 25 years ago. Since the first transfer of DNA into plant cells using Agrobacterium tumefaciens, different transformation methods have enabled rapid advances in molecular breeding approaches to bring crop varieties with novel traits to the market that would be difficult or not possible to achieve with conventional breeding methods. Today, transformation to produce genetically engineered crops is the fastest and most widely adopted technology in agriculture. The rapidly increasing number of sequenced plant genomes and information from functional genomics data to understand gene function, together with novel gene cloning and tissue culture methods, is further accelerating crop improvement and trait development. These advances are welcome and needed to make crops more resilient to climate change and to secure their yield for feeding the increasing human population. Despite the success, transformation remains a bottleneck because many plant species and crop genotypes are recalcitrant to established tissue culture and regeneration conditions, or they show poor transformability. Improvements are possible using morphogenetic transcriptional regulators, but their broader applicability remains to be tested. Advances in genome editing techniques and direct, non-tissue culture-based transformation methods offer alternative approaches to enhance varietal development in other recalcitrant crops. Here, we review recent developments in plant transformation and regeneration, and discuss opportunities for new breeding technologies in agriculture.
Collapse
Affiliation(s)
- Ravi B Anjanappa
- Institute of Molecular Plant Biology, Department of Biology, ETH Zurich, Universitätstrasse 2, 8092 Zurich, Switzerland
| | - Wilhelm Gruissem
- Institute of Molecular Plant Biology, Department of Biology, ETH Zurich, Universitätstrasse 2, 8092 Zurich, Switzerland; Advanced Plant Biotechnology Center, National Chung Hsing University, 145 Xingda Road, Taichung City 402, Taiwan.
| |
Collapse
|
32
|
Andras JP, Fields PD, Du Pasquier L, Fredericksen M, Ebert D. Genome-Wide Association Analysis Identifies a Genetic Basis of Infectivity in a Model Bacterial Pathogen. Mol Biol Evol 2021; 37:3439-3452. [PMID: 32658956 PMCID: PMC7743900 DOI: 10.1093/molbev/msaa173] [Citation(s) in RCA: 11] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/05/2020] [Revised: 06/22/2020] [Accepted: 07/08/2020] [Indexed: 12/22/2022] Open
Abstract
Knowledge of the genetic architecture of pathogen infectivity and host resistance is essential for a mechanistic understanding of coevolutionary processes, yet the genetic basis of these interacting traits remains unknown for most host-pathogen systems. We used a comparative genomic approach to explore the genetic basis of infectivity in Pasteuria ramosa, a Gram-positive bacterial pathogen of planktonic crustaceans that has been established as a model for studies of Red Queen host-pathogen coevolution. We sequenced the genomes of a geographically, phenotypically, and genetically diverse collection of P. ramosa strains and performed a genome-wide association study to identify genetic correlates of infection phenotype. We found multiple polymorphisms within a single gene, Pcl7, that correlate perfectly with one common and widespread infection phenotype. We then confirmed this perfect association via Sanger sequencing in a large and diverse sample set of P. ramosa clones. Pcl7 codes for a collagen-like protein, a class of adhesion proteins known or suspected to be involved in the infection mechanisms of a number of important bacterial pathogens. Consistent with expectations under Red Queen coevolution, sequence variation of Pcl7 shows evidence of balancing selection, including extraordinarily high diversity and absence of geographic structure. Based on structural homology with a collagen-like protein of Bacillus anthracis, we propose a hypothesis for the structure of Pcl7 and the physical location of the phenotype-associated polymorphisms. Our results offer strong evidence for a gene governing infectivity and provide a molecular basis for further study of Red Queen dynamics in this model host-pathogen system.
Collapse
Affiliation(s)
- Jason P Andras
- Department of Biological Sciences, Mount Holyoke College, South Hadley, MA
| | - Peter D Fields
- Division of Zoology, Department of Environmental Sciences, University of Basel, Basel, Switzerland
| | - Louis Du Pasquier
- Division of Zoology, Department of Environmental Sciences, University of Basel, Basel, Switzerland
| | - Maridel Fredericksen
- Division of Zoology, Department of Environmental Sciences, University of Basel, Basel, Switzerland
| | - Dieter Ebert
- Division of Zoology, Department of Environmental Sciences, University of Basel, Basel, Switzerland
| |
Collapse
|
33
|
Seferbekova Z, Zabelkin A, Yakovleva Y, Afasizhev R, Dranenko NO, Alexeev N, Gelfand MS, Bochkareva OO. High Rates of Genome Rearrangements and Pathogenicity of Shigella spp. Front Microbiol 2021; 12:628622. [PMID: 33912145 PMCID: PMC8072062 DOI: 10.3389/fmicb.2021.628622] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/12/2020] [Accepted: 03/22/2021] [Indexed: 02/01/2023] Open
Abstract
Shigella are pathogens originating within the Escherichia lineage but frequently classified as a separate genus. Shigella genomes contain numerous insertion sequences (ISs) that lead to pseudogenisation of affected genes and an increase of non-homologous recombination. Here, we study 414 genomes of E. coli and Shigella strains to assess the contribution of genomic rearrangements to Shigella evolution. We found that Shigella experienced exceptionally high rates of intragenomic rearrangements and had a decreased rate of homologous recombination compared to pathogenic and non-pathogenic E. coli. The high rearrangement rate resulted in independent disruption of syntenic regions and parallel rearrangements in different Shigella lineages. Specifically, we identified two types of chromosomally encoded E3 ubiquitin-protein ligases acquired independently by all Shigella strains that also showed a high level of sequence conservation in the promoter and further in the 5′-intergenic region. In the only available enteroinvasive E. coli (EIEC) strain, which is a pathogenic E. coli with a phenotype intermediate between Shigella and non-pathogenic E. coli, we found a rate of genome rearrangements comparable to those in other E. coli and no functional copies of the two Shigella-specific E3 ubiquitin ligases. These data indicate that the accumulation of ISs influenced many aspects of genome evolution and played an important role in the evolution of intracellular pathogens. Our research demonstrates the power of comparative genomics-based on synteny block composition and an important role of non-coding regions in the evolution of genomic islands.
Collapse
Affiliation(s)
- Zaira Seferbekova
- Faculty of Bioengineering and Bioinformatics, Lomonosov Moscow State University, Moscow, Russia.,Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Alexey Zabelkin
- Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia.,JetBrains Research, Saint Petersburg, Russia.,Bioinformatics Institute, Saint Petersburg, Russia
| | - Yulia Yakovleva
- Bioinformatics Institute, Saint Petersburg, Russia.,Department of Cytology and Histology, Saint Petersburg State University, Saint Petersburg, Russia
| | - Robert Afasizhev
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Natalia O Dranenko
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia
| | - Nikita Alexeev
- Computer Technologies Laboratory, ITMO University, Saint Petersburg, Russia
| | - Mikhail S Gelfand
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia.,Skolkovo Institute of Science and Technology, Moscow, Russia
| | - Olga O Bochkareva
- Institute for Information Transmission Problems (The Kharkevich Institute, RAS), Moscow, Russia.,Institute of Science and Technology (IST Austria), Klosterneuburg, Austria
| |
Collapse
|
34
|
García-Bayona L, Coyne MJ, Comstock LE. Mobile Type VI secretion system loci of the gut Bacteroidales display extensive intra-ecosystem transfer, multi-species spread and geographical clustering. PLoS Genet 2021; 17:e1009541. [PMID: 33901198 PMCID: PMC8102008 DOI: 10.1371/journal.pgen.1009541] [Citation(s) in RCA: 24] [Impact Index Per Article: 8.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 05/06/2021] [Accepted: 04/08/2021] [Indexed: 02/07/2023] Open
Abstract
The human gut microbiota is a dense microbial ecosystem with extensive opportunities for bacterial contact-dependent processes such as conjugation and Type VI secretion system (T6SS)-dependent antagonism. In the gut Bacteroidales, two distinct genetic architectures of T6SS loci, GA1 and GA2, are contained on Integrative and Conjugative Elements (ICE). Despite intense interest in the T6SSs of the gut Bacteroidales, there is only a superficial understanding of their evolutionary patterns, and of their dissemination among Bacteroidales species in human gut communities. Here, we combine extensive genomic and metagenomic analyses to better understand their ecological and evolutionary dynamics. We identify new genetic subtypes, document extensive intrapersonal transfer of these ICE to Bacteroidales species within human gut microbiomes, and most importantly, reveal frequent population fixation of these newly armed strains in multiple species within a person. We further show the distribution of each of the distinct T6SSs in human populations and show there is geographical clustering. We reveal that the GA1 T6SS ICE integrates at a minimal recombination site leading to their integration throughout genomes and their frequent interruption of genes, whereas the GA2 T6SS ICE integrate at one of three different tRNA genes. The exclusion of concurrent GA1 and GA2 T6SSs in individual strains is associated with intact T6SS loci and with an ICE-encoded gene. By performing a comprehensive analysis of mobile genetic elements (MGE) in co-resident Bacteroidales species in numerous human gut communities, we identify 74 MGE that transferred to multiple Bacteroidales species within individual gut microbiomes. We further show that only three other MGE demonstrate multi-species spread in human gut microbiomes to the degree demonstrated by the GA1 and GA2 ICE. These data underscore the ubiquity and dissemination of mobile T6SS loci within Bacteroidales communities and across human populations.
Collapse
Affiliation(s)
- Leonor García-Bayona
- Division of Infectious Diseases, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Michael J. Coyne
- Division of Infectious Diseases, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| | - Laurie E. Comstock
- Division of Infectious Diseases, Brigham and Women’s Hospital, Harvard Medical School, Boston, Massachusetts, United States of America
| |
Collapse
|
35
|
Petrushin IS, Markova YA, Karepova MS, Zaytseva YV, Belovezhets LA. Complete Genome Sequence of Rhodococcus qingshengii Strain VKM Ac-2784D, Isolated from Elytrigia repens Rhizosphere. Microbiol Resour Announc 2021; 10:e00107-21. [PMID: 33737361 DOI: 10.1128/MRA.00107-21] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/04/2022] Open
Abstract
Rhizosphere bacteria are considered to be promising destructors of oil and its components. Bacterial species of the genus Rhodococcus can degrade a variety of hydrocarbons and are widely used for the bioremediation of polluted environments. Here, we report the complete genome sequence of Rhodococcus qingshengii strain VKM Ac-2784D. Rhizosphere bacteria are considered to be promising destructors of oil and its components. Bacterial species of the genus Rhodococcus can degrade a variety of hydrocarbons and are widely used for the bioremediation of polluted environments. Here, we report the complete genome sequence of Rhodococcus qingshengii strain VKM Ac-2784D.
Collapse
|
36
|
Jay P, Chouteau M, Whibley A, Bastide H, Parrinello H, Llaurens V, Joron M. Mutation load at a mimicry supergene sheds new light on the evolution of inversion polymorphisms. Nat Genet 2021; 53:288-93. [PMID: 33495598 DOI: 10.1038/s41588-020-00771-1] [Citation(s) in RCA: 47] [Impact Index Per Article: 15.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/21/2020] [Accepted: 12/21/2020] [Indexed: 01/30/2023]
Abstract
Chromosomal inversions are ubiquitous in genomes and often coordinate complex phenotypes, such as the covariation of behavior and morphology in many birds, fishes, insects or mammals1-11. However, why and how inversions become associated with polymorphic traits remains obscure. Here we show that despite a strong selective advantage when they form, inversions accumulate recessive deleterious mutations that generate frequency-dependent selection and promote their maintenance at intermediate frequency. Combining genomics and in vivo fitness analyses in a model butterfly for wing-pattern polymorphism, Heliconius numata, we reveal that three ecologically advantageous inversions have built up a heavy mutational load from the sequential accumulation of deleterious mutations and transposable elements. Inversions associate with sharply reduced viability when homozygous, which prevents them from replacing ancestral chromosome arrangements. Our results suggest that other complex polymorphisms, rather than representing adaptations to competing ecological optima, could evolve because chromosomal rearrangements are intrinsically prone to carrying recessive harmful mutations.
Collapse
|
37
|
Luo J, Wei Y, Lyu M, Wu Z, Liu X, Luo H, Yan C. A comprehensive review of scaffolding methods in genome assembly. Brief Bioinform 2021; 22:6149347. [PMID: 33634311 DOI: 10.1093/bib/bbab033] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/14/2020] [Revised: 01/21/2021] [Accepted: 01/22/2021] [Indexed: 12/20/2022] Open
Abstract
In the field of genome assembly, scaffolding methods make it possible to obtain a more complete and contiguous reference genome, which is the cornerstone of genomic research. Scaffolding methods typically utilize the alignments between contigs and sequencing data (reads) to determine the orientation and order among contigs and to produce longer scaffolds, which are helpful for genomic downstream analysis. With the rapid development of high-throughput sequencing technologies, diverse types of reads have emerged over the past decade, especially in long-range sequencing, which have greatly enhanced the assembly quality of scaffolding methods. As the number of scaffolding methods increases, biology and bioinformatics researchers need to perform in-depth analyses of state-of-the-art scaffolding methods. In this article, we focus on the difficulties in scaffolding, the differences in characteristics among various kinds of reads, the methods by which current scaffolding methods address these difficulties, and future research opportunities. We hope this work will benefit the design of new scaffolding methods and the selection of appropriate scaffolding methods for specific biological studies.
Collapse
Affiliation(s)
- Junwei Luo
- College of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, China
| | - Yawei Wei
- College of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, China
| | - Mengna Lyu
- College of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, China
| | - Zhengjiang Wu
- College of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, China
| | - Xiaoyan Liu
- College of Computer Science and Technology, Henan Polytechnic University, Jiaozuo, China
| | - Huimin Luo
- School of Computer and Information Engineering, Henan University, Kaifeng, China
| | - Chaokun Yan
- School of Computer and Information Engineering, Henan University, Kaifeng, China
| |
Collapse
|
38
|
Huang S, He X, Wang G, Bao E. AlignGraph2: similar genome-assisted reassembly pipeline for PacBio long reads. Brief Bioinform 2021; 22:6146772. [PMID: 33621981 DOI: 10.1093/bib/bbab022] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/18/2021] [Revised: 02/12/2021] [Accepted: 02/16/2021] [Indexed: 11/13/2022] Open
Abstract
Contigs assembled from the third-generation sequencing long reads are usually more complete than the second-generation short reads. However, the current algorithms still have difficulty in assembling the long reads into the ideal complete and accurate genome, or the theoretical best result [1]. To improve the long read contigs and with more and more fully sequenced genomes available, it could still be possible to use the similar genome-assisted reassembly method [2], which was initially proposed for the short reads making use of a closely related genome (similar genome) to the sequencing genome (target genome). The method aligns the contigs and reads to the similar genome, and then extends and refines the aligned contigs with the aligned reads. Here, we introduce AlignGraph2, a similar genome-assisted reassembly pipeline for the PacBio long reads. The AlignGraph2 pipeline is the second version of AlignGraph algorithm proposed by us but completely redesigned, can be inputted with either error-prone or HiFi long reads, and contains four novel algorithms: similarity-aware alignment algorithm and alignment filtration algorithm for alignment of the long reads and preassembled contigs to the similar genome, and reassembly algorithm and weight-adjusted consensus algorithm for extension and refinement of the preassembled contigs. In our performance tests on both error-prone and HiFi long reads, AlignGraph2 can align 5.7-27.2% more long reads and 7.3-56.0% more bases than some current alignment algorithm and is more efficient or comparable to the others. For contigs assembled with various de novo algorithms and aligned to similar genomes (aligned contigs), AlignGraph2 can extend 8.7-94.7% of them (extendable contigs), and obtain contigs of 7.0-249.6% larger N50 value and 5.2-87.7% smaller number of indels per 100 kbp (extended contigs). With genomes of decreased similarities, AlignGraph2 also has relatively stable performance. The AlignGraph2 software can be downloaded for free from this site: https://github.com/huangs001/AlignGraph2.
Collapse
Affiliation(s)
- Shien Huang
- Group of Interdisciplinary Information Sciences, School of Software Engineering, Beijing Jiaotong University, China
| | - Xinyu He
- Group of Interdisciplinary Information Sciences, School of Software Engineering, Beijing Jiaotong University, China
| | - Guohua Wang
- College of Information and Computer Engineering, Northeast Forestry University, China
| | - Ergude Bao
- Interdisciplinary Information Sciences, School of Software Engineering, Beijing Jiaotong University, China
| |
Collapse
|
39
|
Burley JT, Kellner JR, Hubbell SP, Faircloth BC. Genome assemblies for two Neotropical trees: Jacaranda copaia and Handroanthus guayacan. G3 (Bethesda) 2021; 11:jkab010. [PMID: 33693604 PMCID: PMC8034707 DOI: 10.1093/g3journal/jkab010] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 12/22/2020] [Indexed: 12/01/2022]
Abstract
The lack of genomic resources for tropical canopy trees is impeding several research avenues in tropical forest biology. We present genome assemblies for two Neotropical hardwood species, Jacaranda copaia and Handroanthus (formerly Tabebuia) guayacan, that are model systems for research on tropical tree demography and flowering phenology. For each species, we combined Illumina short-read data with in vitro proximity-ligation (Chicago) libraries to generate an assembly. For Jacaranda copaia, we obtained 104X physical coverage and produced an assembly with N50/N90 scaffold lengths of 1.020/0.277 Mbp. For H. guayacan, we obtained 129X coverage and produced an assembly with N50/N90 scaffold lengths of 0.795/0.165 Mbp. J. copaia and H. guayacan assemblies contained 95.8% and 87.9% of benchmarking orthologs, although they constituted only 77.1% and 66.7% of the estimated genome sizes of 799 and 512 Mbp, respectively. These differences were potentially due to high repetitive sequence content (>59.31% and 45.59%) and high heterozygosity (0.5% and 0.8%) in each species. Finally, we compared each new assembly to a previously sequenced genome for Handroanthus impetiginosus using whole-genome alignment. This analysis indicated extensive gene duplication in H. impetiginosus since its divergence from H. guayacan.
Collapse
Affiliation(s)
- John T Burley
- Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912, USA
- Institute at Brown for Environment and Society, Brown University, Providence, RI 02912, USA
| | - James R Kellner
- Department of Ecology and Evolutionary Biology, Brown University, Providence, RI 02912, USA
- Institute at Brown for Environment and Society, Brown University, Providence, RI 02912, USA
| | - Stephen P Hubbell
- Department of Ecology and Evolutionary Biology, University of California—Los Angeles, Los Angeles, CA 90095, USA
| | - Brant C Faircloth
- Department of Biological Sciences and Museum of Natural Science, Louisiana State University, Baton Rouge, LA 70803, USA
| |
Collapse
|
40
|
Whibley A, Kelley JL, Narum SR. The changing face of genome assemblies: Guidance on achieving high-quality reference genomes. Mol Ecol Resour 2021; 21:641-652. [PMID: 33326691 DOI: 10.1111/1755-0998.13312] [Citation(s) in RCA: 25] [Impact Index Per Article: 8.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2020] [Revised: 12/08/2020] [Accepted: 12/11/2020] [Indexed: 12/20/2022]
Abstract
The quality of genome assemblies has improved rapidly in recent years due to continual advances in sequencing technology, assembly approaches, and quality control. In the field of molecular ecology, this has led to the development of exceptional quality genome assemblies that will be important long-term resources for broader studies into ecological, conservation, evolutionary, and population genomics of naturally occurring species. Moreover, the extent to which a single reference genome represents the diversity within a species varies: pan-genomes will become increasingly important ecological genomics resources, particularly in systems found to have considerable presence-absence variation in their functional content. Here, we highlight advances in technology that have raised the bar for genome assembly and provide guidance on standards to achieve exceptional quality reference genomes. Key recommendations include the following: (a) Genome assemblies should include long-read sequencing except in rare cases where it is effectively impossible to acquire adequately preserved samples needed for high molecular weight DNA standards. (b) At least one scaffolding approach should be included with genome assembly such as Hi-C or optical mapping. (c) Genome assemblies should be carefully evaluated, this may involve utilising short read data for genome polishing, error correction, k-mer analyses, and estimating the percent of reads that map back to an assembly. Finally, a genome assembly is most valuable if all data and methods are made publicly available and the utility of a genome for further studies is verified through examples. While these recommendations are based on current technology, we anticipate that future advances will push the field further and the molecular ecology community should continue to adopt new approaches that attain the highest quality genome assemblies.
Collapse
Affiliation(s)
| | | | - Shawn R Narum
- University of Idaho, Moscow, ID, USA.,Columbia River Inter-Tribal Fish Commission, Hagerman, ID, USA
| |
Collapse
|
41
|
Tse H, Tsang AKL, Chu YW, Tsang DNC. Draft Genome Sequences of 19 Clinical Isolates of Candida auris from Hong Kong. Microbiol Resour Announc 2021; 10:e00308-20. [PMID: 33414279 DOI: 10.1128/MRA.00308-20] [Citation(s) in RCA: 9] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/11/2023] Open
Abstract
Candida auris is an emerging human pathogen associated with multidrug resistance and nosocomial outbreaks. We report the draft genome sequences of 19 C. auris isolates that were associated with a cluster of cases in a hospital in Hong Kong. Candida auris is an emerging human pathogen associated with multidrug resistance and nosocomial outbreaks. We report the draft genome sequences of 19 C. auris isolates that were associated with a cluster of cases in a hospital in Hong Kong.
Collapse
|
42
|
Tsuchiya MTN, Dikow RB, Koepfli KP, Frandsen PB, Rockwood LL, Maldonado JE. Whole-Genome Sequencing of Procyonids Reveals Distinct Demographic Histories in Kinkajou (Potos flavus) and Northern Raccoon (Procyon lotor). Genome Biol Evol 2020; 13:6040737. [PMID: 33331895 PMCID: PMC7851585 DOI: 10.1093/gbe/evaa255] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/30/2020] [Indexed: 01/20/2023] Open
Abstract
Here, we present the initial comparison of the nuclear genomes of the North American raccoon (Procyon lotor) and the kinkajou (Potos flavus) based on draft assemblies. These two species encompass almost 21 Myr of evolutionary history within Procyonidae. Because assemblies greatly impact downstream results, such as gene prediction and annotation, we tested three de novo assembly strategies (implemented in ALLPATHS-LG, MaSuRCA, and Platanus), some of which are optimized for highly heterozygous genomes. We discovered significant variation in contig and scaffold N50 and L50 statistics and genome completeness depending on the de novo assembler used. We compared the performance of these three assembly algorithms in hopes that this study will aid others looking to improve the quality of existing draft genome assemblies even without additional sequence data. We also estimate the demographic histories of raccoons and kinkajous using the Pairwise Sequentially Markovian Coalescent and discuss the variation in population sizes with respect to climatic change during the Pleistocene, as well as aspects of their ecology and taxonomy. Our goal is to achieve a better understanding of the evolutionary history of procyonids and to create robust genomic resources for future studies regarding adaptive divergence and selection.
Collapse
Affiliation(s)
- Mirian T N Tsuchiya
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC, USA.,Center for Conservation Genomics, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC, USA
| | - Rebecca B Dikow
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC, USA
| | - Klaus-Peter Koepfli
- Smithsonian-Mason School of Conservation, George Mason Univeristy, Front Royal, VA, USA.,Smithsonian Conservation Biology Institute, Center for Species Survival, National Zoological Park, Washington, DC, USA
| | - Paul B Frandsen
- Data Science Lab, Office of the Chief Information Officer, Smithsonian Institution, Washington, DC, USA.,Department of Plant & Wildlife Sciences, Brigham Young University, Provo, UT, USA
| | - Larry L Rockwood
- Department of Biology, George Mason University, Fairfax, VA, USA
| | - Jesús E Maldonado
- Center for Conservation Genomics, Smithsonian Conservation Biology Institute, National Zoological Park, Washington, DC, USA.,Department of Biology, George Mason University, Fairfax, VA, USA
| |
Collapse
|
43
|
Minkin I, Medvedev P. Scalable multiple whole-genome alignment and locally collinear block construction with SibeliaZ. Nat Commun 2020; 11:6327. [PMID: 33303762 PMCID: PMC7728760 DOI: 10.1038/s41467-020-19777-8] [Citation(s) in RCA: 25] [Impact Index Per Article: 6.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2019] [Accepted: 10/29/2020] [Indexed: 11/29/2022] Open
Abstract
Multiple whole-genome alignment is a challenging problem in bioinformatics. Despite many successes, current methods are not able to keep up with the growing number, length, and complexity of assembled genomes, especially when computational resources are limited. Approaches based on compacted de Bruijn graphs to identify and extend anchors into locally collinear blocks have potential for scalability, but current methods do not scale to mammalian genomes. We present an algorithm, SibeliaZ-LCB, for identifying collinear blocks in closely related genomes based on analysis of the de Bruijn graph. We further incorporate this into a multiple whole-genome alignment pipeline called SibeliaZ. SibeliaZ shows run-time improvements over other methods while maintaining accuracy. On sixteen recently-assembled strains of mice, SibeliaZ runs in under 16 hours on a single machine, while other tools did not run to completion for eight mice within a week. SibeliaZ makes a significant step towards improving scalability of multiple whole-genome alignment and collinear block reconstruction algorithms on a single machine. Multiple whole-genome alignment is a challenging problem in bioinformatics, especially when computational resources are limited. Here the authors present SibeliaZ, an algorithm and software based on analysis of de Bruijn graphs, which provides improved computational efficiency and scalability.
Collapse
Affiliation(s)
- Ilia Minkin
- Department of Computer Science and Engineering, The Pennsylvania State University, 506 Wartik Lab University Park, University Park, PA, 16802, USA.
| | - Paul Medvedev
- Department of Computer Science and Engineering, The Pennsylvania State University, 506 Wartik Lab University Park, University Park, PA, 16802, USA.,Department of Biochemistry and Molecular Biology, The Pennsylvania State University, 506 Wartik Lab University Park, University Park, PA, 16802, USA.,Center for Computational Biology and Bioinformatics, The Pennsylvania State University, 506 Wartik Lab University Park, University Park, PA, 16802, USA
| |
Collapse
|
44
|
Xie Y, Zhong Y, Chang J, Kwan HS. Chromosome-level de novo assembly of Coprinopsis cinerea A43mut B43mut pab1-1 #326 and genetic variant identification of mutants using Nanopore MinION sequencing. Fungal Genet Biol 2020; 146:103485. [PMID: 33253902 DOI: 10.1016/j.fgb.2020.103485] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/13/2020] [Revised: 10/22/2020] [Accepted: 11/13/2020] [Indexed: 11/26/2022]
Abstract
The homokaryotic Coprinopsis cinerea strain A43mut B43mut pab1-1 #326 is a widely used experimental model for developmental studies in mushroom-forming fungi. It can grow on defined artificial media and complete the whole lifecycle within two weeks. The mutations in mating type factors A and B result in the special feature of clamp formation and fruiting without mating. This feature allows investigations and manipulations with a homokaryotic genetic background. Current genome assembly of strain #326 was based on short-read sequencing data and was highly fragmented, leading to the bias in gene annotation and downstream analyses. Here, we report a chromosome-level genome assembly of strain #326. Oxford Nanopore Technology (ONT) MinION sequencing was used to get long reads. Illumina short reads was used to polish the sequences. A combined assembly yield 13 chromosomes and a mitochondrial genome as individual scaffolds. The assembly has 15,250 annotated genes with a high synteny with the C. cinerea strain Okayama-7 #130. This assembly has great improvement on contiguity and annotations. It is a suitable reference for further genomic studies, especially for the genetic, genomic and transcriptomic analyses in ONT long reads. Single nucleotide variants and structural variants in six mutagenized and cisplatin-screened mutants could be identified and validated. A 66 bp deletion in Ras GTPase-activating protein (RasGAP) was found in all mutants. To make a better use of ONT sequencing platform, we modified a high-molecular-weight genomic DNA isolation protocol based on magnetic beads for filamentous fungi. This study showed the use of MinION to construct a fungal reference genome and to perform downstream studies in an individual laboratory. An experimental workflow was proposed, from DNA isolation and whole genome sequencing, to genome assembly and variant calling. Our results provided solutions and parameters for fungal genomic analysis on MinION sequencing platform.
Collapse
Affiliation(s)
- Yichun Xie
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region
| | - Yiyi Zhong
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region
| | - Jinhui Chang
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region; The Hong Kong Polytechnic University Shenzhen Research Institute, Shenzhen, China
| | - Hoi Shan Kwan
- School of Life Sciences, The Chinese University of Hong Kong, Shatin, New Territories, Hong Kong Special Administrative Region.
| |
Collapse
|
45
|
Abstract
Eukaryotic genome sequencing and de novo assembly, once the exclusive domain of well-funded international consortia, have become increasingly affordable, thus fitting the budgets of individual research groups. Third-generation long-read DNA sequencing technologies are increasingly used, providing extensive genomic toolkits that were once reserved for a few select model organisms. Generating high-quality genome assemblies and annotations for many aquatic species still presents significant challenges due to their large genome sizes, complexity, and high chromosome numbers. Indeed, selecting the most appropriate sequencing and software platforms and annotation pipelines for a new genome project can be daunting because tools often only work in limited contexts. In genomics, generating a high-quality genome assembly/annotation has become an indispensable tool for better understanding the biology of any species. Herein, we state 12 steps to help researchers get started in genome projects by presenting guidelines that are broadly applicable (to any species), sustainable over time, and cover all aspects of genome assembly and annotation projects from start to finish. We review some commonly used approaches, including practical methods to extract high-quality DNA and choices for the best sequencing platforms and library preparations. In addition, we discuss the range of potential bioinformatics pipelines, including structural and functional annotations (e.g., transposable elements and repetitive sequences). This paper also includes information on how to build a wide community for a genome project, the importance of data management, and how to make the data and results Findable, Accessible, Interoperable, and Reusable (FAIR) by submitting them to a public repository and sharing them with the research community.
Collapse
Affiliation(s)
- Hyungtaek Jung
- School of Biological Sciences, The University of Queensland, St Lucia, Queensland, Australia
- Centre for Agriculture and Bioeconomy, Queensland University of Technology, Brisbane, Queensland, Australia
| | - Tomer Ventura
- Genecology Research Centre, School of Science and Engineering, University of the Sunshine Coast, Sippy Downs, Queensland, Australia
| | - J. Sook Chung
- Institute of Marine and Environmental Technology, University of Maryland Center for Environmental Science, Baltimore, Maryland, United States of America
| | - Woo-Jin Kim
- Genetics and Breeding Research Center, National Institute of Fisheries Science, Geoje, Korea
| | - Bo-Hye Nam
- Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
| | - Hee Jeong Kong
- Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
| | - Young-Ok Kim
- Biotechnology Research Division, National Institute of Fisheries Science, Busan, Korea
| | - Min-Seung Jeon
- Department of Life Science, Chung-Ang University, Seoul, Korea
| | - Seong-il Eyun
- Department of Life Science, Chung-Ang University, Seoul, Korea
| |
Collapse
|
46
|
Petrushin I, Belikov S, Chernogor L. Cooperative Interaction of Janthinobacterium sp. SLB01 and Flavobacterium sp. SLB02 in the Diseased Sponge Lubomirskia baicalensis. Int J Mol Sci 2020; 21:E8128. [PMID: 33143227 DOI: 10.3390/ijms21218128] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/31/2020] [Revised: 10/23/2020] [Accepted: 10/25/2020] [Indexed: 11/17/2022] Open
Abstract
Endemic freshwater sponges (demosponges, Lubomirskiidae) dominate in Lake Baikal, Central Siberia, Russia. These sponges are multicellular filter-feeding animals that represent a complex consortium of many species of eukaryotes and prokaryotes. In recent years, mass disease and death of Lubomirskia baicalensis has been a significant problem in Lake Baikal. The etiology and ecology of these events remain unknown. Bacteria from the families Flavobacteriaceae and Oxalobacteraceae dominate the microbiomes of diseased sponges. Both species are opportunistic pathogens common in freshwater ecosystems. The aim of our study was to analyze the genomes of strains Janthinobacterium sp. SLB01 and Flavobacterium sp. SLB02, isolated from diseased sponges to identify the reasons for their joint dominance. Janthinobacterium sp. SLB01 attacks other cells using a type VI secretion system and suppresses gram-positive bacteria with violacein, and regulates its own activity via quorum sensing. It produces floc and strong biofilm by exopolysaccharide biosynthesis and PEP-CTERM/XrtA protein expression. Flavobacterium sp. SLB02 utilizes the fragments of cell walls produced by polysaccharides. These two strains have a marked difference in carbohydrate acquisition. We described a possible means of joint occupation of the ecological niche in the freshwater sponge microbial community. This study expands the understanding of the symbiotic relationship of microorganisms with freshwater Baikal sponges.
Collapse
|
47
|
Petrushin IS, Belikov SI, Belykh OI, Tikhonova I, Chernogor LI. Draft Genome Sequence of the Green Microalga Chlorella sp. Strain BAC9706, Isolated from Lake Baikal, Russia. Microbiol Resour Announc 2020; 9. [PMID: 33093044 PMCID: PMC7585853 DOI: 10.1128/mra.00966-20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Green algae of the phylum Chlorophyta are the most widespread autotrophic picoplankton in Lake Baikal (Russia). To expand our molecular biological knowledge of these microalgae and compare them in the future with an endosymbiotic strain, we present here the draft genome sequence of Chlorella sp. strain BAC9706. Green algae of the phylum Chlorophyta are the most widespread autotrophic picoplankton in Lake Baika (Russia). To expand our molecular biological knowledge of these microalgae and compare them in the future with an endosymbiotic strain, we present here the draft genome sequence of Chlorella sp. strain BAC9706.
Collapse
|
48
|
Fouret J, Brunet FG, Binet M, Aurine N, Enchéry F, Croze S, Guinier M, Goumaidi A, Preininger D, Volff JN, Bailly-Bechet M, Lachuer J, Horvat B, Legras-Lachuer C. Sequencing the Genome of Indian Flying Fox, Natural Reservoir of Nipah Virus, Using Hybrid Assembly and Conservative Secondary Scaffolding. Front Microbiol 2020; 11:1807. [PMID: 32849415 PMCID: PMC7403528 DOI: 10.3389/fmicb.2020.01807] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/15/2020] [Accepted: 07/09/2020] [Indexed: 11/20/2022] Open
Abstract
Indian fruit bats, flying fox Pteropus medius was identified as an asymptomatic natural host of recently emerged Nipah virus, which is known to induce a severe infectious disease in humans. The absence of P. medius genome sequence presents an important obstacle for further studies of virus–host interactions and better understanding of mechanisms of zoonotic viral emergence. Generation of the high-quality genome sequence is often linked to a considerable effort associated to elevated costs. Although secondary scaffolding methods have reduced sequencing expenses, they imply the development of new tools for the integration of different data sources to achieve more reliable sequencing results. We initially sequenced the P. medius genome using the combination of Illumina paired-end and Nanopore sequencing, with a depth of 57.4x and 6.1x, respectively. Then, we introduced the novel scaff2link software to integrate multiple sources of information for secondary scaffolding, allowing to remove the association with discordant information among two sources. Different quality metrics were next produced to validate the benefits from secondary scaffolding. The P. medius genome, assembled by this method, has a length of 1,985 Mb and consists of 33,613 contigs and 16,113 scaffolds with an NG50 of 19 Mb. At least 22.5% of the assembled sequences is covered by interspersed repeats already described in other species and 19,823 coding genes are annotated. Phylogenetic analysis demonstrated the clustering of P. medius genome with two other Pteropus bat species, P. alecto and P. vampyrus, for which genome sequences are currently available. SARS-CoV entry receptor ACE2 sequence of P. medius was 82.7% identical with ACE2 of Rhinolophus sinicus bats, thought to be the natural host of SARS-CoV. Altogether, our results confirm that a lower depth of sequencing is enough to obtain a valuable genome sequence, using secondary scaffolding approaches and demonstrate the benefits of the scaff2link application. The genome sequence is now available to the scientific community to (i) proceed with further genomic analysis of P. medius, (ii) to characterize the underlying mechanism allowing Nipah virus maintenance and perpetuation in its bat host, and (iii) to monitor their evolutionary pathways toward a better understanding of bats’ ability to control viral infections.
Collapse
Affiliation(s)
- Julien Fouret
- CIRI, International Center for Infectiology Research, Team Immunobiology of Viral Infections, Univ Lyon, INSERM U1111, CNRS UMR 5308, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France.,Viroscan3D, Trévoux, France
| | - Frédéric G Brunet
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, CNRS UMR 5242, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France
| | - Martin Binet
- CIRI, International Center for Infectiology Research, Team Immunobiology of Viral Infections, Univ Lyon, INSERM U1111, CNRS UMR 5308, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France.,Viroscan3D, Trévoux, France
| | - Noémie Aurine
- CIRI, International Center for Infectiology Research, Team Immunobiology of Viral Infections, Univ Lyon, INSERM U1111, CNRS UMR 5308, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France
| | - Francois Enchéry
- CIRI, International Center for Infectiology Research, Team Immunobiology of Viral Infections, Univ Lyon, INSERM U1111, CNRS UMR 5308, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France
| | - Séverine Croze
- Plateforme Profilexpert, Université Claude Bernard Lyon 1, Lyon, France
| | | | | | | | - Jean-Nicolas Volff
- Institut de Génomique Fonctionnelle de Lyon, Université de Lyon, CNRS UMR 5242, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France
| | | | - Joël Lachuer
- Cancer Research Center of Lyon, INSERM 1052/CNRS 5286, Université de Lyon, Lyon, France.,Plateforme Profilexpert, Université Claude Bernard Lyon 1, Lyon, France
| | - Branka Horvat
- CIRI, International Center for Infectiology Research, Team Immunobiology of Viral Infections, Univ Lyon, INSERM U1111, CNRS UMR 5308, Ecole Normale Supérieure de Lyon, Université Claude Bernard Lyon 1, Lyon, France
| | - Catherine Legras-Lachuer
- Viroscan3D, Trévoux, France.,Ecologie Microbienne, CNRS UMR 5557, LEM, INRA, VetAgro Sup, Université Claude Bernard Lyon 1, Villeurbanne, France
| |
Collapse
|
49
|
Jung H, Jeon MS, Hodgett M, Waterhouse P, Eyun SI. Comparative Evaluation of Genome Assemblers from Long-Read Sequencing for Plants and Crops. J Agric Food Chem 2020; 68:7670-7677. [PMID: 32530283 DOI: 10.1021/acs.jafc.0c01647] [Citation(s) in RCA: 16] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/23/2023]
Abstract
The availability of recent state-of-the-art long-read sequencing technologies has significantly increased the ease and speed of producing high-quality plant genome assemblies. A wide variety of genome-related software tools are now available and they are typically benchmarked using microbial or model eukaryotic genomes such as Arabidopsis and rice. However, many plant species have much larger and more complex genomes than these, and the choice of tools, parameters, and/or strategies that can be used is not always obvious. Thus, we have compared the metrics of assemblies generated by various pipelines to discuss how assembly quality can be affected by two different assembly strategies. First, we focused on optimizing read preprocessing and assembler variables using eight different de novo assemblers on five different Pacific Biosciences long-read datasets of diploid and tetraploid species. Then, we examined a single scaffolding tool (quickmerge) that has been employed for the postprocessing step. We then merged the outputs from multiple assemblies to produce a higher quality consensus assembly. Then, we benchmarked the assemblies for completeness and accuracy (assembly metrics and BUSCO), computer memory, and CPU times. Two lightweight assemblers, Miniasm/Minimap/Racon and WTDBG, were deemed good for novice users because they involved smaller required learning curves and light computational resources. However, two heavyweight tools, CANU and Flye, should be the first choice when the goal is to achieve accurate and complete assemblies. Our results will provide valuable guidance in future plant genome projects and beyond.
Collapse
Affiliation(s)
- Hyungtaek Jung
- Centre for Agriculture and Biocommodities, Queensland University of Technology, Brisbane, Queensland 4001, Australia
| | - Min-Seung Jeon
- Department of Life Science, Chung-Ang University, Seoul 06974, Korea
| | - Matthew Hodgett
- Information Technology Services, Queensland University of Technology, Brisbane, Queensland 4001, Australia
| | - Peter Waterhouse
- Centre for Agriculture and Biocommodities, Queensland University of Technology, Brisbane, Queensland 4001, Australia
| | - Seong-Il Eyun
- Department of Life Science, Chung-Ang University, Seoul 06974, Korea
| |
Collapse
|
50
|
Petrushin IS, Belikov SI, Chernogor LI. Draft Genome Sequence of Flavobacterium sp. Strain SLB02, Isolated from the Diseased Sponge Lubomirskia baicalensis. Microbiol Resour Announc 2020; 9:e00530-20. [PMID: 32586870 DOI: 10.1128/MRA.00530-20] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
There are significant changes in the consortium of microorganisms of freshwater Baikal sponges during their mass death, which began in 2011. The alleged cause of disease is a significant increase in the number of opportunistic microorganisms. Here, we report the draft genome sequence of Flavobacterium sp. strain SLB02. There are significant changes in the consortium of microorganisms of freshwater Baikal sponges during their mass death, which began in 2011. The alleged cause of disease is a significant increase in the number of opportunistic microorganisms. Here, we report the draft genome sequence of Flavobacterium sp. strain SLB02.
Collapse
|