1
|
Torrance EL, Burton C, Diop A, Bobay LM. Evolution of homologous recombination rates across bacteria. Proc Natl Acad Sci U S A 2024; 121:e2316302121. [PMID: 38657048 PMCID: PMC11067023 DOI: 10.1073/pnas.2316302121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/19/2023] [Accepted: 03/08/2024] [Indexed: 04/26/2024] Open
Abstract
Bacteria are nonsexual organisms but are capable of exchanging DNA at diverse degrees through homologous recombination. Intriguingly, the rates of recombination vary immensely across lineages where some species have been described as purely clonal and others as "quasi-sexual." However, estimating recombination rates has proven a difficult endeavor and estimates often vary substantially across studies. It is unclear whether these variations reflect natural variations across populations or are due to differences in methodologies. Consequently, the impact of recombination on bacterial evolution has not been extensively evaluated and the evolution of recombination rate-as a trait-remains to be accurately described. Here, we developed an approach based on Approximate Bayesian Computation that integrates multiple signals of recombination to estimate recombination rates. We inferred the rate of recombination of 162 bacterial species and one archaeon and tested the robustness of our approach. Our results confirm that recombination rates vary drastically across bacteria; however, we found that recombination rate-as a trait-is conserved in several lineages but evolves rapidly in others. Although some traits are thought to be associated with recombination rate (e.g., GC-content), we found no clear association between genomic or phenotypic traits and recombination rate. Overall, our results provide an overview of recombination rate, its evolution, and its impact on bacterial evolution.
Collapse
Affiliation(s)
- Ellis L Torrance
- Department of Biology, University of North Carolina, Greensboro, NC 27412
| | - Corey Burton
- Department of Biology, University of North Carolina, Greensboro, NC 27412
| | - Awa Diop
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695
| | - Louis-Marie Bobay
- Department of Biology, University of North Carolina, Greensboro, NC 27412
- Department of Biological Sciences, North Carolina State University, Raleigh, NC 27695
| |
Collapse
|
2
|
Liu J, Feng Y, Chen C, Yan J, Bai X, Li H, Lin C, Xiang Y, Tian W, Qi Z, Yu J, Yan X. Genomic insights into the clonal reproductive Opuntia cochenillifera: mitochondrial and chloroplast genomes of the cochineal cactus for enhanced understanding of structural dynamics and evolutionary implications. Front Plant Sci 2024; 15:1347945. [PMID: 38516667 PMCID: PMC10954886 DOI: 10.3389/fpls.2024.1347945] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 02/20/2024] [Indexed: 03/23/2024]
Abstract
Background The cochineal cactus (Opuntia cochenillifera), notable for its substantial agricultural and industrial applications, predominantly undergoes clonal reproduction, which presents significant challenges in breeding and germplasm innovation. Recent developments in mitochondrial genome engineering offer promising avenues for introducing heritable mutations, potentially facilitating selective sexual reproduction through the creation of cytoplasmic male sterile genotypes. However, the lack of comprehensive mitochondrial genome information for Opuntia species hinders these efforts. Here, we intended to sequence and characterize its mitochondrial genome to maximize the potential of its genomes for evolutionary studies, molecular breeding, and molecular marker developments. Results We sequenced the total DNA of the O. cochenillifera using DNBSEQ and Nanopore platforms. The mitochondrial genome was then assembled using a hybrid assembly strategy using Unicycler software. We found that the mitochondrial genome of O. cochenillifera has a length of 1,156,235 bp, a GC content of 43.06%, and contains 54 unique protein-coding genes and 346 simple repeats. Comparative genomic analysis revealed 48 homologous fragments shared between mitochondrial and chloroplast genomes, with a total length of 47,935 bp. Additionally, the comparison of mitochondrial genomes from four Cactaceae species highlighted their dynamic nature and frequent mitogenomic reorganizations. Conclusion Our study provides a new perspective on the evolution of the organelle genome and its potential application in genetic breeding. These findings offer valuable insights into the mitochondrial genetics of Cactaceae, potentially facilitating future research and breeding programs aimed at enhancing the genetic diversity and adaptability of O. cochenillifera by leveraging its unique mitochondrial genome characteristics.
Collapse
Affiliation(s)
- Jing Liu
- Eastern China Conservation Centre for Wild Endangered Plant Resources, Shanghai Chenshan Botanical Garden, Shanghai, China
- Zhejiang Province Key Laboratory of Plant Secondary Metabolism and Regulation, College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, China
| | - Yuqing Feng
- Zhejiang Province Key Laboratory of Plant Secondary Metabolism and Regulation, College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, China
| | - Cheng Chen
- College of Life Science, Shanghai Normal University, Shanghai, China
| | - Jing Yan
- Eastern China Conservation Centre for Wild Endangered Plant Resources, Shanghai Chenshan Botanical Garden, Shanghai, China
| | - Xinyu Bai
- Zhejiang Province Key Laboratory of Plant Secondary Metabolism and Regulation, College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, China
| | - Huiru Li
- Eastern China Conservation Centre for Wild Endangered Plant Resources, Shanghai Chenshan Botanical Garden, Shanghai, China
| | - Chen Lin
- Eastern China Conservation Centre for Wild Endangered Plant Resources, Shanghai Chenshan Botanical Garden, Shanghai, China
| | - Yinan Xiang
- Eastern China Conservation Centre for Wild Endangered Plant Resources, Shanghai Chenshan Botanical Garden, Shanghai, China
- Zhejiang Province Key Laboratory of Plant Secondary Metabolism and Regulation, College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, China
| | - Wen Tian
- Animal Plant and Food Inspection Center of Nanjing Customs District, Nanjing, China
| | - Zhechen Qi
- Zhejiang Province Key Laboratory of Plant Secondary Metabolism and Regulation, College of Life Sciences and Medicine, Zhejiang Sci-Tech University, Hangzhou, China
| | - Jing Yu
- College of Life Science, Shanghai Normal University, Shanghai, China
| | - Xiaoling Yan
- Eastern China Conservation Centre for Wild Endangered Plant Resources, Shanghai Chenshan Botanical Garden, Shanghai, China
| |
Collapse
|
3
|
Galtier N. Half a Century of Controversy: The Neutralist/Selectionist Debate in Molecular Evolution. Genome Biol Evol 2024; 16:evae003. [PMID: 38311843 PMCID: PMC10839204 DOI: 10.1093/gbe/evae003] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 01/01/2024] [Indexed: 02/06/2024] Open
Abstract
The neutral and nearly neutral theories, introduced more than 50 yr ago, have raised and still raise passionate discussion regarding the forces governing molecular evolution and their relative importance. The debate, initially focused on the amount of within-species polymorphism and constancy of the substitution rate, has spread, matured, and now underlies a wide range of topics and questions. The neutralist/selectionist controversy has structured the field and influences the way molecular evolutionary scientists conceive their research.
Collapse
Affiliation(s)
- Nicolas Galtier
- ISEM, CNRS, IRD, Université de Montpellier, Montpellier, France
| |
Collapse
|
4
|
Aliperti L, Aptekmann AA, Farfañuk G, Couso LL, Soler-Bistué A, Sánchez IE. r/K selection of GC content in prokaryotes. Environ Microbiol 2023; 25:3255-3268. [PMID: 37813828 DOI: 10.1111/1462-2920.16511] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/27/2023] [Accepted: 09/16/2023] [Indexed: 10/11/2023]
Abstract
The guanine/cytosine (GC) content of prokaryotic genomes is species-specific, taking values from 16% to 77%. This diversity of selection for GC content remains contentious. We analyse the correlations between GC content and a range of phenotypic and genotypic data in thousands of prokaryotes. GC content integrates well with these traits into r/K selection theory when phenotypic plasticity is considered. High GC-content prokaryotes are r-strategists with cheaper descendants thanks to a lower average amino acid metabolic cost, colonize unstable environments thanks to flagella and a bacillus form and are generalists in terms of resource opportunism and their defence mechanisms. Low GC content prokaryotes are K-strategists specialized for stable environments that maintain homeostasis via a high-cost outer cell membrane and endospore formation as a response to nutrient deprivation, and attain a higher nutrient-to-biomass yield. The lower proteome cost of high GC content prokaryotes is driven by the association between GC-rich codons and cheaper amino acids in the genetic code, while the correlation between GC content and genome size may be partly due to functional diversity driven by r/K selection. In all, molecular diversity in the GC content of prokaryotes may be a consequence of ecological r/K selection.
Collapse
Affiliation(s)
- Lucio Aliperti
- Facultad de Ciencias Exactas y Naturales. Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Ariel A Aptekmann
- Marine and Coastal Sciences Department, Rutgers University, New Brunswick, New Jersey, USA
| | - Gonzalo Farfañuk
- Facultad de Ciencias Exactas y Naturales. Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Luciana L Couso
- Facultad de Agronomía, Cátedra de Genética, Universidad de Buenos Aires, Buenos Aires, Argentina
| | - Alfonso Soler-Bistué
- Instituto de Investigaciones Biotecnológicas Dr. Rodolfo A. Ugalde, CONICET, Universidad Nacional de San Martín, San Martin, Argentina
| | - Ignacio E Sánchez
- Facultad de Ciencias Exactas y Naturales. Laboratorio de Fisiología de Proteínas, Consejo Nacional de Investigaciones Científicas y Técnicas, Instituto de Química Biológica de la Facultad de Ciencias Exactas y Naturales (IQUIBICEN), Universidad de Buenos Aires, Buenos Aires, Argentina
| |
Collapse
|
5
|
Liu Y, Liang N, Xian Q, Zhang W. GC heterogeneity reveals sequence-structures evolution of angiosperm ITS2. BMC Plant Biol 2023; 23:608. [PMID: 38036992 PMCID: PMC10691020 DOI: 10.1186/s12870-023-04634-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/21/2023] [Accepted: 11/26/2023] [Indexed: 12/02/2023]
Abstract
BACKGROUND Despite GC variation constitutes a fundamental element of genome and species diversity, the precise mechanisms driving it remain unclear. The abundant sequence data available for the ITS2, a commonly employed phylogenetic marker in plants, offers an exceptional resource for exploring the GC variation across angiosperms. RESULTS A comprehensive selection of 8666 species, comprising 165 genera, 63 families, and 30 orders were used for the analyses. The alignment of ITS2 sequence-structures and partitioning of secondary structures into paired and unpaired regions were performed using 4SALE. Substitution rates and frequencies among GC base-pairs in the paired regions of ITS2 were calculated using RNA-specific models in the PHASE package. The results showed that the distribution of ITS2 GC contents on the angiosperm phylogeny was heterogeneous, but their increase was generally associated with ITS2 sequence homogenization, thereby supporting the occurrence of GC-biased gene conversion (gBGC) during the concerted evolution of ITS2. Additionally, the GC content in the paired regions of the ITS2 secondary structure was significantly higher than that of the unpaired regions, indicating the selection of GC for thermodynamic stability. Furthermore, the RNA substitution models demonstrated that base-pair transformations favored both the elevation and fixation of GC in the paired regions, providing further support for gBGC. CONCLUSIONS Our findings highlight the significance of secondary structure in GC investigation, which demonstrate that both gBGC and structure-based selection are influential factors driving angiosperm ITS2 GC content.
Collapse
Affiliation(s)
- Yubo Liu
- Marine College, Shandong University, Weihai, 264209, China
- Division of Physical Biology, CAS Key Laboratory of Interfacial Physics and Technology, Shanghai Institute of Applied Physics, Chinese Academy of Sciences, University of Chinese Academy of Sciences, Shanghai, 201800, China
| | - Nan Liang
- Marine College, Shandong University, Weihai, 264209, China
- Allergy Department, State Key Laboratory of Complex Severe and Rare Diseases, Peking Union Medical College Hospital, Chinese Academy of Medical Sciences and Peking Union Medical College, Beijing, 100730, China
| | - Qing Xian
- Marine College, Shandong University, Weihai, 264209, China
| | - Wei Zhang
- Marine College, Shandong University, Weihai, 264209, China.
| |
Collapse
|
6
|
Cai X, Peng Y, Yang G, Feng L, Tian X, Huang P, Mao Y, Xu L. Populational genomic insights of Paraclostridium bifermentans as an emerging human pathogen. Front Microbiol 2023; 14:1293206. [PMID: 38029151 PMCID: PMC10665999 DOI: 10.3389/fmicb.2023.1293206] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/12/2023] [Accepted: 10/26/2023] [Indexed: 12/01/2023] Open
Abstract
Paraclostridium bifermentans (P.b) is an emerging human pathogen that is phylogenomically close to Paeniclostridium sordellii (P.s), while their populational genomic features and virulence capacity remain understudied. Here, we performed comparative genomic analyses of P.b and compared their pan-genomic features and virulence coding profiles to those of P.s. Our results revealed that P.b has a more plastic pangenome, a larger genome size, and a higher GC content than P.s. Interestingly, the P.b and P.s share similar core-genomic functions, but P.b encodes more functions in nutrient metabolism and energy conversion and fewer functions in host defense in their accessory-genomes. The P.b may initiate extracellular infection processes similar to those of P.s and Clostridium perfringens by encoding three toxin homologs (i.e., microbial collagenase, thiol-activated cytolysin, phospholipase C, which are involved in extracellular matrices degradation and membrane damaging) in their core-genomes. However, P.b is less toxic than the P.s by encoding fewer secretion toxins in the core-genome and fewer lethal toxins in the accessory-genome. Notably, P.b carries more toxins genes in their accessory-genomes, particularly those of plasmid origin. Moreover, three within-species and highly conserved plasmid groups, encoding virulence, gene acquisition, and adaptation, were carried by 25-33% of P.b strains and clustered by isolation source rather than geography. This study characterized the pan-genomic virulence features of P.b for the first time, and revealed that P. bifermentans is an emerging pathogen that can threaten human health in many aspects, emphasizing the importance of phenotypic and genomic characterizations of in situ clinical isolates.
Collapse
Affiliation(s)
- Xunchao Cai
- Department of Gastroenterology and Hepatology, Shenzhen University General Hospital, Shenzhen University, Shenzhen, Guangdong, China
- Marshall Laboratory of Biomedical Engineering, Shenzhen University, Shenzhen, Guangdong, China
| | - Yao Peng
- Department of Gastroenterology and Hepatology, Shenzhen University General Hospital, Shenzhen University, Shenzhen, Guangdong, China
| | - Gongli Yang
- Department of Gastroenterology and Hepatology, Shenzhen University General Hospital, Shenzhen University, Shenzhen, Guangdong, China
| | - Lijuan Feng
- Department of Gastroenterology and Hepatology, Shenzhen University General Hospital, Shenzhen University, Shenzhen, Guangdong, China
| | - Xiaojuan Tian
- Department of Gastroenterology and Hepatology, Shenzhen University General Hospital, Shenzhen University, Shenzhen, Guangdong, China
| | - Ping Huang
- Department of Gastroenterology and Hepatology, Shenzhen University General Hospital, Shenzhen University, Shenzhen, Guangdong, China
| | - Yanping Mao
- College of Chemistry and Environmental Engineering, Shenzhen University, Shenzhen, Guangdong, China
| | - Long Xu
- Department of Gastroenterology and Hepatology, Shenzhen University General Hospital, Shenzhen University, Shenzhen, Guangdong, China
- Marshall Laboratory of Biomedical Engineering, Shenzhen University, Shenzhen, Guangdong, China
| |
Collapse
|
7
|
Ruis C, Weimann A, Tonkin-Hill G, Pandurangan AP, Matuszewska M, Murray GGR, Lévesque RC, Blundell TL, Floto RA, Parkhill J. Mutational spectra are associated with bacterial niche. Nat Commun 2023; 14:7091. [PMID: 37925514 PMCID: PMC10625568 DOI: 10.1038/s41467-023-42916-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2023] [Accepted: 10/25/2023] [Indexed: 11/06/2023] Open
Abstract
As observed in cancers, individual mutagens and defects in DNA repair create distinctive mutational signatures that combine to form context-specific spectra within cells. We reasoned that similar processes must occur in bacterial lineages, potentially allowing decomposition analysis to detect both disruption of DNA repair processes and exposure to niche-specific mutagens. Here we reconstruct mutational spectra for 84 clades from 31 diverse bacterial species and find distinct mutational patterns. We extract signatures driven by specific DNA repair defects using hypermutator lineages, and further deconvolute the spectra into multiple signatures operating within different clades. We show that these signatures are explained by both bacterial phylogeny and replication niche. By comparing mutational spectra of clades from different environmental and biological locations, we identify niche-associated mutational signatures, and then employ these signatures to infer the predominant replication niches for several clades where this was previously obscure. Our results show that mutational spectra may be associated with sites of bacterial replication when mutagen exposures differ, and can be used in these cases to infer transmission routes for established and emergent human bacterial pathogens.
Collapse
Affiliation(s)
- Christopher Ruis
- Molecular Immunity Unit, University of Cambridge Department of Medicine, MRC-Laboratory of Molecular Biology, Cambridge, UK
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
- Cambridge Centre for AI in Medicine, University of Cambridge, Cambridge, UK
| | - Aaron Weimann
- Molecular Immunity Unit, University of Cambridge Department of Medicine, MRC-Laboratory of Molecular Biology, Cambridge, UK
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
- Cambridge Centre for AI in Medicine, University of Cambridge, Cambridge, UK
| | | | | | - Marta Matuszewska
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK
- Department of Medicine, University of Cambridge, Cambridge, UK
| | - Gemma G R Murray
- Parasites and Microbes Programme, Wellcome Sanger Institute; Wellcome Genome Campus, Cambridge, UK
| | - Roger C Lévesque
- Institut de biologie intégrative et des systèmes (IBIS), Université Laval, Québec City, Québec, Canada
| | - Tom L Blundell
- Department of Biochemistry, Sanger Building, University of Cambridge, Cambridge, UK
| | - R Andres Floto
- Molecular Immunity Unit, University of Cambridge Department of Medicine, MRC-Laboratory of Molecular Biology, Cambridge, UK.
- Cambridge Centre for AI in Medicine, University of Cambridge, Cambridge, UK.
- Cambridge Centre for Lung Infection, Papworth Hospital, Cambridge, UK.
| | - Julian Parkhill
- Department of Veterinary Medicine, University of Cambridge, Cambridge, UK.
| |
Collapse
|
8
|
Kim J, Yoon S, Kondakala S, Foley SL, Hart M, Baek DH, Wang W, Kim SK, Sutherland JB, Kim SJ, Kweon O. CPGminer: An Interactive Dashboard to Explore the Genomic Features and Taxonomy of Complete Prokaryotic Genomes. Microorganisms 2023; 11:2556. [PMID: 37894214 PMCID: PMC10609142 DOI: 10.3390/microorganisms11102556] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/29/2023] [Revised: 09/22/2023] [Accepted: 10/08/2023] [Indexed: 10/29/2023] Open
Abstract
Prokaryotes, the earliest forms of life on Earth, play crucial roles in global biogeochemical processes in virtually all ecosystems. The ever-increasing amount of prokaryotic genome sequencing data provides a wealth of information to examine fundamental and applied questions through systematic genome comparison. Genomic features, such as genome size and GC content, and taxonomy-centric genomic features of complete prokaryotic genomes (CPGs) are crucial for various fields of microbial research and education, yet they are often overlooked. Additionally, creating systematically curated datasets that align with research concerns is an essential yet challenging task for wet-lab researchers. In this study, we introduce CPGminer, a user-friendly tool that allows researchers to quickly and easily examine the genomic features and taxonomy of CPGs and curate genome datasets. We also provide several examples to demonstrate its practical utility in addressing descriptive questions.
Collapse
Affiliation(s)
- Jaehyun Kim
- Division of Bioinformatics and Biostatistics, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA;
| | - Sunghyun Yoon
- Division of Microbiology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (S.Y.); (S.K.); (S.L.F.); (M.H.); (J.B.S.)
| | - Sandeep Kondakala
- Division of Microbiology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (S.Y.); (S.K.); (S.L.F.); (M.H.); (J.B.S.)
| | - Steven L. Foley
- Division of Microbiology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (S.Y.); (S.K.); (S.L.F.); (M.H.); (J.B.S.)
| | - Mark Hart
- Division of Microbiology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (S.Y.); (S.K.); (S.L.F.); (M.H.); (J.B.S.)
| | - Dong-Heon Baek
- Department of Oral Microbiology and Immunology, School of Dentistry, Dankook University, Cheonan 31116, Republic of Korea;
| | - Wenjun Wang
- Department of Management, Marketing, and Technology, University of Arkansas at Little Rock, Little Rock, AR 72204, USA; (W.W.); (S.-K.K.)
| | - Sung-Kwan Kim
- Department of Management, Marketing, and Technology, University of Arkansas at Little Rock, Little Rock, AR 72204, USA; (W.W.); (S.-K.K.)
| | - John B. Sutherland
- Division of Microbiology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (S.Y.); (S.K.); (S.L.F.); (M.H.); (J.B.S.)
| | - Seong-Jae Kim
- Division of Microbiology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (S.Y.); (S.K.); (S.L.F.); (M.H.); (J.B.S.)
| | - Ohgew Kweon
- Division of Microbiology, National Center for Toxicological Research, U.S. Food and Drug Administration, Jefferson, AR 72079, USA; (S.Y.); (S.K.); (S.L.F.); (M.H.); (J.B.S.)
| |
Collapse
|
9
|
Molteni C, Forni D, Cagliani R, Bravo IG, Sironi M. Evolution and diversity of nucleotide and dinucleotide composition in poxviruses. J Gen Virol 2023; 104. [PMID: 37792576 DOI: 10.1099/jgv.0.001897] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/06/2023] Open
Abstract
Poxviruses (family Poxviridae) have long dsDNA genomes and infect a wide range of hosts, including insects, birds, reptiles and mammals. These viruses have substantial incidence, prevalence and disease burden in humans and in other animals. Nucleotide and dinucleotide composition, mostly CpG and TpA, have been largely studied in viral genomes because of their evolutionary and functional implications. We analysed here the nucleotide and dinucleotide composition, as well as codon usage bias, of a set of representative poxvirus genomes, with a very diverse host spectrum. After correcting for overall nucleotide composition, entomopoxviruses displayed low overall GC content, no enrichment in TpA and large variation in CpG enrichment, while chordopoxviruses showed large variation in nucleotide composition, no obvious depletion in CpG and a weak trend for TpA depletion in GC-rich genomes. Overall, intergenome variation in dinucleotide composition in poxviruses is largely accounted for by variation in overall genomic GC levels. Nonetheless, using vaccinia virus as a model, we found that genes expressed at the earliest times in infection are more CpG-depleted than genes expressed at later stages. This observation has parallels in betahepesviruses (also large dsDNA viruses) and suggests an antiviral role for the innate immune system (e.g. via the zinc-finger antiviral protein ZAP) in the early phases of poxvirus infection. We also analysed codon usage bias in poxviruses and we observed that it is mostly determined by genomic GC content, and that stratification after host taxonomy does not contribute to explaining codon usage bias diversity. By analysis of within-species diversity, we show that genomic GC content is the result of mutational biases. Poxvirus genomes that encode a DNA ligase are significantly AT-richer than those that do not, suggesting that DNA repair systems shape mutation biases. Our data shed light on the evolution of poxviruses and inform strategies for their genetic manipulation for therapeutic purposes.
Collapse
Affiliation(s)
- Cristian Molteni
- Scientific Institute IRCCS E. MEDEA, Bioinformatics, Bosisio Parini, Italy
| | - Diego Forni
- Scientific Institute IRCCS E. MEDEA, Bioinformatics, Bosisio Parini, Italy
| | - Rachele Cagliani
- Scientific Institute IRCCS E. MEDEA, Bioinformatics, Bosisio Parini, Italy
| | - Ignacio G Bravo
- Laboratoire MIVEGEC (Univ Montpellier CNRS, IRD), Centre National de la Recherche Scientifique, Montpellier, France
| | - Manuela Sironi
- Scientific Institute IRCCS E. MEDEA, Bioinformatics, Bosisio Parini, Italy
| |
Collapse
|
10
|
Gralka M, Pollak S, Cordero OX. Genome content predicts the carbon catabolic preferences of heterotrophic bacteria. Nat Microbiol 2023; 8:1799-1808. [PMID: 37653010 DOI: 10.1038/s41564-023-01458-z] [Citation(s) in RCA: 2] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2023] [Accepted: 07/24/2023] [Indexed: 09/02/2023]
Abstract
Heterotrophic bacteria-bacteria that utilize organic carbon sources-are taxonomically and functionally diverse across environments. It is challenging to map metabolic interactions and niches within microbial communities due to the large number of metabolites that could serve as potential carbon and energy sources for heterotrophs. Whether their metabolic niches can be understood using general principles, such as a small number of simplified metabolic categories, is unclear. Here we perform high-throughput metabolic profiling of 186 marine heterotrophic bacterial strains cultured in media containing one of 135 carbon substrates to determine growth rates, lag times and yields. We show that, despite high variability at all levels of taxonomy, the catabolic niches of heterotrophic bacteria can be understood in terms of their preference for either glycolytic (sugars) or gluconeogenic (amino and organic acids) carbon sources. This preference is encoded by the total number of genes found in pathways that feed into the two modes of carbon utilization and can be predicted using a simple linear model based on gene counts. This allows for coarse-grained descriptions of microbial communities in terms of prevalent modes of carbon catabolism. The sugar-acid preference is also associated with genomic GC content and thus with the carbon-nitrogen requirements of their encoded proteome. Our work reveals how the evolution of bacterial genomes is structured by fundamental constraints rooted in metabolism.
Collapse
Affiliation(s)
- Matti Gralka
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
- Systems Biology Group, Amsterdam Institute for Life and Environment (A-LIFE) and Amsterdam Institute of Molecular and Life Sciences (AIMMS), Vrije Universiteit Amsterdam, Amsterdam, The Netherlands.
| | - Shaul Pollak
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
- Division of Microbial Ecology, Centre for Microbiology and Environmental Systems Science, University of Vienna, Vienna, Austria
| | - Otto X Cordero
- Department of Civil and Environmental Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA.
| |
Collapse
|
11
|
Crestani C, Arcari G, Landier A, Passet V, Garnier D, Brémont S, Armatys N, Carmi-Leroy A, Toubiana J, Badell E, Brisse S. Corynebacterium ramonii sp. nov., a novel toxigenic member of the Corynebacterium diphtheriae species complex. Res Microbiol 2023; 174:104113. [PMID: 37572824 DOI: 10.1016/j.resmic.2023.104113] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/12/2023] [Revised: 08/04/2023] [Accepted: 08/07/2023] [Indexed: 08/14/2023]
Abstract
The Corynebacterium diphtheriae species complex comprises seven bacterial species, including Corynebacterium ulcerans, a zoonotic pathogen from multiple animal species. In this work, we characterise phenotypically and genotypically isolates belonging to two C. ulcerans lineages. Results from phylogenetic analyses, in silico DNA-DNA hybridization (DDH) and MALDI-TOF spectra differentiate lineage 2 from C. ulcerans lineage 1, which, together with their distinct transmission dynamics (probable human-to-human vs animal-to-human), indicates that lineage 2 is a separate Corynebacterium species, which we propose to name Corynebacterium ramonii. This species is of particular medical interest considering that its human-to-human transmission is likely, and that some C. ramonii isolates carry the diphtheria toxin gene.
Collapse
Affiliation(s)
- Chiara Crestani
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France.
| | - Gabriele Arcari
- Department of Molecular Medicine, Sapienza University of Rome, Rome, Italy
| | - Annie Landier
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France
| | - Virginie Passet
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France
| | - Dorian Garnier
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France
| | - Sylvie Brémont
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France; Institut Pasteur, French National Reference Center for Corynebacteria of the Diphtheriae Complex, Paris, France
| | - Nathalie Armatys
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France
| | - Annick Carmi-Leroy
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France; Institut Pasteur, French National Reference Center for Corynebacteria of the Diphtheriae Complex, Paris, France
| | - Julie Toubiana
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France; Institut Pasteur, French National Reference Center for Corynebacteria of the Diphtheriae Complex, Paris, France; Necker-Enfants Malades University Hospital, Paris, France
| | - Edgar Badell
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France; Institut Pasteur, French National Reference Center for Corynebacteria of the Diphtheriae Complex, Paris, France
| | - Sylvain Brisse
- Institut Pasteur, Université Paris Cité, Biodiversity and Epidemiology of Bacterial Pathogens, Paris, France; Institut Pasteur, French National Reference Center for Corynebacteria of the Diphtheriae Complex, Paris, France.
| |
Collapse
|
12
|
Muselmani W, Kashif-Khan N, Bagnéris C, Santangelo R, Williams MA, Savva R. A Multimodal Approach towards Genomic Identification of Protein Inhibitors of Uracil-DNA Glycosylase. Viruses 2023; 15:1348. [PMID: 37376646 DOI: 10.3390/v15061348] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/14/2023] [Revised: 06/02/2023] [Accepted: 06/06/2023] [Indexed: 06/29/2023] Open
Abstract
DNA-mimicking proteins encoded by viruses can modulate processes such as innate cellular immunity. An example is Ung-family uracil-DNA glycosylase inhibition, which prevents Ung-mediated degradation via the stoichiometric protein blockade of the Ung DNA-binding cleft. This is significant where uracil-DNA is a key determinant in the replication and distribution of virus genomes. Unrelated protein folds support a common physicochemical spatial strategy for Ung inhibition, characterised by pronounced sequence plasticity within the diverse fold families. That, and the fact that relatively few template sequences are biochemically verified to encode Ung inhibitor proteins, presents a barrier to the straightforward identification of Ung inhibitors in genomic sequences. In this study, distant homologs of known Ung inhibitors were characterised via structural biology and structure prediction methods. A recombinant cellular survival assay and in vitro biochemical assay were used to screen distant variants and mutants to further explore tolerated sequence plasticity in motifs supporting Ung inhibition. The resulting validated sequence repertoire defines an expanded set of heuristic sequence and biophysical signatures shared by known Ung inhibitor proteins. A computational search of genome database sequences and the results of recombinant tests of selected output sequences obtained are presented here.
Collapse
Affiliation(s)
- Wael Muselmani
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK
| | - Naail Kashif-Khan
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK
| | - Claire Bagnéris
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK
| | - Rosalia Santangelo
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK
| | - Mark A Williams
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK
| | - Renos Savva
- Institute of Structural and Molecular Biology, Department of Biological Sciences, Birkbeck, University of London, Malet Street, London WC1E 7HX, UK
| |
Collapse
|
13
|
Xian Q, Wang S, Liu Y, Kan S, Zhang W. Structure-Based GC Investigation Sheds New Light on ITS2 Evolution in Corydalis Species. Int J Mol Sci 2023; 24:ijms24097716. [PMID: 37175423 PMCID: PMC10178233 DOI: 10.3390/ijms24097716] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/30/2023] [Revised: 04/20/2023] [Accepted: 04/21/2023] [Indexed: 05/15/2023] Open
Abstract
Guanine and cytosine (GC) content is a fundamental component of genetic diversity and essential for phylogenetic analyses. However, the GC content of the ribosomal internal transcribed spacer 2 (ITS2) remains unknown, despite the fact that ITS2 is a widely used phylogenetic marker. Here, the ITS2 was high-throughput sequenced from 29 Corydalis species, and their GC contents were comparatively investigated in the context of ITS2's characteristic secondary structure and concerted evolution. Our results showed that the GC contents of ITS2 were 131% higher than those of their adjacent 5.8S regions, suggesting that ITS2 underwent GC-biased evolution. These GCs were distributed in a heterogeneous manner in the ITS2 secondary structure, with the paired regions being 130% larger than the unpaired regions, indicating that GC is chosen for thermodynamic stability. In addition, species with homogeneous ITS2 sequences were always GC-rich, supporting GC-biased gene conversion (gBGC), which occurred with ITS2's concerted evolution. The RNA substitution model inferred also showed a GC preference among base pair transformations, which again supports gBGC. Overall, structurally based GC investigation reveals that ITS2 evolves under structural stability and gBGC selection, significantly increasing its GC content.
Collapse
Affiliation(s)
- Qing Xian
- Marine College, Shandong University, Weihai 264209, China
| | - Suyin Wang
- Marine College, Shandong University, Weihai 264209, China
| | - Yanyan Liu
- College of Plant Protection, Henan Agricultural University, Zhengzhou 450002, China
| | - Shenglong Kan
- Shenzhen Branch, Guangdong Laboratory of Lingnan Modern Agriculture, Genome Analysis Laboratory of the Ministry of Agriculture and Rural Affairs, Agricultural Genomics Institute at Shenzhen, Chinese Academy of Agricultural Sciences, Shenzhen 518120, China
| | - Wei Zhang
- Marine College, Shandong University, Weihai 264209, China
| |
Collapse
|
14
|
Teng W, Liao B, Chen M, Shu W. Genomic Legacies of Ancient Adaptation Illuminate GC-Content Evolution in Bacteria. Microbiol Spectr 2023; 11:e0214522. [PMID: 36511682 PMCID: PMC9927291 DOI: 10.1128/spectrum.02145-22] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Bacterial evolution is characterized by strong purifying selection as well as rapid adaptive evolution in changing environments. In this context, the genomic GC content (genomic GC) varies greatly but presents some level of phylogenetic stability, making it challenging to explain based on current hypotheses. To illuminate the evolutionary mechanisms of the genomic GC, we analyzed the base composition and functional inventory of 11,083 representative genomes. A phylogenetically constrained bimodal distribution of the genomic GC, which mainly originated from parallel divergences in the early evolution, was demonstrated. Such variation of the genomic GC can be well explained by DNA replication and repair (DRR), in which multiple pathways correlate with the genomic GC. Furthermore, the biased conservation of various stress-related genes, especially the DRR-related ones, implies distinct adaptive processes in the ancestral lineages of high- or low-GC clades which are likely induced by major environmental changes. Our findings support that the mutational biases resulting from these legacies of ancient adaptation have changed the course of adaptive evolution and generated great variation in the genomic GC. This highlights the importance of indirect effects of natural selection, which indicates a new model for bacterial evolution. IMPORTANCE GC content has been shown to be an important factor in microbial ecology and evolution, and the genomic GC of bacteria can be characterized by great intergenomic heterogeneity, high intragenomic homogeneity, and strong phylogenetic inertia, as well as being associated with the environment. Current hypotheses concerning direct selection or mutational biases cannot well explain these features simultaneously. Our findings of the genomic GC showing that ancient adaptations have transformed the DRR system and that the resulting mutational biases further contributed to a bimodal distribution of it offer a more reasonable scenario for the mechanism. This would imply that, when thinking about the evolution of life, diverse processes of adaptation exist, and combined effects of natural selection should be considered.
Collapse
Affiliation(s)
- Wenkai Teng
- School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Bin Liao
- School of Life Sciences, Sun Yat-sen University, Guangzhou, Guangdong, China
| | - Mengyun Chen
- School of Life Sciences, South China Normal University, Guangzhou, Guangdong, China
| | - Wensheng Shu
- School of Life Sciences, South China Normal University, Guangzhou, Guangdong, China
| |
Collapse
|
15
|
Guardia AE, Wagner A, Busalmen JP, Di Capua C, Cortéz N, Beligni MV. The draft genome of Andean Rhodopseudomonas sp. strain AZUL predicts genome plasticity and adaptation to chemical homeostasis. BMC Microbiol 2022; 22:297. [PMID: 36494611 PMCID: PMC9733117 DOI: 10.1186/s12866-022-02685-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/07/2021] [Accepted: 10/29/2022] [Indexed: 12/13/2022] Open
Abstract
The genus Rhodopseudomonas comprises purple non-sulfur bacteria with extremely versatile metabolisms. Characterization of several strains revealed that each is a distinct ecotype highly adapted to its specific micro-habitat. Here we present the sequencing, genomic comparison and functional annotation of AZUL, a Rhodopseudomonas strain isolated from a high altitude Andean lagoon dominated by extreme conditions and fluctuating levels of chemicals. Average nucleotide identity (ANI) analysis of 39 strains of this genus showed that the genome of AZUL is 96.2% identical to that of strain AAP120, which suggests that they belong to the same species. ANI values also show clear separation at the species level with the rest of the strains, being more closely related to R. palustris. Pangenomic analyses revealed that the genus Rhodopseudomonas has an open pangenome and that its core genome represents roughly 5 to 12% of the total gene repertoire of the genus. Functional annotation showed that AZUL has genes that participate in conferring genome plasticity and that, in addition to sharing the basal metabolic complexity of the genus, it is also specialized in metal and multidrug resistance and in responding to nutrient limitation. Our results also indicate that AZUL might have evolved to use some of the mechanisms involved in resistance as redox reactions for bioenergetic purposes. Most of those features are shared with strain AAP120, and mainly involve the presence of additional orthologs responsible for the mentioned processes. Altogether, our results suggest that AZUL, one of the few bacteria from its habitat with a sequenced genome, is highly adapted to the extreme and changing conditions that constitute its niche.
Collapse
Affiliation(s)
- Aisha E. Guardia
- grid.473319.b0000 0004 0461 9871Ingeniería de Interfases y Bioprocesos, Instituto de Tecnología de Materiales (INTEMA-CONICET-UNMdP), Mar del Plata, Argentina
| | - Agustín Wagner
- grid.10814.3c0000 0001 2097 3211Facultad de Ciencias Agrarias, Universidad Nacional de Rosario, Zavalla, Argentina
| | - Juan P. Busalmen
- grid.473319.b0000 0004 0461 9871Ingeniería de Interfases y Bioprocesos, Instituto de Tecnología de Materiales (INTEMA-CONICET-UNMdP), Mar del Plata, Argentina
| | - Cecilia Di Capua
- grid.501777.30000 0004 0638 1836Facultad de Ciencias Bioquímicas y Farmacéuticas, Instituto de Biología Molecular y Celular de Rosario (IBR-CONICET-UNR), Universidad Nacional de Rosario, Rosario, Argentina
| | - Néstor Cortéz
- grid.501777.30000 0004 0638 1836Facultad de Ciencias Bioquímicas y Farmacéuticas, Instituto de Biología Molecular y Celular de Rosario (IBR-CONICET-UNR), Universidad Nacional de Rosario, Rosario, Argentina
| | - María V. Beligni
- grid.412221.60000 0000 9969 0902Instituto de Investigaciones Biológicas (IIB-CONICET-UNMdP), Facultad de Ciencias Exactas y Naturales, Universidad Nacional de Mar del Plata, Mar del Plata, Argentina
| |
Collapse
|
16
|
Li Y, Khandia R, Papadakis M, Alexiou A, Simonov AN, Khan AA. An investigation of codon usage pattern analysis in pancreatitis associated genes. BMC Genom Data 2022; 23:81. [PMID: 36434531 PMCID: PMC9700901 DOI: 10.1186/s12863-022-01089-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2022] [Accepted: 10/10/2022] [Indexed: 11/27/2022] Open
Abstract
BACKGROUND Pancreatitis is an inflammatory disorder resulting from the autoactivation of trypsinogen in the pancreas. The genetic basis of the disease is an old phenomenon, and evidence is accumulating for the involvement of synonymous/non-synonymous codon variants in disease initiation and progression. RESULTS The present study envisaged a panel of 26 genes involved in pancreatitis for their codon choices, compositional analysis, relative dinucleotide frequency, nucleotide disproportion, protein physical properties, gene expression, codon bias, and interrelated of all these factors. In this set of genes, gene length was positively correlated with nucleotide skews and codon usage bias. Codon usage of any gene is dependent upon its AT and GC component; however, AGG, CGT, and CGA encoding for Arg, TCG for Ser, GTC for Val, and CCA for Pro were independent of nucleotide compositions. In addition, Codon GTC showed a correlation with protein properties, isoelectric point, instability index, and frequency of basic amino acids. We also investigated the effect of various evolutionary forces in shaping the codon usage choices of genes. CONCLUSIONS This study will enable us to gain insight into the molecular signatures associated with the disease that might help identify more potential genes contributing to enhanced risk for pancreatitis. All the genes associated with pancreatitis are generally associated with physiological function, and mutations causing loss of function, over or under expression leads to an ailment. Therefore, the present study attempts to envisage the molecular signature in a group of genes that lead to pancreatitis in case of malfunction.
Collapse
Affiliation(s)
- Yuanyang Li
- Third-Grade Pharmacological Laboratory On Chinese Medicine Approved By State Administration of Traditional Chinese Medicine, Medical College of China Three Gorges, Yichang, China ,grid.254148.e0000 0001 0033 6389College of Medical Science, China Three Gorges University, Yichang, China
| | - Rekha Khandia
- grid.411530.20000 0001 0694 3745Department of Biochemistry and Genetics, Barkatullah University, Bhopal, MP 462026 India
| | - Marios Papadakis
- grid.412581.b0000 0000 9024 6397Department of Surgery II, University Hospital Witten-Herdecke, University of Witten-Herdecke, Heusnerstrasse 40, 42283 Wuppertal, Germany
| | - Athanasios Alexiou
- Department of Science and Engineering, Novel Global Community Educational Foundation, Hebersham, Australia ,AFNP Med Austria, Vienna, Austria
| | | | - Azmat Ali Khan
- grid.56302.320000 0004 1773 5396Pharmaceutical Biotechnology Laboratory, Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, Riyadh, 11451 Saudi Arabia
| |
Collapse
|
17
|
Bohlin J. A simple stochastic model describing the evolution of genomic GC content in asexually reproducing organisms. Sci Rep 2022; 12:18569. [PMID: 36329129 PMCID: PMC9631610 DOI: 10.1038/s41598-022-21709-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/01/2022] [Accepted: 09/30/2022] [Indexed: 11/06/2022] Open
Abstract
A genome's nucleotide composition can usually be summarized with (G)uanine + (C)ytosine (GC) or (A)denine + (T)hymine (AT) frequencies as GC% = 100% - AT%. Genomic AT/GC content has been linked to environment and selective processes in asexually reproducing organisms. A model is presented relating the evolution of genomic GC content over time to AT [Formula: see text] GC and GC [Formula: see text] AT mutation rates. By employing Itô calculus it is shown that if mutation rates are subject to random perturbations, that can vary over time, several implications follow. In particular, an extra Brownian motion term appears influencing genomic nucleotide variability; the greater the random perturbations the more genomic nucleotide variability. This can have several interpretations depending on the context. For instance, reducing the influence of the random perturbations on the AT/GC mutation rates and thus genomic nucleotide variability, to limit fitness decreasing and deleterious mutations, will likely suggest channeling of resources. On the other hand, increased genomic nucleotide diversity may be beneficial in variable environments. In asexually reproducing organisms, the Brownian motion term can be considered to be inversely reflective of the selective pressures an organism is subjected to at the molecular level. The presented model is a generalization of a previous model, limited to microbial symbionts, to all asexually reproducing, non-recombining organisms. Last, a connection between the presented model and the classical Luria-Delbrück mutation model is presented in an Itô calculus setting.
Collapse
Affiliation(s)
- Jon Bohlin
- grid.418193.60000 0001 1541 4204Division of Infection Control, Department of Methods Development and Analysis, Norwegian Institute of Public Health, Oslo, Norway ,grid.418193.60000 0001 1541 4204Centre for Fertility and Health, Norwegian Institute of Public Health, P.O. Box 4404, Lovisenberggata 8, 0403 Oslo, Norway
| |
Collapse
|
18
|
Qian C, Ma J, Liang J, Zhang L, Liang X. Comprehensive deciphering prophages in genus Acetobacter on the ecology, genomic features, toxin–antitoxin system, and linkage with CRISPR-Cas system. Front Microbiol 2022; 13:951030. [PMID: 35983328 PMCID: PMC9379143 DOI: 10.3389/fmicb.2022.951030] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2022] [Accepted: 07/07/2022] [Indexed: 11/13/2022] Open
Abstract
Acetobacter is the predominant microbe in vinegar production, particularly in those natural fermentations that are achieved by complex microbial communities. Co-evolution of prophages with Acetobacter, including integration, release, and dissemination, heavily affects the genome stability and production performance of industrial strains. However, little has been discussed yet about prophages in Acetobacter. Here, prophage prediction analysis using 148 available genomes from 34 Acetobacter species was carried out. In addition, the type II toxin–antitoxin systems (TAs) and CRISPR-Cas systems encoded by prophages or the chromosome were analyzed. Totally, 12,000 prophage fragments were found, of which 350 putatively active prophages were identified in 86.5% of the selected genomes. Most of the active prophages (83.4%) belonged to the order Caudovirales dominated by the families Siphoviridae and Myroviridae prophages (71.4%). Notably, Acetobacter strains survived in complex environments that frequently carried multiple prophages compared with that in restricted habits. Acetobacter prophages showed high genome diversity and horizontal gene transfer across different bacterial species by genomic feature characterization, average nucleotide identity (ANI), and gene structure visualization analyses. About 31.14% of prophages carry type II TAS, suggesting its important role in addiction, bacterial defense, and growth-associated bioprocesses to prophages and hosts. Intriguingly, the genes coding for Cse1, Cse2, Cse3, Cse4, and Cas5e involved in type I-E and Csy4 involved in type I-F CRISPR arrays were firstly found in two prophages. Type II-C CRISPR-Cas system existed only in Acetobacter aceti, while the other Acetobacter species harbored the intact or eroded type I CRISPR-Cas systems. Totally, the results of this study provide fundamental clues for future studies on the role of prophages in the cell physiology and environmental behavior of Acetobacter.
Collapse
|
19
|
Mahajan S, Agashe D. Evolutionary jumps in bacterial GC content. G3 Genes|Genomes|Genetics 2022; 12:6586800. [PMID: 35579351 PMCID: PMC9339322 DOI: 10.1093/g3journal/jkac108] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/14/2022] [Accepted: 04/20/2022] [Indexed: 11/29/2022]
Abstract
Genomic GC (Guanine-Cytosine) content is a fundamental molecular trait linked with many key genomic features such as codon and amino acid use. Across bacteria, GC content is surprisingly diverse and has been studied for many decades; yet its evolution remains incompletely understood. Since it is difficult to observe GC content evolve on laboratory time scales, phylogenetic comparative approaches are instrumental; but this dimension is rarely studied systematically in the case of bacterial GC content. We applied phylogenetic comparative models to analyze GC content evolution in multiple bacterial groups across 2 major bacterial phyla. We find that GC content diversifies via a combination of gradual evolution and evolutionary “jumps.” Surprisingly, unlike prior reports that solely focused on reductions in GC, we found a comparable number of jumps with both increased and decreased GC content. Overall, many of the identified jumps occur in lineages beyond the well-studied peculiar examples of endosymbiotic and AT-rich marine bacteria and do not support the predicted role of oxygen dependence. Our analysis of rapid and large shifts in GC content thus identifies new clades and novel contexts to further understand the ecological and evolutionary drivers of this important genomic trait.
Collapse
Affiliation(s)
- Saurabh Mahajan
- National Centre for Biological Sciences, Tata Institute of Fundamental Research , Bengaluru 560065, India
- Atria University , Bengaluru 560024, India
| | - Deepa Agashe
- National Centre for Biological Sciences, Tata Institute of Fundamental Research , Bengaluru 560065, India
| |
Collapse
|
20
|
Han D, Zhen H, Liu X, Zulewska J, Yang Z. Organelle 16S rRNA amplicon sequencing enables profiling of active gut microbiota in murine model. Appl Microbiol Biotechnol 2022; 106:5715-5728. [PMID: 35896837 DOI: 10.1007/s00253-022-12083-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2022] [Revised: 07/14/2022] [Accepted: 07/16/2022] [Indexed: 11/02/2022]
Abstract
High-throughput sequencing of ribosomal RNA (rRNA) amplicons has served as a cornerstone in microbiome studies. Despite crucial implication of organelle 16S rRNA measurements to host gut microbial activities, genomic DNA (gDNA) was overwhelmingly targeted for amplicon sequencings. Although gDNA could be a reliable resource for gene existing validation, little information is revealed in regard to the activity of microorganisms owing to the limited changes gDNA undertaken in inactive, dormant, and dead bacteria. We applied both rRNA- and gDNA-derived sequencings on mouse cecal contents. Respective experimental designs were verified to be suitable for nucleic acid (NA) purification. Via benchmarking, mainstream 16S rRNA hypervariable region targets and reference databases were proven adequate for respective amplicon sequencing study. In phylogenetic studies, significant microbial composition differences were observed between two methods. Desulfovibrio spp. (an important group of anaerobic gut microorganisms that has caused analytical difficulties), Pediococcus spp., and Proteobacteria were drastically lower as represented by gDNA-derived compositions, while microbes like Firmicutes were higher as represented by gDNA-derived microbiome compositions. Also, using PICRUSt2 as an example, we illustrated that rRNA-derived sequencing might be more suitable for microbiome function predictions since pathways like sugar metabolism were lower as represented by rRNA-derived results. The findings of this study demonstrated that rRNA-derived amplicon sequencing could improve identification capability of specific gut microorganisms and might be more suitable for in silico microbiome function predictions. Therefore, rRNA-derived amplicon sequencings, preferably coupled with gDNA-derived ones, could be used as a capable tool to unveil active microbial components in host gut. KEY POINTS: • Conventional pipelines were adequate for the respective amplicon sequencing study • Groups, such as Desulfovibrio spp., were differently represented by two methods • Comparative amplicon sequencings could be useful in host active microbiota studies.
Collapse
Affiliation(s)
- Dong Han
- Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, School of Food and Health, Beijing Technology and Business University, Beijing, China.,Key Laboratory of Food Bioengineering, (China National Light Industry), College of Food Science and Nutritional Engineering, China Agricultural University, Beijing, China
| | - Hongmin Zhen
- Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, School of Food and Health, Beijing Technology and Business University, Beijing, China
| | - Xiaoyan Liu
- Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, School of Food and Health, Beijing Technology and Business University, Beijing, China
| | - Justyna Zulewska
- Department of Dairy Science and Quality Management, Faculty of Food Sciences, University of Warmia and Mazury in Olsztyn, Olsztyn, Poland
| | - Zhennai Yang
- Innovation Center for Food Nutrition and Human Health, Beijing Engineering and Technology Research Center of Food Additives, School of Food and Health, Beijing Technology and Business University, Beijing, China.
| |
Collapse
|
21
|
Ho AT, Hurst LD. Stop codon usage as a window into genome evolution: mutation, selection, biased gene conversion and the TAG paradox. Genome Biol Evol 2022; 14:6648529. [PMID: 35867377 PMCID: PMC9348620 DOI: 10.1093/gbe/evac115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 07/17/2022] [Indexed: 11/16/2022] Open
Abstract
Protein coding genes terminate with one of three stop codons (TAA, TGA, or TAG) that, like synonymous codons, are not employed equally. With TGA and TAG having identical nucleotide content, analysis of their differential usage provides an unusual window into the forces operating on what are ostensibly functionally identical residues. Across genomes and between isochores within the human genome, TGA usage increases with G + C content but, with a common G + C → A + T mutation bias, this cannot be explained by mutation bias-drift equilibrium. Increased usage of TGA in G + C-rich genomes or genomic regions is also unlikely to reflect selection for the optimal stop codon, as TAA appears to be universally optimal, probably because it has the lowest read-through rate. Despite TAA being favored by selection and mutation bias, as with codon usage bias G + C pressure is the prime determinant of between-species TGA usage trends. In species with strong G + C-biased gene conversion (gBGC), such as mammals and birds, the high usage and conservation of TGA is best explained by an A + T → G + C repair bias. How to explain TGA enrichment in other G + C-rich genomes is less clear. Enigmatically, across bacterial and archaeal species and between human isochores TAG usage is mostly unresponsive to G + C pressure. This unresponsiveness we dub the TAG paradox as currently no mutational, selective, or gBGC model provides a well-supported explanation. That TAG does increase with G + C usage across eukaryotes makes the usage elsewhere yet more enigmatic. We suggest resolution of the TAG paradox may provide insights into either an unknown but common selective preference (probably at the DNA/RNA level) or an unrecognized complexity to the action of gBGC.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, UK
| | | |
Collapse
|
22
|
Cope AL, Shah P. Intragenomic variation in non-adaptive nucleotide biases causes underestimation of selection on synonymous codon usage. PLoS Genet 2022; 18:e1010256. [PMID: 35714134 PMCID: PMC9246145 DOI: 10.1371/journal.pgen.1010256] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2021] [Revised: 06/30/2022] [Accepted: 05/13/2022] [Indexed: 11/20/2022] Open
Abstract
Patterns of non-uniform usage of synonymous codons vary across genes in an organism and between species across all domains of life. This codon usage bias (CUB) is due to a combination of non-adaptive (e.g. mutation biases) and adaptive (e.g. natural selection for translation efficiency/accuracy) evolutionary forces. Most models quantify the effects of mutation bias and selection on CUB assuming uniform mutational and other non-adaptive forces across the genome. However, non-adaptive nucleotide biases can vary within a genome due to processes such as biased gene conversion (BGC), potentially obfuscating signals of selection on codon usage. Moreover, genome-wide estimates of non-adaptive nucleotide biases are lacking for non-model organisms. We combine an unsupervised learning method with a population genetics model of synonymous coding sequence evolution to assess the impact of intragenomic variation in non-adaptive nucleotide bias on quantification of natural selection on synonymous codon usage across 49 Saccharomycotina yeasts. We find that in the absence of a priori information, unsupervised learning can be used to identify genes evolving under different non-adaptive nucleotide biases. We find that the impact of intragenomic variation in non-adaptive nucleotide bias varies widely, even among closely-related species. We show that the overall strength and direction of translational selection can be underestimated by failing to account for intragenomic variation in non-adaptive nucleotide biases. Interestingly, genes falling into clusters identified by machine learning are also physically clustered across chromosomes. Our results indicate the need for more nuanced models of sequence evolution that systematically incorporate the effects of variable non-adaptive nucleotide biases on codon frequencies. Codon usage bias (CUB), or the unequal usage of codons of the same amino acid (i.e. synonymous codons), has been observed in species across all domains of life. CUB is known to be shaped by both non-adaptive (e.g. mutation biases) and adaptive (e.g. natural selection for translation efficiency/accuracy) evolution. A key challenge for researchers is disentangling the role of various processes shaping codon usage, often for the purpose of identifying codons favored by natural selection, sometimes referred to as “optimal” or “preferred” codons. Despite large variation in non-adaptive nucleotide biases within a genome, most methods to quantify natural selection typically ignore this variation for the sake of simplicity. Here, we combine a population genetics model with unsupervised machine learning to identify genes evolving under different non-adaptive nucleotide biases across 49 budding yeasts species. We find that ignoring for variation in non-adaptive nucleotide biases can obfuscate signals of selection on codon usage. Our results indicate the need for more nuanced models of coding sequence evolution.
Collapse
Affiliation(s)
- Alexander L. Cope
- Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
- Robert Wood Johnson Medical School, Rutgers University, Piscataway, New Jersey, United States of America
- * E-mail: (ALC); (PS)
| | - Premal Shah
- Department of Genetics, Rutgers University, Piscataway, New Jersey, United States of America
- Human Genetics Institute of New Jersey, Rutgers University, Piscataway, New Jersey, United States of America
- * E-mail: (ALC); (PS)
| |
Collapse
|
23
|
Ho AT, Hurst LD. Unusual mammalian usage of TGA stop codons reveals that sequence conservation need not imply purifying selection. PLoS Biol 2022; 20:e3001588. [PMID: 35550630 PMCID: PMC9129041 DOI: 10.1371/journal.pbio.3001588] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/17/2022] [Revised: 05/24/2022] [Accepted: 04/20/2022] [Indexed: 11/18/2022] Open
Abstract
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.
Collapse
Affiliation(s)
- Alexander Thomas Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- * E-mail:
| | | |
Collapse
|
24
|
Carpentier F, Rodríguez de la Vega RC, Jay P, Duhamel M, Shykoff JA, Perlin MH, Wallen RM, Hood ME, Giraud T. Tempo of degeneration across independently evolved non-recombining regions. Mol Biol Evol 2022; 39:6553583. [PMID: 35325190 PMCID: PMC9004411 DOI: 10.1093/molbev/msac060] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022] Open
Abstract
Recombination is beneficial over the long term, allowing more effective selection. Despite long-term advantages of recombination, local recombination suppression can evolve and lead to genomic degeneration, in particular on sex chromosomes. Here, we investigated the tempo of degeneration in nonrecombining regions, that is, the function curve for the accumulation of deleterious mutations over time, leveraging on 22 independent events of recombination suppression identified on mating-type chromosomes of anther-smut fungi, including newly identified ones. Using previously available and newly generated high-quality genome assemblies of alternative mating types of 13 Microbotryum species, we estimated degeneration levels in terms of accumulation of nonoptimal codons and nonsynonymous substitutions in nonrecombining regions. We found a reduced frequency of optimal codons in the nonrecombining regions compared with autosomes, that was not due to less frequent GC-biased gene conversion or lower ancestral expression levels compared with recombining regions. The frequency of optimal codons rapidly decreased following recombination suppression and reached an asymptote after ca. 3 Ma. The strength of purifying selection remained virtually constant at dN/dS = 0.55, that is, at an intermediate level between purifying selection and neutral evolution. Accordingly, nonsynonymous differences between mating-type chromosomes increased linearly with stratum age, at a rate of 0.015 per My. We thus develop a method for disentangling effects of reduced selection efficacy from GC-biased gene conversion in the evolution of codon usage and we quantify the tempo of degeneration in nonrecombining regions, which is important for our knowledge on genomic evolution and on the maintenance of regions without recombination.
Collapse
Affiliation(s)
- Fantin Carpentier
- Laboratoire Ecologie Systématique et Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay, 91400 Orsay, France
- Université de Lille, CNRS, UMR 8198-Evo-Eco-Paleo F-59000, Lille, France
| | - Ricardo C. Rodríguez de la Vega
- Laboratoire Ecologie Systématique et Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay, 91400 Orsay, France
- Corresponding authors: E-mails: ;
| | - Paul Jay
- Laboratoire Ecologie Systématique et Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay, 91400 Orsay, France
| | - Marine Duhamel
- Laboratoire Ecologie Systématique et Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay, 91400 Orsay, France
- Evolution der Pflanzen und Pilze, Ruhr-Universität Bochum, Universitätsstraße 150, 44780, Bochum, Germany
| | - Jacqui A. Shykoff
- Laboratoire Ecologie Systématique et Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay, 91400 Orsay, France
| | - Michael H. Perlin
- Department of Biology, Program on Disease Evolution, University of Louisville, Louisville, KY 40292, USA
| | - R. Margaret Wallen
- Department of Biology, Program on Disease Evolution, University of Louisville, Louisville, KY 40292, USA
| | | | - Tatiana Giraud
- Laboratoire Ecologie Systématique et Evolution, Bâtiment 360, CNRS, AgroParisTech, Université Paris-Saclay, 91400 Orsay, France
- Corresponding authors: E-mails: ;
| |
Collapse
|
25
|
Grébert T, Garczarek L, Daubin V, Humily F, Marie D, Ratin M, Devailly A, Farrant GK, Mary I, Mella-Flores D, Tanguy G, Labadie K, Wincker P, Kehoe DM, Partensky F. Diversity and evolution of pigment types in marine Synechococcus cyanobacteria. Genome Biol Evol 2022; 14:6547267. [PMID: 35276007 PMCID: PMC8995045 DOI: 10.1093/gbe/evac035] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 03/07/2022] [Indexed: 11/14/2022] Open
Abstract
Synechococcus cyanobacteria are ubiquitous and abundant in the marine environment and contribute to an estimated 16% of the ocean net primary productivity. Their light-harvesting complexes, called phycobilisomes (PBS), are composed of a conserved allophycocyanin core, from which radiates six to eight rods with variable phycobiliprotein and chromophore content. This variability allows Synechococcus cells to optimally exploit the wide variety of spectral niches existing in marine ecosystems. Seven distinct pigment types or subtypes have been identified so far in this taxon based on the phycobiliprotein composition and/or the proportion of the different chromophores in PBS rods. Most genes involved in their biosynthesis and regulation are located in a dedicated genomic region called the PBS rod region. Here, we examine the variability of gene content and organization of this genomic region in a large set of sequenced isolates and natural populations of Synechococcus representative of all known pigment types. All regions start with a tRNA-PheGAA and some possess mobile elements for DNA integration and site-specific recombination, suggesting that their genomic variability relies in part on a “tycheposon”-like mechanism. Comparison of the phylogenies obtained for PBS and core genes revealed that the evolutionary history of PBS rod genes differs from the core genome and is characterized by the co-existence of different alleles and frequent allelic exchange. We propose a scenario for the evolution of the different pigment types and highlight the importance of incomplete lineage sorting in maintaining a wide diversity of pigment types in different Synechococcus lineages despite multiple speciation events.
Collapse
Affiliation(s)
- Théophile Grébert
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Laurence Garczarek
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Vincent Daubin
- Université Lyon 1, UMR 5558 Biometry and Evolutionary Biology, 69622 Villeurbanne, France
| | - Florian Humily
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Dominique Marie
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Morgane Ratin
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Alban Devailly
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Gregory K Farrant
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Isabelle Mary
- Université Clermont Auvergne, CNRS, Laboratoire Microorganismes: Génome et Environnement, 63000 Clermont-Ferrand, France
| | - Daniella Mella-Flores
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| | - Gwenn Tanguy
- Centre National de la Recherche Scientifique, FR 2424, Station Biologique, 29680 Roscoff, France
| | - Karine Labadie
- Genoscope, Institut de biologie François-Jacob, Commissariat à l'Énergie Atomique (CEA), Université Paris-Saclay, Evry, France
| | - Patrick Wincker
- Génomique Métabolique, Genoscope, Institut de biologie François Jacob, CEA, CNRS, Université d'Evry, Université Paris-Saclay, Evry, France
| | - David M Kehoe
- Department of Biology, Indiana University, Bloomington, IN 47405, USA
| | - Frédéric Partensky
- Sorbonne Université, Centre National de la Recherche Scientifique, UMR 7144 Adaptation and Diversity in the Marine Environment, Station Biologique, 29680 Roscoff, France
| |
Collapse
|
26
|
Hu EZ, Lan XR, Liu ZL, Gao J, Niu DK. A positive correlation between GC content and growth temperature in prokaryotes. BMC Genomics 2022; 23:110. [PMID: 35139824 PMCID: PMC8827189 DOI: 10.1186/s12864-022-08353-7] [Citation(s) in RCA: 30] [Impact Index Per Article: 15.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2021] [Accepted: 01/31/2022] [Indexed: 01/27/2023] Open
Abstract
Background GC pairs are generally more stable than AT pairs; GC-rich genomes were proposed to be more adapted to high temperatures than AT-rich genomes. Previous studies consistently showed positive correlations between growth temperature and the GC contents of structural RNA genes. However, for the whole genome sequences and the silent sites of the codons in protein-coding genes, the relationship between GC content and growth temperature is in a long-lasting debate. Results With a dataset much larger than previous studies (681 bacteria and 155 archaea with completely assembled genomes), our phylogenetic comparative analyses showed positive correlations between optimal growth temperature (Topt) and GC content both in bacterial and archaeal structural RNA genes and in bacterial whole genome sequences, chromosomal sequences, plasmid sequences, core genes, and accessory genes. However, in the 155 archaea, we did not observe a significant positive correlation of Topt with whole-genome GC content (GCw) or GC content at four-fold degenerate sites. We randomly drew 155 samples from the 681 bacteria for 1000 rounds. In most cases (> 95%), the positive correlations between Topt and genomic GC contents became statistically nonsignificant (P > 0.05). This result suggested that the small sample sizes might account for the lack of positive correlations between growth temperature and genomic GC content in the 155 archaea and the bacterial samples of previous studies. Comparing the GC content among four categories (psychrophiles/psychrotrophiles, mesophiles, thermophiles, and hyperthermophiles) also revealed a positive correlation between GCw and growth temperature in bacteria. By including the GCw of incompletely assembled genomes, we expanded the sample size of archaea to 303. Positive correlations between GCw and Topt appear especially after excluding the halophilic archaea whose GC contents might be strongly shaped by intense UV radiation. Conclusions This study explains the previous contradictory observations and ends a long debate. Prokaryotes growing in high temperatures have higher GC contents. Thermal adaptation is one possible explanation for the positive association. Meanwhile, we propose that the elevated efficiency of DNA repair in response to heat mutagenesis might have the by-product of increasing GC content like that happens in intracellular symbionts and marine bacterioplankton. Supplementary Information The online version contains supplementary material available at 10.1186/s12864-022-08353-7.
Collapse
Affiliation(s)
- En-Ze Hu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Xin-Ran Lan
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Zhi-Ling Liu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Jie Gao
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China
| | - Deng-Ke Niu
- MOE Key Laboratory for Biodiversity Science and Ecological Engineering and Beijing Key Laboratory of Gene Resource and Molecular Development, College of Life Sciences, Beijing Normal University, Beijing, 100875, China.
| |
Collapse
|
27
|
Krasovec M, Filatov DA. Codon Usage Bias in Phytoplankton. JMSE 2022; 10:168. [DOI: 10.3390/jmse10020168] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/10/2022]
Abstract
Non-random usage of synonymous codons, known as “codon bias”, has been described in many organisms, from bacteria to Drosophila, but little is known about it in phytoplankton. This phenomenon is thought to be driven by selection for translational efficiency. As the efficacy of selection is proportional to the effective population size, species with large population sizes, such as phytoplankton, are expected to have strong codon bias. To test this, we measured codon bias in 215 strains from Haptophyta, Chlorophyta, Ochrophyta (except diatoms that were studied previously), Dinophyta, Cryptophyta, Ciliophora, unicellular Rhodophyta and Chlorarachniophyta. Codon bias is modest in most groups, despite the astronomically large population sizes of marine phytoplankton. The strength of the codon bias, measured with the effective number of codons, is the strongest in Haptophyta and the weakest in Chlorarachniophyta. The optimal codons are GC-ending in most cases, but several shifts to AT-ending codons were observed (mainly in Ochrophyta and Ciliophora). As it takes a long time to reach a new equilibrium after such shifts, species having AT-ending codons show a lower frequency of optimal codons compared to other species. Genetic diversity, calculated for species with more than three strains sequenced, is modest, indicating that the effective population sizes are many orders of magnitude lower than the astronomically large census population sizes, which helps to explain the modest codon bias in marine phytoplankton. This study represents the first comparative analysis of codon bias across multiple major phytoplankton groups.
Collapse
|
28
|
Abstract
In bacteria stop codons are recognized by one of two class I release factors (RF1) recognizing TAG, RF2 recognizing TGA, and TAA being recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3'untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two-point mutations one of which is likely to be deleterious.
Collapse
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| | - Laurence D Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
| |
Collapse
|
29
|
Abstract
Despite the importance of natural selection in species’ evolutionary history, phylogenetic methods that take into account population-level processes typically ignore selection. The assumption of neutrality is often based on the idea that selection occurs at a minority of loci in the genome and is unlikely to compromise phylogenetic inferences significantly. However, genome-wide processes like GC-bias and some variation segregating at the coding regions are known to evolve in the nearly neutral range. As we are now using genome-wide data to estimate species trees, it is natural to ask whether weak but pervasive selection is likely to blur species tree inferences. We developed a polymorphism-aware phylogenetic model tailored for measuring signatures of nucleotide usage biases to test the impact of selection in the species tree. Our analyses indicate that although the inferred relationships among species are not significantly compromised, the genetic distances are systematically underestimated in a node-height-dependent manner: that is, the deeper nodes tend to be more underestimated than the shallow ones. Such biases have implications for molecular dating. We dated the evolutionary history of 30 worldwide fruit fly populations, and we found signatures of GC-bias considerably affecting the estimated divergence times (up to 23%) in the neutral model. Our findings call for the need to account for selection when quantifying divergence or dating species evolution.
Collapse
Affiliation(s)
- Rui Borges
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria
| | - Bastien Boussau
- Université de Lyon, Université Claude Bernard Lyon 1, CNRS UMR 5558, LBBE, Villeurbanne, France
| | - Gergely J Szöllősi
- Department of Biological Physics, Eötvös University, Budapest , Hungary.,MTA-ELTE "Lendület" Evolutionary Genomics Research Group, Budapest, Hungary.,Evolutionary Systems Research Group, Centre for Ecological Research, Hungarian Academy of Sciences, Tihany, Hungary
| | - Carolin Kosiol
- Institut für Populationsgenetik, Vetmeduni Vienna, Wien, Austria.,Centre for Biological Diversity, University of St Andrews, St Andrews, United Kingdom
| |
Collapse
|
30
|
Tikhomirova TS, Matyunin MA, Lobanov MY, Galzitskaya OV. In-depth analysis of amino acid and nucleotide sequences of Hsp60: how conserved is this protein? Proteins 2021; 90:1119-1141. [PMID: 34964171 DOI: 10.1002/prot.26294] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2021] [Revised: 12/21/2021] [Accepted: 12/23/2021] [Indexed: 11/07/2022]
Abstract
Chaperonin Hsp60, as a protein found in all organisms, is of great interest in medicine, since it is present in many tissues and can be used both as a drug and as an object of targeted therapy. Hence, Hsp60 deserves a fundamental comparative analysis to assess its evolutionary characteristics. It was found that the percent identity of Hsp60 amino acid sequences both within and between phyla was not high enough to identify Hsp60s as highly conserved proteins. However, their ATP binding sites are largely conserved. The amino acid composition of Hsp60s remained relatively constant. At the same time, the analysis of the nucleotide sequences showed that GC content in the Hsp60 genes was comparable to or greater than the genomic values, which may indicate a high resistance to mutations due to tight control of the nucleotide composition by DNA repair systems. Natural selection plays a dominant role in the evolution of Hsp60 genes. The degree of mutational pressure affecting the Hsp60 genes is quite low, and its direction does not depend on taxonomy. Interestingly, for the Hsp60 genes from Chordata, Arthropoda, and Proteobacteria the exact direction of mutational pressure could not be determined. However, upon further division into classes, it was found that the direction of the mutational pressure for Hsp60 genes from Fish differs from that for other chordates. The direction of the mutational pressure affects the synonymous codon usage bias. The number of high and low represented codons increases with increasing GC content, which can improve codon usage. Special server has been created for bioinformatics analysis of Hsp60: http://oka.protres.ru:4202/.
Collapse
Affiliation(s)
- Tatyana S Tikhomirova
- Institute for Biological Instrumentation of the Russian Academy of Sciences, Federal Research Center "Pushchino Scientific Center for Biological Research of the Russian Academy of Sciences", Pushchino, Moscow Region, Russia
| | - Maxim A Matyunin
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| | - Michail Yu Lobanov
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| | - Oxana V Galzitskaya
- Institute of Protein Research, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
- Institute of Theoretical and Experimental Biophysics, Russian Academy of Sciences, Pushchino, Moscow Region, Russia
| |
Collapse
|
31
|
Nair RR, Mohan M, Rudramurthy GR, Vivekanandam R, Satheshkumar PS. Strategies and Patterns of Codon Bias in Molluscum Contagiosum Virus. Pathogens 2021; 10:1649. [PMID: 34959603 PMCID: PMC8703355 DOI: 10.3390/pathogens10121649] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2021] [Revised: 12/14/2021] [Accepted: 12/16/2021] [Indexed: 11/22/2022] Open
Abstract
Trends associated with codon usage in molluscum contagiosum virus (MCV) and factors governing the evolution of codon usage have not been investigated so far. In this study, attempts were made to decipher the codon usage trends and discover the major evolutionary forces that influence the patterns of codon usage in MCV with special reference to sub-types 1 and 2, MCV-1 and MCV-2, respectively. Three hypotheses were tested: (1) codon usage patterns of MCV-1 and MCV-2 are identical; (2) SCUB (synonymous codon usage bias) patterns of MCV-1 and MCV-2 slightly deviate from that of human host to avoid affecting the fitness of host; and (3) translational selection predominantly shapes the SCUB of MCV-1 and MCV-2. Various codon usage indices viz. relative codon usage value, effective number of codons and codon adaptation index were calculated to infer the nature of codon usage. Correspondence analysis and correlation analysis were performed to assess the relative contribution of silent base contents and significance of codon usage indices in defining bias in codon usage. Among the tested hypotheses, only the second and third hypotheses were accepted.
Collapse
Affiliation(s)
- Rahul Raveendran Nair
- Centre for Evolutionary Ecology, Aushmath Biosciences, Vadavalli Post, Coimbatore 641041, India
| | - Manikandan Mohan
- College of Pharmacy, University of Georgia, Athens, GA 30605, USA;
| | | | - Reethu Vivekanandam
- Department of Biotechnology, Bharathiyar University, Coimbatore 641046, India;
| | | |
Collapse
|
32
|
Murphy CL, Yang R, Decker T, Cavalliere C, Andreev V, Bircher N, Cornell J, Dohmen R, Pratt CJ, Grinnell A, Higgs J, Jett C, Gillett E, Khadka R, Mares S, Meili C, Liu J, Mukhtar H, Elshahed MS, Youssef NH. Genomes of Novel Myxococcota Reveal Severely Curtailed Machineries for Predation and Cellular Differentiation. Appl Environ Microbiol 2021; 87:e0170621. [PMID: 34524899 DOI: 10.1128/AEM.01706-21] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
Cultured Myxococcota are predominantly aerobic soil inhabitants, characterized by their highly coordinated predation and cellular differentiation capacities. Little is currently known regarding yet-uncultured Myxococcota from anaerobic, nonsoil habitats. We analyzed genomes representing one novel order (o__JAFGXQ01) and one novel family (f__JAFGIB01) in the Myxococcota from an anoxic freshwater spring (Zodletone Spring) in Oklahoma, USA. Compared to their soil counterparts, anaerobic Myxococcota possess smaller genomes and a smaller number of genes encoding biosynthetic gene clusters (BGCs), peptidases, one- and two-component signal transduction systems, and transcriptional regulators. Detailed analysis of 13 distinct pathways/processes crucial to predation and cellular differentiation revealed severely curtailed machineries, with the notable absence of homologs for key transcription factors (e.g., FruA and MrpC), outer membrane exchange receptor (TraA), and the majority of sporulation-specific and A-motility-specific genes. Further, machine learning approaches based on a set of 634 genes informative of social lifestyle predicted a nonsocial behavior for Zodletone Myxococcota. Metabolically, Zodletone Myxococcota genomes lacked aerobic respiratory capacities but carried genes suggestive of fermentation, dissimilatory nitrite reduction, and dissimilatory sulfate-reduction (in f_JAFGIB01) for energy acquisition. We propose that predation and cellular differentiation represent a niche adaptation strategy that evolved circa 500 million years ago (Mya) in response to the rise of soil as a distinct habitat on Earth. IMPORTANCE The phylum Myxococcota is a phylogenetically coherent bacterial lineage that exhibits unique social traits. Cultured Myxococcota are predominantly aerobic soil-dwelling microorganisms that are capable of predation and fruiting body formation. However, multiple yet-uncultured lineages within the Myxococcota have been encountered in a wide range of nonsoil, predominantly anaerobic habitats, and the metabolic capabilities, physiological preferences, and capacity of social behavior of such lineages remain unclear. Here, we analyzed genomes recovered from a metagenomic analysis of an anoxic freshwater spring in Oklahoma, USA, that represent novel, yet-uncultured, orders and families in the Myxococcota. The genomes appear to lack the characteristic hallmarks for social behavior encountered in Myxococcota genomes and displayed a significantly smaller genome size and a smaller number of genes encoding biosynthetic gene clusters, peptidases, signal transduction systems, and transcriptional regulators. Such perceived lack of social capacity was confirmed through detailed comparative genomic analysis of 13 pathways associated with Myxococcota social behavior, as well as the implementation of machine learning approaches to predict social behavior based on genome composition. Metabolically, these novel Myxococcota are predicted to be strict anaerobes, utilizing fermentation, nitrate reduction, and dissimilarity sulfate reduction for energy acquisition. Our results highlight the broad patterns of metabolic diversity within the yet-uncultured Myxococcota and suggest that the evolution of predation and fruiting body formation in the Myxococcota has occurred in response to soil formation as a distinct habitat on Earth.
Collapse
|
33
|
Jiang S, Du Q, Feng C, Ma L, Zhang Z. CompoDynamics: a comprehensive database for characterizing sequence composition dynamics. Nucleic Acids Res 2021; 50:D962-D969. [PMID: 34718745 PMCID: PMC8728180 DOI: 10.1093/nar/gkab979] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/15/2021] [Revised: 10/02/2021] [Accepted: 10/06/2021] [Indexed: 11/15/2022] Open
Abstract
Sequence compositions of nucleic acids and proteins have significant impact on gene expression, RNA stability, translation efficiency, RNA/protein structure and molecular function, and are associated with genome evolution and adaptation across all kingdoms of life. Therefore, a devoted resource of sequence compositions and associated features is fundamentally crucial for a wide range of biological research. Here, we present CompoDynamics (https://ngdc.cncb.ac.cn/compodynamics/), a comprehensive database of sequence compositions of coding sequences (CDSs) and genomes for all kinds of species. Taking advantage of the exponential growth of RefSeq data, CompoDynamics presents a wealth of sequence compositions (nucleotide content, codon usage, amino acid usage) and derived features (coding potential, physicochemical property and phase separation) for 118 689 747 high-quality CDSs and 34 562 genomes across 24 995 species. Additionally, interactive analytical tools are provided to enable comparative analyses of sequence compositions and molecular features across different species and gene groups. Collectively, CompoDynamics bears the great potential to better understand the underlying roles of sequence composition dynamics across genes and genomes, providing a fundamental resource in support of a broad spectrum of biological studies.
Collapse
Affiliation(s)
- Shuai Jiang
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China
| | - Qiang Du
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Changrui Feng
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Lina Ma
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| | - Zhang Zhang
- National Genomics Data Center & CAS Key Laboratory of Genome Sciences and Information, Beijing Institute of Genomics, Chinese Academy of Sciences, Beijing 100101, China.,China National Center for Bioinformation, Beijing 100101, China.,University of Chinese Academy of Sciences, Beijing 100049, China
| |
Collapse
|
34
|
Pidcock SE, Skvortsov T, Santos FG, Courtney SJ, Sui-Ting K, Creevey CJ, Huws SA. Phylogenetic systematics of Butyrivibrio and Pseudobutyrivibrio genomes illustrate vast taxonomic diversity, open genomes and an abundance of carbohydrate-active enzyme family isoforms. Microb Genom 2021; 7. [PMID: 34605764 PMCID: PMC8627218 DOI: 10.1099/mgen.0.000638] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/21/2023] Open
Abstract
Butyrivibrio and Pseudobutyrivibrio dominate in anaerobic gastrointestinal microbiomes, particularly the rumen, where they play a key role in harvesting dietary energy. Within these genera, five rumen species have been classified (Butyrivibrio fibrisolvens, Butyrivibrio hungatei, Butyrivibrio proteoclasticus, Pseudobutyrivibrio ruminis and Pseudobutyrivibrio xylanivorans) and more recently an additional Butyrivibrio sp. group was added. Given the recent increase in available genomes, we re-investigated the phylogenetic systematics and evolution of Butyrivibrio and Pseudobutyrivibrio. Across 71 genomes, we show using 16S rDNA and 40 gene marker phylogenetic trees that the current six species designations (P. ruminis, P. xylanivorans, B. fibrisolvens, Butyrivibrio sp., B. hungatei and B. proteclasticus) are found. However, pangenome analysis showed vast genomic variation and a high abundance of accessory genes (91.50–99.34 %), compared with core genes (0.66–8.50 %), within these six taxonomic groups, suggesting incorrectly assigned taxonomy. Subsequent pangenome accessory genomes under varying core gene cut-offs (%) and average nucleotide identity (ANI) analysis suggest the existence of 42 species within 32 genera. Pangenome analysis of those that still group within B. fibrisolvens, B. hungatei and P. ruminis, based on revised ANI phylogeny, also showed possession of very open genomes, illustrating the diversity that exists even within these groups. All strains of both Butyrivibrio and Pseudobutyrivibrio also shared a broad range of clusters of orthologous genes (COGs) (870), indicating recent evolution from a common ancestor. We also demonstrate that the carbohydrate-active enzymes (CAZymes) predominantly belong to glycosyl hydrolase (GH)2, 3, 5, 13 and 43, with numerous within family isoforms apparent, likely facilitating metabolic plasticity and resilience under dietary perturbations. This study provides a major advancement in our functional and evolutionary understanding of these important anaerobic bacteria.
Collapse
Affiliation(s)
- Sara E Pidcock
- School of Biological Sciences and Institute for Global Food Security, 19 Chlorine Gardens, Queen's University Belfast, Belfast BT9 5DL, UK
| | - Timofey Skvortsov
- School of Pharmacy, Medical Biology Centre, 97 Lisburn Road, Queen's University Belfast, Belfast BT9 7BL, UK
| | - Fernanda G Santos
- School of Biological Sciences and Institute for Global Food Security, 19 Chlorine Gardens, Queen's University Belfast, Belfast BT9 5DL, UK
| | - Stephen J Courtney
- School of Biological Sciences and Institute for Global Food Security, 19 Chlorine Gardens, Queen's University Belfast, Belfast BT9 5DL, UK
| | - Karen Sui-Ting
- School of Biological Sciences and Institute for Global Food Security, 19 Chlorine Gardens, Queen's University Belfast, Belfast BT9 5DL, UK
| | - Christopher J Creevey
- School of Biological Sciences and Institute for Global Food Security, 19 Chlorine Gardens, Queen's University Belfast, Belfast BT9 5DL, UK
| | - Sharon A Huws
- School of Biological Sciences and Institute for Global Food Security, 19 Chlorine Gardens, Queen's University Belfast, Belfast BT9 5DL, UK
| |
Collapse
|
35
|
Lin MJ, Haynes AM, Addetia A, Lieberman NAP, Phung Q, Xie H, Nguyen TV, Molini BJ, Lukehart SA, Giacani L, Greninger AL. Longitudinal TprK profiling of in vivo and in vitro-propagated Treponema pallidum subsp. pallidum reveals accumulation of antigenic variants in absence of immune pressure. PLoS Negl Trop Dis 2021; 15:e0009753. [PMID: 34492041 PMCID: PMC8480903 DOI: 10.1371/journal.pntd.0009753] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/24/2021] [Revised: 09/29/2021] [Accepted: 08/23/2021] [Indexed: 11/25/2022] Open
Abstract
Immune evasion by Treponema pallidum subspecies pallidum (T. pallidum) has been attributed to antigenic variation of its putative outer-membrane protein TprK. In TprK, amino acid diversity is confined to seven variable (V) regions, and generation of sequence diversity within the V regions occurs via a non-reciprocal segmental gene conversion mechanism where donor cassettes recombine into the tprK expression site. Although previous studies have shown the significant role of immune selection in driving accumulation of TprK variants, the contribution of baseline gene conversion activity to variant diversity is less clear. Here, combining longitudinal tprK deep sequencing of near clonal Chicago C from immunocompetent and immunosuppressed rabbits along with the newly developed in vitro cultivation system for T. pallidum, we directly characterized TprK alleles in the presence and absence of immune selection. Our data confirm significantly greater sequence diversity over time within the V6 region during syphilis infection in immunocompetent rabbits compared to immunosuppressed rabbits, consistent with previous studies on the role of TprK in evasion of the host immune response. Compared to strains grown in immunocompetent rabbits, strains passaged in vitro displayed low level changes in allele frequencies of TprK variable region sequences similar to that of strains passaged in immunosuppressed rabbits. Notably, we found significantly increased rates of V6 allele generation relative to other variable regions in in vitro cultivated T, pallidum strains, illustrating that the diversity within these hypervariable regions occurs in the complete absence of immune selection. Together, our results demonstrate antigenic variation in T. pallidum can be studied in vitro and occurs even in the complete absence of immune pressure, allowing the T. pallidum population to continuously evade the immune system of the infected host.
Collapse
Affiliation(s)
- Michelle J. Lin
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
| | - Austin M. Haynes
- Pathobiology Graduate Program, University of Washington, Seattle, Washington, United States of America
| | - Amin Addetia
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
- Molecular and Cellular Biology Graduate Program, University of Washington, Seattle, Washington, United States of America
| | - Nicole A. P. Lieberman
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
| | - Quynh Phung
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
| | - Hong Xie
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
| | - Tien V. Nguyen
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
| | - Barbara J. Molini
- Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, Washington, United States of America
| | - Sheila A. Lukehart
- Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, Washington, United States of America
- Department of Global Health, University of Washington, Seattle, Washington, United States of America
| | - Lorenzo Giacani
- Department of Medicine, Division of Allergy and Infectious Diseases, University of Washington, Seattle, Washington, United States of America
| | - Alexander L. Greninger
- Department of Laboratory Medicine and Pathology, University of Washington, Seattle, Washington, United States of America
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
- * E-mail:
| |
Collapse
|
36
|
Wasser D, Borst A, Hammelmann M, Ludt K, Soppa J. Characterization of Non-selected Intermolecular Gene Conversion in the Polyploid Haloarchaeon Haloferax volcanii. Front Microbiol 2021; 12:680854. [PMID: 34177864 PMCID: PMC8223754 DOI: 10.3389/fmicb.2021.680854] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/15/2021] [Accepted: 05/14/2021] [Indexed: 11/13/2022] Open
Abstract
Gene conversion is defined as the non-reciprocal transfer of genetic information from one site to a homologous, but not identical site of the genome. In prokaryotes, gene conversion can increase the variance of sequences, like in antigenic variation, but can also lead to a homogenization of sequences, like in the concerted evolution of multigene families. In contrast to these intramolecular mechanisms, the intermolecular gene conversion in polyploid prokaryotes, which leads to the equalization of the multiple genome copies, has hardly been studied. We have previously shown the intermolecular gene conversion in halophilic and methanogenic archaea is so efficient that it can be studied without selecting for conversion events. Here, we have established an approach to characterize unselected intermolecular gene conversion in Haloferax volcanii making use of two genes that encode enzymes involved in carotenoid biosynthesis. Heterozygous strains were generated by protoplast fusion, and gene conversion was quantified by phenotype analysis or/and PCR. It was verified that unselected gene conversion is extremely efficient and it was shown that gene conversion tracts are much longer than in antigenic variation or concerted evolution in bacteria. Two sites were nearly always co-converted when they were 600 bp apart, and more than 30% co-conversion even occurred when two sites were 5 kbp apart. The gene conversion frequency was independent from the extent of genome differences, and even a one nucleotide difference triggered conversion.
Collapse
Affiliation(s)
- Daniel Wasser
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Andreas Borst
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Mathias Hammelmann
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Katharina Ludt
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| | - Jörg Soppa
- Institute for Molecular Biosciences, Biocentre, Goethe-University, Frankfurt, Germany
| |
Collapse
|
37
|
Auboeuf D. The Physics-Biology continuum challenges darwinism: Evolution is directed by the homeostasis-dependent bidirectional relation between genome and phenotype. Prog Biophys Mol Biol 2021; 167:121-139. [PMID: 34097984 DOI: 10.1016/j.pbiomolbio.2021.05.008] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 01/19/2021] [Revised: 05/19/2021] [Accepted: 05/31/2021] [Indexed: 10/21/2022]
Abstract
The physics-biology continuum relies on the fact that life emerged from prebiotic molecules. Here, I argue that life emerged from the coupling between nucleic acid and protein synthesis during which proteins (or proto-phenotypes) maintained the physicochemical parameter equilibria (or proto-homeostasis) in the proximity of their encoding nucleic acids (or proto-genomes). This protected the proto-genome physicochemical integrity (i.e., atomic composition) from environmental physicochemical constraints, and therefore increased the probability of reproducing the proto-genome without variation. From there, genomes evolved depending on the biological activities they generated in response to environmental fluctuations. Thus, a genome maintaining homeostasis (i.e., internal physicochemical parameter equilibria), despite and in response to environmental fluctuations, maintains its physicochemical integrity and has therefore a higher probability to be reproduced without variation. Consequently, descendants have a higher probability to share the same phenotype than their parents. Otherwise, the genome is modified during replication as a consequence of the imbalance of the internal physicochemical parameters it generates, until new mutation-deriving biological activities maintain homeostasis in offspring. In summary, evolution depends on feedforward and feedback loops between genome and phenotype, as the internal physicochemical conditions that a genome generates ─ through its derived phenotype in response to environmental fluctuations ─ in turn either guarantee its stability or direct its variation. Evolution may not be explained by the Darwinism-derived, unidirectional principle (random mutations-phenotypes-natural selection) but rather by the bidirectional relationship between genome and phenotype, in which the phenotype in interaction with the environment directs the evolution of the genome it derives from.
Collapse
Affiliation(s)
- Didier Auboeuf
- ENS de Lyon, Univ Claude Bernard, CNRS UMR 5239, INSERM U1210, Laboratory of Biology and Modelling of the Cell, 46 Allée D'Italie, Site Jacques Monod, F-69007, Lyon, France.
| |
Collapse
|
38
|
Callens M, Scornavacca C, Bedhomme S. Evolutionary responses to codon usage of horizontally transferred genes in Pseudomonas aeruginosa: gene retention, amelioration and compensatory evolution. Microb Genom 2021; 7:000587. [PMID: 34165421 PMCID: PMC8461475 DOI: 10.1099/mgen.0.000587] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2020] [Accepted: 04/19/2021] [Indexed: 12/18/2022] Open
Abstract
Prokaryote genome evolution is characterized by the frequent gain of genes through horizontal gene transfer (HGT). For a gene, being horizontally transferred can represent a strong change in its genomic and physiological context. If the codon usage of a transferred gene deviates from that of the receiving organism, the fitness benefits it provides can be reduced due to a mismatch with the expression machinery. Consequently, transferred genes with a deviating codon usage can be selected against or elicit evolutionary responses that enhance their integration, such as gene amelioration and compensatory evolution. Within bacterial species, the extent and relative importance of these different mechanisms has never been considered altogether. In this study, a phylogeny-based method was used to investigate the occurrence of these different evolutionary responses in Pseudomonas aeruginosa. Selection on codon usage of genes acquired through HGT was observed over evolutionary time, with the overall codon usage converging towards that of the core genome. Gene amelioration, through the accumulation of synonymous mutations after HGT, did not seem to systematically affect transferred genes. This pattern therefore seemed to be mainly driven by selective retention of transferred genes with an initial codon usage similar to that of the core genes. Additionally, variation in the copy number of tRNA genes was often associated with the acquisition of genes for which the observed variation could enhance their expression. This provides evidence that compensatory evolution might be an important mechanism for the integration of horizontally transferred genes.
Collapse
Affiliation(s)
- Martijn Callens
- CEFE, Univ Montpellier, CNRS, EPHE, IRD, Univ Paul Valéry Montpellier 3, Montpellier, France
| | - Celine Scornavacca
- Institut des Sciences de l’Evolution, Université Montpellier, CNRS, IRD, EPHE, Montpellier, France
| | - Stéphanie Bedhomme
- CEFE, Univ Montpellier, CNRS, EPHE, IRD, Univ Paul Valéry Montpellier 3, Montpellier, France
| |
Collapse
|
39
|
Abstract
In every kingdom of life, GC->AT transitions occur more frequently than any other type of mutation due to the spontaneous deamination of cytidine. In eukaryotic genomes, this slow loss of GC base pairs is counteracted by biased gene conversion which increases genomic GC content as part of the recombination process. However, this type of biased gene conversion has not been observed in bacterial genomes, so we hypothesized that GC->AT transitions cause a reduction of genomic GC content in prokaryotic genomes on an evolutionary time scale. To test this hypothesis, we used a phylogenetic approach to analyze triplets of closely related genomes representing a wide range of the bacterial kingdom. The resulting data indicate that genomic GC content is drifting downward in bacterial genomes where GC base pairs comprise 40% or more of the total genome. In contrast, genomes containing less than 40% GC base pairs have fewer opportunities for GC->AT transitions to occur so genomic GC content is relatively stable or actually increasing. It should be noted that this observed change in genomic GC content is the net change in shared parts of the genome and does not apply to parts of the genome that have been lost or acquired since the genomes being compared shared common ancestor. However, a more detailed analysis of two Caulobacter genomes revealed that the acquisition of mobile elements by the two genomes actually reduced the total genomic GC content as well.
Collapse
Affiliation(s)
- Bert Ely
- Department of Biological Sciences, University of South Carolina, Columbia, South Carolina, United States of America
- * E-mail:
| |
Collapse
|
40
|
Duan B, Ding P, Navarre WW, Liu J, Xia B. Xenogeneic Silencing and Bacterial Genome Evolution: Mechanisms for DNA Recognition Imply Multifaceted Roles of Xenogeneic Silencers. Mol Biol Evol 2021; 38:4135-4148. [PMID: 34003286 PMCID: PMC8476142 DOI: 10.1093/molbev/msab136] [Citation(s) in RCA: 14] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2021] [Revised: 04/08/2021] [Indexed: 12/14/2022] Open
Abstract
Horizontal gene transfer (HGT) is a major driving force for bacterial evolution. To avoid the deleterious effects due to the unregulated expression of newly acquired foreign genes, bacteria have evolved specific proteins named xenogeneic silencers to recognize foreign DNA sequences and suppress their transcription. As there is considerable diversity in genomic base compositions among bacteria, how xenogeneic silencers distinguish self- from nonself DNA in different bacteria remains poorly understood. This review summarizes the progress in studying the DNA binding preferences and the underlying molecular mechanisms of known xenogeneic silencer families, represented by H-NS of Escherichia coli, Lsr2 of Mycobacterium, MvaT of Pseudomonas, and Rok of Bacillus. Comparative analyses of the published data indicate that the differences in DNA recognition mechanisms enable these xenogeneic silencers to have clear characteristics in DNA sequence preferences, which are further correlated with different host genomic features. These correlations provide insights into the mechanisms of how these xenogeneic silencers selectively target foreign DNA in different genomic backgrounds. Furthermore, it is revealed that the genomic AT contents of bacterial species with the same xenogeneic silencer family proteins are distributed in a limited range and are generally lower than those species without any known xenogeneic silencers in the same phylum/class/genus, indicating that xenogeneic silencers have multifaceted roles on bacterial genome evolution. In addition to regulating horizontal gene transfer, xenogeneic silencers also act as a selective force against the GC to AT mutational bias found in bacterial genomes and help the host genomic AT contents maintained at relatively low levels.
Collapse
Affiliation(s)
- Bo Duan
- Beijing Nuclear Magnetic Resonance Center, College of Chemistry and Molecular Engineering, and School of Life Sciences, Peking University, Beijing, 100871, China
| | - Pengfei Ding
- Beijing Nuclear Magnetic Resonance Center, College of Chemistry and Molecular Engineering, and School of Life Sciences, Peking University, Beijing, 100871, China
| | - William Wiley Navarre
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, M5G 1M1, Canada
| | - Jun Liu
- Department of Molecular Genetics, University of Toronto, Toronto, Ontario, M5G 1M1, Canada
| | - Bin Xia
- Beijing Nuclear Magnetic Resonance Center, College of Chemistry and Molecular Engineering, and School of Life Sciences, Peking University, Beijing, 100871, China
| |
Collapse
|
41
|
Nguyen DT, Wu B, Long H, Zhang N, Patterson C, Simpson S, Morris K, Thomas WK, Lynch M, Hao W. Variable Spontaneous Mutation and Loss of Heterozygosity among Heterozygous Genomes in Yeast. Mol Biol Evol 2021; 37:3118-3130. [PMID: 33219379 DOI: 10.1093/molbev/msaa150] [Citation(s) in RCA: 12] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022] Open
Abstract
Mutation and recombination are the primary sources of genetic variation. To better understand the evolution of genetic variation, it is crucial to comprehensively investigate the processes involving mutation accumulation and recombination. In this study, we performed mutation accumulation experiments on four heterozygous diploid yeast species in the Saccharomycodaceae family to determine spontaneous mutation rates, mutation spectra, and losses of heterozygosity (LOH). We observed substantial variation in mutation rates and mutation spectra. We also observed high LOH rates (1.65-11.07×10-6 events per heterozygous site per cell division). Biases in spontaneous mutation and LOH together with selection ultimately shape the variable genome-wide nucleotide landscape in yeast species.
Collapse
Affiliation(s)
- Duong T Nguyen
- Department of Biological Sciences, Wayne State University, Detroit, MI
| | - Baojun Wu
- Department of Biological Sciences, Wayne State University, Detroit, MI
| | - Hongan Long
- Institute of Evolution and Marine Biodiversity, KLMME, Ocean University of China, Qingdao, Shandong Province, China
| | - Nan Zhang
- Department of Biological Sciences, Wayne State University, Detroit, MI
| | | | | | | | | | - Michael Lynch
- Center for Mechanisms of Evolution, The Biodesign Institute, Arizona State University, Tempe, AZ
| | - Weilong Hao
- Department of Biological Sciences, Wayne State University, Detroit, MI
| |
Collapse
|
42
|
Murray GGR, Charlesworth J, Miller EL, Casey MJ, Lloyd CT, Gottschalk M, Tucker AW(D, Welch JJ, Weinert LA. Genome Reduction Is Associated with Bacterial Pathogenicity across Different Scales of Temporal and Ecological Divergence. Mol Biol Evol 2021; 38:1570-1579. [PMID: 33313861 PMCID: PMC8042751 DOI: 10.1093/molbev/msaa323] [Citation(s) in RCA: 34] [Impact Index Per Article: 11.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022] Open
Abstract
Emerging bacterial pathogens threaten global health and food security, and so it is important to ask whether these transitions to pathogenicity have any common features. We present a systematic study of the claim that pathogenicity is associated with genome reduction and gene loss. We compare broad-scale patterns across all bacteria, with detailed analyses of Streptococcus suis, an emerging zoonotic pathogen of pigs, which has undergone multiple transitions between disease and carriage forms. We find that pathogenicity is consistently associated with reduced genome size across three scales of divergence (between species within genera, and between and within genetic clusters of S. suis). Although genome reduction is also found in mutualist and commensal bacterial endosymbionts, genome reduction in pathogens cannot be solely attributed to the features of their ecology that they share with these species, that is, host restriction or intracellularity. Moreover, other typical correlates of genome reduction in endosymbionts (reduced metabolic capacity, reduced GC content, and the transient expansion of nonfunctional elements) are not consistently observed in pathogens. Together, our results indicate that genome reduction is a consistent correlate of pathogenicity in bacteria.
Collapse
Affiliation(s)
- Gemma G R Murray
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| | - Jane Charlesworth
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- European Bioinformatics Institute, Wellcome Genome Campus, Cambridge, United Kingdom
- Warwick Medical School, University of Warwick, Coventry, United Kingdom
| | - Eric L Miller
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
- Haverford College, Haverford, PA, USA
| | - Michael J Casey
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
- School of Mathematical Sciences, University of Southampton, Southampton, United Kingdom
| | - Catrin T Lloyd
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Marcelo Gottschalk
- Département de Pathologie et Microbiologie, Université de Montréal, Montréal, QC, Canada
| | | | - John J Welch
- Department of Genetics, University of Cambridge, Cambridge, United Kingdom
| | - Lucy A Weinert
- Department of Veterinary Medicine, University of Cambridge, Cambridge, United Kingdom
| |
Collapse
|
43
|
Castillo AI, Almeida RPP. Evidence of gene nucleotide composition favoring replication and growth in a fastidious plant pathogen. G3 (Bethesda) 2021; 11:6170658. [PMID: 33715000 PMCID: PMC8495750 DOI: 10.1093/g3journal/jkab076] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 02/19/2021] [Accepted: 03/02/2021] [Indexed: 11/13/2022]
Abstract
Nucleotide composition (GC content) varies across bacteria species, genome regions, and specific genes. In Xylella fastidiosa, a vector-borne fastidious plant pathogen infecting multiple crops, GC content ranges between ∼51-52%; however, these values were gathered using limited genomic data. We evaluated GC content variations across X. fastidiosa subspecies fastidiosa (N = 194), subsp. pauca (N = 107), and subsp. multiplex (N = 39). Genomes were classified based on plant host and geographic origin; individual genes within each genome were classified based on gene function, strand, length, ortholog group, Core vs. Accessory, and Recombinant vs. Non-recombinant. GC content was calculated for each gene within each evaluated genome. The effects of genome and gene level variables were evaluated with a mixed effect ANOVA, and the marginal-GC content was calculated for each gene. Also, the correlation between gene-specific GC content vs. natural selection (dN/dS) and recombination/mutation (r/m) was estimated. Our analyses show that intra-genomic changes in nucleotide composition in X. fastidiosa are small and influenced by multiple variables. Higher AT-richness is observed in genes involved in replication and translation, and genes in the leading strand. In addition, we observed a negative correlation between high-AT and dN/dS in subsp. pauca. The relationship between recombination and GC content varied between core and accessory genes. We hypothesize that distinct evolutionary forces and energetic constraints both drive and limit these small variations in nucleotide composition.
Collapse
Affiliation(s)
- Andreina I Castillo
- Department of Environmental Science, Policy and Management, University of California, Berkeley, CA 94720, USA
| | - Rodrigo P P Almeida
- Department of Environmental Science, Policy and Management, University of California, Berkeley, CA 94720, USA
| |
Collapse
|
44
|
Meyer MM. Revisiting the Relationships Between Genomic G + C Content, RNA Secondary Structures, and Optimal Growth Temperature. J Mol Evol 2021; 89:165-71. [PMID: 33216148 DOI: 10.1007/s00239-020-09974-w] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/16/2020] [Accepted: 11/09/2020] [Indexed: 10/23/2022]
Abstract
Over twenty years ago Galtier and Lobry published a manuscript entitled "Relationships between Genomic G + C Content, RNA Secondary Structure, and Optimal Growth Temperature" in the Journal of Molecular Evolution that showcased the lack of a relationship between genomic G + C content and optimal growth temperature (OGT) in a set of about 200 prokaryotes. Galtier and Lobry also assessed the relationship between RNA secondary structures (rRNA stems, tRNAs) and OGT, and in this case a clear relationship emerged. Increasing structured RNA G + C content (particularly in regions that are double-stranded) correlates with increased OGT. Both of these fundamental relationships have withstood test of many additional sequences and spawned a variety of different applications that include prediction of OGT from rRNA sequence and computational ncRNA identification approaches. In this work, I present the motivation behind Galtier and Lobry's original paper and the larger questions addressed by the work, how these questions have evolved over the last two decades, and the impact of Galtier and Lobry's manuscript in fields beyond these questions.
Collapse
|
45
|
Nguyen DT, Wu B, Xiao S, Hao W. Evolution of a Record-Setting AT-Rich Genome: Indel Mutation, Recombination, and Substitution Bias. Genome Biol Evol 2020; 12:2344-2354. [PMID: 32986811 PMCID: PMC7846184 DOI: 10.1093/gbe/evaa202] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/19/2020] [Indexed: 12/16/2022] Open
Abstract
Genome-wide nucleotide composition varies widely among species. Despite extensive research, the source of genome-wide nucleotide composition diversity remains elusive. Yeast mitochondrial genomes (mitogenomes) are highly A + T rich, and they provide a unique opportunity to study the evolution of AT-biased landscape. In this study, we sequenced ten complete mitogenomes of the Saccharomycodes ludwigii yeast with 8% G + C content, the lowest genome-wide %(G + C) in all published genomes to date. The S. ludwigii mitogenomes have high densities of short tandem repeats but severely underrepresented mononucleotide repeats. Comparative population genomics of these record-setting A + T-rich genomes shows dynamic indel mutations and strong mutation bias toward A/T. Indel mutations play a greater role in genomic variation among very closely related strains than nucleotide substitutions. Indels have resulted in presence–absence polymorphism of tRNAArg (ACG) among S. ludwigii mitogenomes. Interestingly, these mitogenomes have undergone recombination, a genetic process that can increase G + C content by GC-biased gene conversion. Finally, the expected equilibrium G + C content under mutation pressure alone is higher than observed G + C content, suggesting existence of mechanisms other than AT-biased mutation operating to increase A/T. Together, our findings shed new lights on mechanisms driving extremely AT-rich genomes.
Collapse
Affiliation(s)
- Duong T Nguyen
- Department of Biological Sciences, Wayne State University
| | - Baojun Wu
- Department of Biological Sciences, Wayne State University
| | - Shujie Xiao
- Department of Biological Sciences, Wayne State University
| | - Weilong Hao
- Department of Biological Sciences, Wayne State University
| |
Collapse
|
46
|
Wang D, Yu X, Xu K, Bi G, Cao M, Zelzion E, Fu C, Sun P, Liu Y, Kong F, Du G, Tang X, Yang R, Wang J, Tang L, Wang L, Zhao Y, Ge Y, Zhuang Y, Mo Z, Chen Y, Gao T, Guan X, Chen R, Qu W, Sun B, Bhattacharya D, Mao Y. Pyropia yezoensis genome reveals diverse mechanisms of carbon acquisition in the intertidal environment. Nat Commun 2020. [PMID: 32788591 DOI: 10.1038/s41467s-020-17689-1] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/08/2023] Open
Abstract
Changes in atmospheric CO2 concentration have played a central role in algal and plant adaptation and evolution. The commercially important red algal genus, Pyropia (Bangiales) appears to have responded to inorganic carbon (Ci) availability by evolving alternating heteromorphic generations that occupy distinct habitats. The leafy gametophyte inhabits the intertidal zone that undergoes frequent emersion, whereas the sporophyte conchocelis bores into mollusk shells. Here, we analyze a high-quality genome assembly of Pyropia yezoensis to elucidate the interplay between Ci availability and life cycle evolution. We find horizontal gene transfers from bacteria and expansion of gene families (e.g. carbonic anhydrase, anti-oxidative related genes), many of which show gametophyte-specific expression or significant up-regulation in gametophyte in response to dehydration. In conchocelis, the release of HCO3- from shell promoted by carbonic anhydrase provides a source of Ci. This hypothesis is supported by the incorporation of 13C isotope by conchocelis when co-cultured with 13C-labeled CaCO3.
Collapse
Affiliation(s)
- Dongmei Wang
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Xinzi Yu
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Kuipeng Xu
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Guiqi Bi
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Min Cao
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Ehud Zelzion
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, 08901, USA
| | - Chunxiang Fu
- Shandong Provincial Key Laboratory of Energy Genetics, Key Laboratory of Biofuels, Qingdao Institute of BioEnergy and Bioprocess Technology, Chinese Academy of Sciences, 266101, Qingdao, China
| | - Peipei Sun
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Yang Liu
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Fanna Kong
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Guoying Du
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Xianghai Tang
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Ruijuan Yang
- Shandong Provincial Key Laboratory of Energy Genetics, Key Laboratory of Biofuels, Qingdao Institute of BioEnergy and Bioprocess Technology, Chinese Academy of Sciences, 266101, Qingdao, China
| | - Junhao Wang
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Lei Tang
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Lu Wang
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Yingjun Zhao
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Yuan Ge
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Yunyun Zhuang
- Key Laboratory of Marine Environment and Ecology, Ministry of Education, Ocean University of China, 266100, Qingdao, China
| | - Zhaolan Mo
- Key Laboratory of Maricultural Organism Disease Control, Ministry of Agriculture and Rural Affairs, Yellow Sea Fisheries Research Institute, Chinese Academy of Fishery Sciences, 266071, Qingdao, China
| | - Yu Chen
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Tian Gao
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Xiaowei Guan
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Rui Chen
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Weihua Qu
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Bin Sun
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China
- College of Marine Life Sciences, Ocean University of China, 266100, Qingdao, China
| | - Debashish Bhattacharya
- Department of Biochemistry and Microbiology, Rutgers University, New Brunswick, NJ, 08901, USA.
| | - Yunxiang Mao
- Key Laboratory of Marine Genetics and Breeding (OUC), Ministry of Education, 266100, Qingdao, China.
- Key Laboratory of Utilization and Conservation for Tropical Marine Bioresources (Hainan Tropical Ocean University), Ministry of Education, 572022, Sanya, China.
- The Laboratory for Marine Biology and Biotechnology, Qingdao National Laboratory for Marine Science and Technology, 266237, Qingdao, China.
| |
Collapse
|
47
|
Wang D, Yu X, Xu K, Bi G, Cao M, Zelzion E, Fu C, Sun P, Liu Y, Kong F, Du G, Tang X, Yang R, Wang J, Tang L, Wang L, Zhao Y, Ge Y, Zhuang Y, Mo Z, Chen Y, Gao T, Guan X, Chen R, Qu W, Sun B, Bhattacharya D, Mao Y. Pyropia yezoensis genome reveals diverse mechanisms of carbon acquisition in the intertidal environment. Nat Commun 2020; 11:4028. [PMID: 32788591 DOI: 10.1038/s41467-020-17689-1] [Citation(s) in RCA: 26] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2019] [Accepted: 07/09/2020] [Indexed: 12/15/2022] Open
Abstract
Changes in atmospheric CO2 concentration have played a central role in algal and plant adaptation and evolution. The commercially important red algal genus, Pyropia (Bangiales) appears to have responded to inorganic carbon (Ci) availability by evolving alternating heteromorphic generations that occupy distinct habitats. The leafy gametophyte inhabits the intertidal zone that undergoes frequent emersion, whereas the sporophyte conchocelis bores into mollusk shells. Here, we analyze a high-quality genome assembly of Pyropia yezoensis to elucidate the interplay between Ci availability and life cycle evolution. We find horizontal gene transfers from bacteria and expansion of gene families (e.g. carbonic anhydrase, anti-oxidative related genes), many of which show gametophyte-specific expression or significant up-regulation in gametophyte in response to dehydration. In conchocelis, the release of HCO3- from shell promoted by carbonic anhydrase provides a source of Ci. This hypothesis is supported by the incorporation of 13C isotope by conchocelis when co-cultured with 13C-labeled CaCO3. The nori producing seaweed Pyropia yezoensis has heteromorphic generations that occupy distinct habitats. Here, via genome assembly, transcriptome analysis, and 13 C isotope labeling, the authors show the interplay between inorganic carbon availability and life cycle evolution in the intertidal environment.
Collapse
|
48
|
Kogay R, Wolf YI, Koonin EV, Zhaxybayeva O. Selection for Reducing Energy Cost of Protein Production Drives the GC Content and Amino Acid Composition Bias in Gene Transfer Agents. mBio 2020; 11:e01206-20. [PMID: 32665274 DOI: 10.1128/mBio.01206-20] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Kin selection and group selection remain controversial topics in evolutionary biology. We argue that these types of selection are likely to operate in bacterial populations by showing that bacterial gene transfer agents (GTAs), but not related viruses, evolve under conditions of positive selection for the reduction of the energy cost of GTA particle production. We hypothesize that GTAs are dedicated devices mediating the survival of bacteria under conditions of nutrient limitation. The benefits conferred by GTAs under nutritional stress conditions appear to include horizontal dissemination of genes that could provide bacteria with enhanced capabilities for nutrient utilization and increases of nutrient availability occurring through the lysis of GTA-producing bacteria. Gene transfer agents (GTAs) are virus-like elements integrated into bacterial genomes, particularly, those of Alphaproteobacteria. The GTAs can be induced under conditions of nutritional stress, incorporate random fragments of bacterial DNA into miniphage particles, lyse the host cells, and infect neighboring bacteria, thus enhancing horizontal gene transfer. We show that GTA genes evolve under conditions of pronounced positive selection for the reduction of the energy cost of protein production as shown by comparison of the amino acid compositions with those of both homologous viral genes and host genes. The energy saving in GTA genes is comparable to or even more pronounced than that in the genes encoding the most abundant, essential bacterial proteins. In cases in which viruses acquire genes from GTAs, the bias in amino acid composition disappears in the course of evolution, showing that reduction of the energy cost of protein production is an important factor of evolution of GTAs but not bacterial viruses. These findings strongly suggest that GTAs represent bacterial adaptations rather than selfish, virus-like elements. Because GTA production kills the host cell and does not propagate the GTA genome, it appears likely that the GTAs are retained in the course of evolution via kin or group selection. Therefore, we hypothesize that GTAs facilitate the survival of bacterial populations under energy-limiting conditions through the spread of metabolic and transport capabilities via horizontal gene transfer and increases in nutrient availability resulting from the altruistic suicide of GTA-producing cells.
Collapse
|
49
|
Abstract
Host-beneficial endosymbioses, which are formed when a microorganism takes up residence inside another cell and provides a fitness advantage to the host, have had a dramatic influence on the evolution of life. These intimate relationships have yielded the mitochondrion and the plastid (chloroplast) - the ancient organelles that in part define eukaryotic life - along with many more recent associations involving a wide variety of hosts and microbial partners. These relationships are often envisioned as stable associations that appear cooperative and persist for extremely long periods of time. But recent evidence suggests that this stable state is often born from turbulent and conflicting origins, and that the apparent stability of many beneficial endosymbiotic relationships - although certainly real in many cases - is not an inevitable outcome of these associations. Here we review how stable endosymbioses form, how they are maintained, and how they sometimes break down and are reborn. We focus on relationships formed by insects and their resident microorganisms because these symbioses have been the focus of significant empirical work over the last two decades. We review these relationships over five life stages: origin, birth, middle age, old age, and death.
Collapse
|
50
|
Munjal A, Khandia R, Shende KK, Das J. Mycobacterium lepromatosis genome exhibits unusually high CpG dinucleotide content and selection is key force in shaping codon usage. Infect Genet Evol 2020; 84:104399. [PMID: 32512206 DOI: 10.1016/j.meegid.2020.104399] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [What about the content of this article? (0)] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 09/12/2019] [Revised: 05/30/2020] [Accepted: 06/03/2020] [Indexed: 01/06/2023]
Abstract
Mycobacterium lepromatosis was identified as a causative agent for leprosy in the year 2008 in the United States and later more cases were identified in Canada, Singapore, Brazil, and Myanmar. It is known to cause diffuse lepromatosis leprosy among humans. Since it is invasive, the mortality rates are higher in comparison to the M. leprae. At genomic level, there exists 90.9% similarity between M. lepromatosis and M. leprae. Codon usage analysis based on analyses of 228 coding sequences (CDSs) of M. lepromatosis, revealed that the genome is GC rich. Among the total 16 dinucleotides, CpG dinucleotide possesses the highest dinucleotide frequency in M. lepromatosis, that is strikingly an unobvious observation since higher CpG is associated with higher proinflammatory cytokine production and NF-κB activation that eventually leads to high pathogenicity. To evade immune response, CpG content is generally less in pathogens. The unusually high CpG content can be explained by the fact that the nucleotide composition of M. lepromatosis is CG rich. Various forces interplay to shape codon usage pattern of any organism including selection; mutation, nucleotide composition as well as GC biased gene conversion. To understand the interplay between various forces; neutrality, parity, Nc-GC3 (Effective number of codons-GC content at 3rd position of the codon), aromaticity (AROMO) and the general average hydropathicity score (GRAVY) analyses have been carried out. The analyses revealed that selection force is the major contributory force. Along with the selection; mutation, nucleotide composition as well as GC biased gene conversion also play role in shaping codon usage bias in M. lepromatosis. This is the first report on the codon usage in M. lepromatosis.
Collapse
|