1
|
Arora P, Kumar S, Mukhopadhyay CS, Kaur S. Codon usage analysis in selected virulence genes of Staphylococcal species. Curr Genet 2025; 71:5. [PMID: 39853506 DOI: 10.1007/s00294-025-01308-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2024] [Revised: 12/25/2024] [Accepted: 01/03/2025] [Indexed: 01/26/2025]
Abstract
The Staphylococcus genus, composed of Gram-positive bacteria, includes several pathogenic species such as Staphylococcus aureus, S. epidermidis, S. haemolyticus, and S. saprophyticus, each implicated in a range of infections. This study investigates the codon usage patterns in key virulence genes, including Autolysin (alt), Elastin Binding protein (EbpS), Lipase, Thermonuclease, Intercellular Adhesion Protein (IcaR), and V8 Protease, across four Staphylococcus species. Using metrics such as the Effective Number of Codons (ENc), Relative Synonymous Codon Usage (RSCU), Codon Adaptation Index (CAI), alongside neutrality and parity plots, we explored the codon preferences and nucleotide composition biases. Our findings revealed a pronounced AT-rich codon preference, with AT-rich genomes likely aiding in energy-efficient translation and bacterial survival in host environments. These insights provide a deeper understanding of the evolutionary adaptations and translational efficiency mechanisms that contribute to the pathogenicity of Staphylococcus species. This knowledge could pave the way for novel therapeutic interventions targeting codon usage to disrupt virulence gene expression.
Collapse
Affiliation(s)
- Pinky Arora
- School of Bioengineering and Biosciences, Lovely Professional University, Jalandhar-Delhi G.T. Road, Phagwara, Punjab, 144411, India
| | - Shubham Kumar
- School of Pharmaceutical Sciences, Lovely Professional, University, Jalandhar- G.T. Road, Phagwara, Punjab, 144411, India
| | - Chandra Shekhar Mukhopadhyay
- Department of Bioinformatics, College of Animal Biotechnology, Guru Angad Dev Veterinary and Animal Sciences University, Ferozepur G.T. Road, Ludhiana, Punjab, 141004, India
| | - Sandeep Kaur
- Department of Medical Laboratory Sciences, Lovely Professional University, Phagwara, 144411, Punjab, India.
| |
Collapse
|
2
|
Arora P, Mukhopadhyay CS, Kaur S. Comparative genome wise analysis of codon usage of Staphylococcus Genus. Curr Genet 2024; 70:10. [PMID: 39083100 DOI: 10.1007/s00294-024-01297-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/29/2024] [Revised: 07/05/2024] [Accepted: 07/22/2024] [Indexed: 12/14/2024]
Abstract
The genus Staphylococcus encompasses a diverse array of bacteria with significant implications for human health, including disreputable pathogens such as Staphylococcus aureus and Staphylococcus epidermidis. Understanding the genetic composition and codon usage patterns of Staphylococcus species is crucial for unraveling their evolutionary dynamics, adaptive strategies, and pathogenic potential. In this study, we conducted a comprehensive analysis of codon usage patterns across 48 species within the Staphylococcus genus. Our findings uncovered variations in genomic G-C content across Staphylococcus species, impacting codon usage preferences, with a notable preference for A/T-rich codons observed in pathogenic strains. This preference for A/T-rich codons suggests an energy-saving strategy in pathogenic organisms. Analysis of dinucleotide pair expression patterns unveiled insights into genomic dynamics, with overrepresented codon pairs reflecting trends in dinucleotide expression across genomes. Additionally, a significant correlation between CAI and genomic G-C content underscored the intricate relationship between codon usage patterns and gene expression strategies. Amino acid usage analysis highlighted preferences for energetically cheaper amino acids, suggesting adaptive strategies promoting energy efficiency. This comprehensive analysis sheds light on the evolutionary dynamics and adaptive mechanisms employed by Staphylococcus species, providing valuable insights into their pathogenic potential and clinical implications. Understanding these genomic features is crucial for devising strategies to combat staphylococcal infections and improve public health outcomes.
Collapse
Affiliation(s)
- Pinky Arora
- School of Bioengineering and Biosciences, Lovely Professional University, Jalandhar-Delhi G.T. Road, Phagwara, Punjab, 144411, India
| | - Chandra Shekhar Mukhopadhyay
- Department of Bioinformatics, College of Animal Biotechnology, Guru Angad Dev Veterinary and Animal Sciences University, Ferozepur G.T. Road, Ludhiana, Punjab, 141004, India
| | - Sandeep Kaur
- Department of Medical Laboratory Sciences, Lovely Professional University, Phagwara, Punjab, 144411, India.
| |
Collapse
|
3
|
Sharma A, Gupta S, Paul K. Codon usage behavior distinguishes pathogenic Clostridium species from the non-pathogenic species. Gene 2023; 873:147394. [PMID: 37137382 DOI: 10.1016/j.gene.2023.147394] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 03/19/2023] [Accepted: 03/21/2023] [Indexed: 05/05/2023]
Abstract
Genus Clostridium is of the largest genus in class Clostridia. It is comprised of spore-forming, anaerobic, gram-positive organisms. The members of this genus include human pathogens to free-living nitrogen fixing bacteria. In the present study, we have performed a comparison of the choice of preferred codons, codon usage patterns, dinucleotide and amino acid usage pattern of 76 species of Genus Clostridium. We found the pathogenic clostridium species to have smaller AT-rich genomes as compared to opportunistic and non-pathogenic clostridium species. The choice of preferred and optimal codons was also influenced by genomic GC/AT content of the respective clostridium species. The pathogenic clostridium species displayed a strict bias in the codon usage, employing 35 of the 61 codons encoding for 20 amino acids. Comparison of amino acid usage revealed an increased usage of amino acids with lower biosynthetic cost by pathogenic clostridium species as compared to opportunistic and non-pathogenic clostridium species. Smaller genome, strict codon usage bias and amino acid usage lead to lower protein energetic cost for the clostridial pathogens. Overall, we found the pathogenic members of genus Clostridium to prefer small, AT-rich codons to reduce biosynthetic costs and match the cellular environment of its AT-rich human host.
Collapse
Affiliation(s)
- Anuj Sharma
- Department of Biochemistry, DAV University, Jalandhar, Punjab 144012, India
| | - Shelly Gupta
- Department of Biochemistry, School of Bioengineering and Biosciences, Lovely Professional University, Punjab 144411, India
| | - Karan Paul
- Department of Biochemistry, DAV University, Jalandhar, Punjab 144012, India.
| |
Collapse
|
4
|
Sharma A, Gupta S, Paul K. Evolution of codon and amino acid usage in bacterial protein toxins. Biochem Biophys Res Commun 2023; 651:47-55. [PMID: 36791498 DOI: 10.1016/j.bbrc.2023.02.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/23/2023] [Revised: 02/01/2023] [Accepted: 02/01/2023] [Indexed: 02/07/2023]
Abstract
Toxin proteins are secreted by most pathogens as an integral part of pathogenic mechanism(s). The toxins act by either damaging the host cell membrane (for example, pore-forming toxins and RTX toxins) or by modulation of important cellular pathways (for example, inhibition of protein translation by ribosome-inactivating proteins). The mechanism of action of these toxins provides the pathogen with strategies for adaptation in the unfavorable host environment. Though, secreted by different pathogenic species, the protein toxins seem to share common features that allow the protein to bind to specific molecules and enter the host cell. Earlier studies have suggested role of several events like horizontal gene transfer and insertion-deletion mutations in evolution of protein toxins. The present study involving 125 bacterial protein toxins secreted by 49 pathogenic bacteria focuses on the role and constraints of the bacterial genome on evolution of codon and amino acid usage in respective bacterial protein toxins. We compare the nucleotide composition, codon and dinucleotide usage trends between different classes of bacterial protein toxins and between individual toxins and the parent bacterial genome expressing the toxin(s).
Collapse
Affiliation(s)
- Anuj Sharma
- Department of Biochemistry, DAV University, Jalandhar, 144012, India
| | - Shelly Gupta
- Department of Biochemistry, School of Bioengineering and Biosciences, Lovely Professional University, Phagwara, India
| | - Karan Paul
- Department of Biochemistry, DAV University, Jalandhar, 144012, India.
| |
Collapse
|
5
|
Huang Y, Lin T, Lu L, Cai F, Lin J, Jiang YE, Lin Y. Codon pair optimization (CPO): a software tool for synthetic gene design based on codon pair bias to improve the expression of recombinant proteins in Pichia pastoris. Microb Cell Fact 2021; 20:209. [PMID: 34736476 PMCID: PMC8567542 DOI: 10.1186/s12934-021-01696-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/29/2021] [Accepted: 10/19/2021] [Indexed: 11/29/2022] Open
Abstract
Background Codon optimization is a common method to improve protein expression levels in Pichia pastoris and the current strategy is to replace rare codons with preferred codons to match the codon usage bias. However, codon-pair contexts have a profound effect on translation efficiency by influencing both translational elongation rates and accuracy. Until now, it remains untested whether optimized genes based on codon pair bias results in higher protein expression levels compared to codon usage bias. Results In this study, an algorithm based on dynamic programming was introduced to develop codon pair optimization (CPO) which is a software tool to provide simple and efficient codon pair optimization for synthetic gene design in Pichia pastoris. Two reporters (MT1-MMP E2C6 and ADAM17 A9B8 scFvs) were employed to test the effects of codon pair bias and CPO optimization on their protein expression levels. Four variants of MT1-MMP E2C6 and ADAM17 A9B8 for each were generated, one variant with the best codon-pair context, one with the worst codon-pair context, one with unbiased codon-pair context, and another optimized based on codon usage. The expression levels of variants with the worst codon-pair context were almost undetectable by Western blot and the variants with the best codon-pair context were expressed well. The expression levels on MT1-MMP E2C6 and ADAM17 A9B8 were more than five times and seven times higher in the optimized sequences based on codon-pair context compared to that based on codon usage, respectively. The results indicated that the codon-pair context-based codon optimization is more effective in enhancing expression of protein in Pichia pastoris. Conclusions Codon-pair context plays an important role on the protein expression in Pichia pastoris. The codon pair optimization (CPO) software developed in this study efficiently improved the protein expression levels of exogenous genes in Pichia pastoris, suggesting gene design based on codon pair bias is an alternative strategy for high expression of recombinant proteins in Pichia pastoris. Supplementary Information The online version contains supplementary material available at 10.1186/s12934-021-01696-y.
Collapse
Affiliation(s)
- Yide Huang
- Engineering Research Center of Industrial Microbiology, College of Life Sciences, Fujian Normal University, Fuzhou, 350007, China. .,Provincial University Key Laboratory of Cellular Stress Response and Metabolic Regulation, College of Life Sciences, Fujian Normal University, Fuzhou, 350007, China.
| | - Ting Lin
- Engineering Research Center of Industrial Microbiology, College of Life Sciences, Fujian Normal University, Fuzhou, 350007, China
| | - Lingfang Lu
- Engineering Research Center of Industrial Microbiology, College of Life Sciences, Fujian Normal University, Fuzhou, 350007, China
| | - Fan Cai
- Provincial University Key Laboratory of Cellular Stress Response and Metabolic Regulation, College of Life Sciences, Fujian Normal University, Fuzhou, 350007, China
| | - Jie Lin
- College of Mathematics and Informatics, Fujian Normal University, Fuzhou, 350007, China
| | - Yu E Jiang
- College of Mathematics and Informatics, Fujian Normal University, Fuzhou, 350007, China.
| | - Yao Lin
- Engineering Research Center of Industrial Microbiology, College of Life Sciences, Fujian Normal University, Fuzhou, 350007, China. .,Provincial University Key Laboratory of Sport and Health Science, School of Physical Education and Sport Sciences, Fujian Normal University, Fuzhou, 350007, China.
| |
Collapse
|
6
|
Kokate PP, Techtmann SM, Werner T. Codon usage bias and dinucleotide preference in 29 Drosophila species. G3 GENES|GENOMES|GENETICS 2021; 11:6291245. [PMID: 34849812 PMCID: PMC8496323 DOI: 10.1093/g3journal/jkab191] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2021] [Accepted: 05/13/2021] [Indexed: 12/30/2022]
Abstract
Abstract
Codon usage bias, where certain codons are used more frequently than their synonymous counterparts, is an interesting phenomenon influenced by three evolutionary forces: mutation, selection, and genetic drift. To better understand how these evolutionary forces affect codon usage bias, an extensive study to detect how codon usage patterns change across species is required. This study investigated 668 single-copy orthologous genes independently in 29 Drosophila species to determine how the codon usage patterns change with phylogenetic distance. We found a strong correlation between phylogenetic distance and codon usage bias and observed striking differences in codon preferences between the two subgenera Drosophila and Sophophora. As compared to the subgenus Sophophora, species of the subgenus Drosophila showed reduced codon usage bias and a reduced preference specifically for codons ending with C, except for codons with G in the second position. We found that codon usage patterns in all species were influenced by the nucleotides in the codon’s 2nd and 3rd positions rather than the biochemical properties of the amino acids encoded. We detected a concordance between preferred codons and preferred dinucleotides (at positions 2 and 3 of codons). Furthermore, we observed an association between speciation, codon preferences, and dinucleotide preferences. Our study provides the foundation to understand how selection acts on dinucleotides to influence codon usage bias.
Collapse
Affiliation(s)
- Prajakta P Kokate
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| | - Stephen M Techtmann
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| | - Thomas Werner
- Department of Biological Sciences, Michigan Technological University, Houghton, MI 49931, USA
| |
Collapse
|
7
|
An interplay between compositional constraint and natural selection dictates the codon usage pattern among select Galliformes. Biosystems 2021; 204:104390. [PMID: 33636205 DOI: 10.1016/j.biosystems.2021.104390] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/28/2021] [Accepted: 02/18/2021] [Indexed: 11/20/2022]
Abstract
Galliformes are believed to be the first avian order that started living in human association and became domesticated. Members of this order ranged from common to rare species. Next-generation sequencing has availed researchers with the whole genome sequences of five Galliformes; chicken, helmeted Guinea fowl, turkey, Japanese quail, and peafowl. Bioinformatic analysis based on codon usage, evolution, and species-specific functional enrichment can provide some crucial information aiding proper understanding of their genomic strategies. In this study, we investigated the genomic features of chicken, helmeted guinea fowl, turkey, and Japanese quail. Their genomes were AT biased although the potentially highly expressed genes contained more GC than AT. Cytosine dominated the third position of frequently used optimal codons. Mutational pressures in the analyzed Galliformes were in the range of 0.2-0.6%. Neutrality plot, translational selection index, and mutational responsive index indicated the dominance of selection pressure over mutational pressure among Galliformes. A pair of di-nucleotides, TpA and CpG, was found to be used less frequently than others in protein-coding genes since both of them are associated with the conversion of euchromatin to heterochromatin. Functional enrichment analysis revealed the dominance of proteins associated with fundamental biological processes. In turkey, chicken and helmeted Guinea fowl proteins with immunity-boosting capacity prevailed along with proteins needed for signal transduction and maintenance of central dogma. Evolutionary analysis indicated a bias towards synonymous substitution than non-synonymous mutation.
Collapse
|
8
|
Gupta S, Paul K, Roy A. Codon usage signatures in the genus Cryptococcus: A complex interplay of gene expression, translational selection and compositional bias. Genomics 2020; 113:821-830. [PMID: 33096254 DOI: 10.1016/j.ygeno.2020.10.013] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2020] [Revised: 09/16/2020] [Accepted: 10/05/2020] [Indexed: 11/30/2022]
Abstract
The fungal genus Cryptococcus comprises of several diverse species. The pathogens forming Cryptococcus neoformans/ Cryptococcus gatti species complex are of immense clinical significance owing to the high frequency of infections and deaths globally. Three closely related non-pathogenic species namely, Cryptococcus amylolentus, Cryptococcus wingfieldii and Cryptococcus depauperatus are the non-pathogenic ancestral species from which pathogenic lineages have diverged. In the current study, a comprehensive analysis of factors influencing the codon and amino acid usage bias in six pathogenic and three non-pathogenic species was performed. Our results revealed that though compositional bias played a crucial role, translational selection and gene expression were the key determinants of codon usage variations. Analysis of relative dinucleotide abundance and codon context signatures revealed strict avoidance of TpA dinucleotide across genomes. Multivariate statistical analysis based on codon usage data resulted in discrete clustering of pathogens and non-pathogens which correlated with previous reports on their phylogenetic distribution.
Collapse
Affiliation(s)
- Shelly Gupta
- Department of Biochemistry, School of Bioengineering and Biosciences, Lovely Professional University, Punjab 144411, India.
| | - Karan Paul
- Department of Biochemistry, DAV University, Jalandhar, Punjab 144001, India
| | - Ayan Roy
- Department of Biotechnology, School of Bioengineering and Biosciences, Lovely Professional University, Punjab 144411, India.
| |
Collapse
|