1
Cote-L'Heureux AE, Sterner EG, Maurer-Alcalá XX, Katz LA. Lost in translation: conserved amino acid usage despite extreme codon bias in foraminifera. mBio 2025; 16:e0391624. [PMID: 40042280 PMCID: PMC11980380 DOI: 10.1128/mbio.03916-24] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 01/20/2025] [Accepted: 02/04/2025] [Indexed: 04/10/2025]
Abstract
Analyses of codon usage in eukaryotes suggest that amino acid usage responds to GC pressure so AT-biased substitutions drive higher usage of amino acids with AT-ending codons. Here, we combine single-cell transcriptomics and phylogenomics to explore codon usage patterns in foraminifera, a diverse and ancient clade of predominantly uncultivable microeukaryotes. We curate data from 1,044 gene families in 49 individuals representing 28 genera, generating perhaps the largest existing dataset of data from a predominantly uncultivable clade of protists, to analyze compositional bias and codon usage. We find extreme variation in composition, with a median GC content at fourfold degenerate silent sites below 3% in some species and above 75% in others. The most AT-biased species are distributed among diverse non-monophyletic lineages. Surprisingly, despite the extreme variation in compositional bias, amino acid usage is highly conserved across all foraminifera. By analyzing nucleotide, codon, and amino acid composition within this diverse clade of amoeboid eukaryotes, we expand our knowledge of patterns of genome evolution across the eukaryotic tree of life.
IMPORTANCE
Patterns of molecular evolution in protein-coding genes reflect trade-offs between substitution biases and selection on both codon and amino acid usage. Most analyses of these factors in microbial eukaryotes focus on model species such as Acanthamoeba, Plasmodium, and yeast, where substitution bias is a primary contributor to patterns of amino acid usage. Foraminifera, an ancient clade of single-celled eukaryotes, present a conundrum, as we find highly conserved amino acid usage underlain by divergent nucleotide composition, including extreme AT-bias at silent sites among multiple non-sister lineages. We speculate that these paradoxical patterns are enabled by the dynamic genome structure of foraminifera, whose life cycles can include genome endoreplication and chromatin extrusion.
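As a rough illustration of the statistic quoted in this abstract, GC content at fourfold degenerate silent sites can be computed along the lines of the sketch below. It assumes the standard genetic code and an in-frame coding sequence; the function name `gc4` and the plain-string input are illustrative assumptions, not details taken from the paper.

```python
# Codon prefixes whose third position is fourfold degenerate in the
# standard genetic code (Leu CTN, Val GTN, Ser TCN, Pro CCN, Thr ACN,
# Ala GCN, Arg CGN, Gly GGN). Assumption: standard code applies.
FOURFOLD_PREFIXES = {"CT", "GT", "TC", "CC", "AC", "GC", "CG", "GG"}


def gc4(cds: str) -> float:
    """Return the fraction of G or C at fourfold degenerate third positions.

    `cds` is assumed to be an in-frame coding sequence. Codons with
    ambiguous third bases are skipped. Returns NaN if no fourfold
    degenerate site is found.
    """
    cds = cds.upper().replace("U", "T")
    gc = total = 0
    # Walk complete codons only; a trailing partial codon is ignored.
    for i in range(0, len(cds) - len(cds) % 3, 3):
        codon = cds[i:i + 3]
        if codon[:2] in FOURFOLD_PREFIXES and codon[2] in "ACGT":
            total += 1
            gc += codon[2] in "GC"
    return gc / total if total else float("nan")
```

Applied per gene and summarized by the median across genes, a function like this would give a per-species value comparable in spirit to the "median GC content at fourfold degenerate silent sites" mentioned above.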
Affiliation(s)
- Elinor G. Sterner
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, USA
- Xyrus X. Maurer-Alcalá
- Division of Invertebrate Zoology, Sackler Institute for Comparative Genomics, American Museum of Natural History, New York, New York, USA
- Laura A. Katz
- Department of Biological Sciences, Smith College, Northampton, Massachusetts, USA
- Program in Organismic Biology and Evolution, University of Massachusetts Amherst, Amherst, Massachusetts, USA
2
Mou QH, Hu Z, Zhang J, Daroch M, Tang J. Comparative genomics of Thermosynechococcaceae and Thermostichaceae: insights into codon usage bias. Acta Biochim Pol 2025; 71:13825. [PMID: 39845100 PMCID: PMC11750575 DOI: 10.3389/abp.2024.13825] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 09/19/2024] [Accepted: 12/20/2024] [Indexed: 01/24/2025]
Abstract
Members of the families Thermosynechococcaceae and Thermostichaceae are well-known unicellular thermophilic cyanobacteria, and a non-thermophilic genus, Pseudocalidococcus, was newly classified into the former. Analysis of the codon usage bias (CUB) of cyanobacterial species inhabiting different thermal and non-thermal niches will benefit the understanding of their genetic and evolutionary characteristics. Herein, the CUB and codon context patterns of protein-coding genes were systematically analyzed and compared between members of the two families. Overall, the nucleotide composition and CUB indices were found to differ between thermophiles and non-thermophiles. Compared with the non-thermophiles, the thermophiles showed a higher G/C content in their codon base composition, and their codons tended to end with G/C. Correlation analysis indicated significant associations between codon base composition and CUB indices. The results of the effective number of codons, parity-rule 2, neutral and correspondence analyses indicated that mutational pressure and natural selection primarily account for CUB in these cyanobacterial species, but the primary driving forces vary among genera. Moreover, the optimal codons identified based on relative synonymous codon usage values were found to differ among genera and even within genera. In addition, codon context pattern analysis revealed the specificity of the sequence context of start and stop codons among genera. Intriguingly, the clustering of codon context patterns appeared to be more related to thermotolerance than to phylogenomic relationships. In conclusion, this study facilitates the understanding of the characteristics and sources of variation of CUB and the evolution of the surveyed cyanobacterial clades with different thermotolerance and provides insights into their adaptation to different environments.
Affiliation(s)
- Qiao-Hui Mou
- School of Food and Bioengineering, Chengdu University, Chengdu, China
- Zhe Hu
- School of Food and Bioengineering, Chengdu University, Chengdu, China
- Jing Zhang
- Food Safety Detection Key Laboratory of Sichuan, Technical Center of Chengdu Customs, Chengdu, China
- Maurycy Daroch
- School of Environment and Energy, Peking University Shenzhen Graduate School, Shenzhen, China
- Jie Tang
- School of Food and Bioengineering, Chengdu University, Chengdu, China
3
Jacquat AG, Theumer MG, Dambolena JS. Selective and non-selective evolutionary signatures found in the simplest replicative biological entities. J Evol Biol 2024; 37:862-876. [PMID: 38822575 DOI: 10.1093/jeb/voae070] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/23/2023] [Accepted: 05/30/2024] [Indexed: 06/03/2024]
Abstract
Mitoviruses, which are considered evolutionary relics of extinct alpha-proteobacteria RNA phages, represent one of the simplest self-replicating biological systems. This study aims to quantitatively describe genomes and identify potential genomic signatures that support the protein phylogenetic-based classification criterion. Genomic variables, such as mononucleotide and dinucleotide composition, codon usage bias, and minimal free energy derived from optimized predicted RNA secondary structure, were analyzed. From the values obtained, the main evolutionary pressures were discussed, indicating that natural selection plays a significant role in shaping mitovirus genomes. However, neutral evolution also makes a significant contribution. This study reveals a significant discovery of structural divergence in Kvaramitovirus. The energy minimization approach employed to study 2D folding in this study reveals a distinct spatial organization of their genomes, providing evidence for the hypothesis of a single evolutionary event of circularization in the most recent common ancestor of the lineage. This hypothesis was discussed in light of recent discoveries by other researchers that partially support the existence of mitoviruses with circular genomes. Finally, this study represents a significant advancement in the understanding of mitoviruses, as it quantitatively describes the nucleotide sequence at the family and genus taxonomic levels. Additionally, we provide hypotheses that can be experimentally validated to inspire new research and address the gaps in knowledge of this fascinating, basally divergent RNA virus lineage.
Affiliation(s)
- Andrés Gustavo Jacquat
- Facultad de Ciencias Exactas Físicas y Naturales (FCEFyN), Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
- Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Córdoba, Argentina
- Martín Gustavo Theumer
- Departamento de Bioquímica Clínica, Facultad de Ciencias Químicas (FCQ), Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
- Centro de Investigaciones en Bioquímica Clínica e Inmunología (CIBICI), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Córdoba, Argentina
- José Sebastián Dambolena
- Facultad de Ciencias Exactas Físicas y Naturales (FCEFyN), Universidad Nacional de Córdoba (UNC), Córdoba, Argentina
- Instituto Multidisciplinario de Biología Vegetal (IMBIV), Consejo Nacional de Investigaciones Científicas y Técnicas (CONICET), Córdoba, Argentina
4
Hu X, Li Y, Meng F, Duan Y, Sun M, Yang S, Liu H. Analysis of chloroplast genome characteristics and codon usage bias in 14 species of Annonaceae. Funct Integr Genomics 2024; 24:109. [PMID: 38797780 DOI: 10.1007/s10142-024-01389-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2024] [Revised: 05/18/2024] [Accepted: 05/21/2024] [Indexed: 05/29/2024]
Abstract
For the study of species evolution, chloroplast gene expression, and transformation, the chloroplast genome is an invaluable resource. Codon usage bias (CUB) analysis is a tool that is utilized to improve gene expression and investigate evolutionary connections in genetic transformation. In this study, we analysed chloroplast genome differences, codon usage patterns and the sources of variation in CUB in 14 Annonaceae species using bioinformatics tools. The study showed that there was significant variation in both gene sizes and numbers among the 14 species, but conservation was still maintained. Notably, there were clear differences in the IR/SC boundaries and the types of SSRs among the 14 species. The mono-nucleotide repeat type was the most common, with A/T repeats being more prevalent than G/C repeats. Among the different types of repeats, forward and palindromic repeats were the most abundant, followed by reverse repeats, and complement repeats were relatively rare. Codon composition analysis revealed that all 14 species had a GC frequency lower than 50%. Additionally, the codons of chloroplast protein-coding sequences tended to end with A/T at the third position. Among these species, 21 codons exhibited bias (RSCU > 1), and there were 8 high-frequency (HF) codons and 5 optimal codons that were identical across the species. According to the ENC-plot and neutrality plot analyses, natural selection had less impact on the CUB of A. muricata and A. reticulata. Based on the PR2-plot, base G had a higher frequency than C, and T had a higher frequency than A. The correspondence analysis (COA) revealed that codon usage patterns differed among Annonaceae species.
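The RSCU threshold mentioned in this abstract (RSCU > 1 marks a codon used more often than expected under equal synonymous usage) can be illustrated with the minimal sketch below. It assumes the standard genetic code and an in-frame coding sequence; the names `CODON_TABLE` and `rscu` are illustrative, and this is not the software used in the study.

```python
from collections import Counter
from itertools import product

# Standard genetic code, codons enumerated in TCAG order.
# Assumption: the standard code is appropriate for the input sequence.
BASES = "TCAG"
AMINO = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {"".join(c): aa for c, aa in zip(product(BASES, repeat=3), AMINO)}


def rscu(cds: str) -> dict[str, float]:
    """Relative synonymous codon usage for each sense codon.

    RSCU = observed count * (number of synonymous codons) / (total count
    for that amino acid). Amino acids absent from `cds` are omitted.
    """
    cds = cds.upper().replace("U", "T")
    counts = Counter(cds[i:i + 3] for i in range(0, len(cds) - len(cds) % 3, 3))

    # Group codons into synonymous families by amino acid.
    families: dict[str, list[str]] = {}
    for codon, aa in CODON_TABLE.items():
        families.setdefault(aa, []).append(codon)

    values: dict[str, float] = {}
    for aa, codons in families.items():
        if aa == "*":  # skip stop codons
            continue
        total = sum(counts[c] for c in codons)
        if total == 0:
            continue
        for c in codons:
            values[c] = counts[c] * len(codons) / total
    return values
```

For example, in a sequence with two GCT codons and one GCC codon, alanine has four synonymous codons and three observations, so GCT scores 8/3 (preferred) while GCA and GCG score 0.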
Affiliation(s)
- Xiang Hu
- Tropical Eco-agriculture Research Institute, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, 651300, China
- Yaqi Li
- Tropical and Subtropical Cash Crops Research Institute, Yunnan Academy of Agricultural Sciences, Baoshan, Yunnan, 678000, China
- Fuxuan Meng
- Tropical Eco-agriculture Research Institute, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, 651300, China
- Yuanjie Duan
- Tropical Eco-agriculture Research Institute, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, 651300, China
- Manying Sun
- Tropical Eco-agriculture Research Institute, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, 651300, China
- Shiying Yang
- Tropical Eco-agriculture Research Institute, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, 651300, China
- Haigang Liu
- Tropical Eco-agriculture Research Institute, Yunnan Academy of Agricultural Sciences, Yuanmou, Yunnan, 651300, China
5
Li Z, Huang Z, Wan X, Yu J, Dong H, Zhang J, Zhang C, Wang S. Complete chloroplast genome sequence of Rhododendron mariesii and comparative genomics of related species in the family Ericaeae. Comparative Cytogenetics 2023; 17:163-180. [PMID: 37650109 PMCID: PMC10464601 DOI: 10.3897/compcytogen.17.101427] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 02/03/2023] [Accepted: 07/26/2023] [Indexed: 09/01/2023]
Abstract
Rhododendron mariesii Hemsley et Wilson, 1907, a typical member of the family Ericaeae, possesses valuable medicinal and horticultural properties. In this research, the complete chloroplast (cp) genome of R. mariesii was sequenced and assembled, which proved to be a typical quadripartite structure with a length of 203,480 bp. In particular, the lengths of the large single copy region (LSC), small single copy region (SSC), and inverted repeat regions (IR) were 113,715 bp, 7,953 bp, and 40,918 bp, respectively. Among the 151 unique genes, 98 were protein-coding genes, 8 were tRNA genes, and 45 were rRNA genes. The structural characteristics of the R. mariesii cp genome were similar to those of other angiosperms. Leucine was the most represented amino acid, while cysteine was the least represented. In total, 30 codons showed obvious codon usage bias, and most were A/U-ending codons. Six highly variable regions were observed, such as trnK-pafI and atpE-rpoB, which could serve as potential markers for future barcoding and phylogenetic research of R. mariesii species. Coding regions were more conserved than non-coding regions. Expansion and contraction in the IR region might be the main source of length variation in R. mariesii and related Ericaeae species. Maximum-likelihood (ML) phylogenetic analysis revealed that R. mariesii was relatively close to R. simsii Planchon, 1853 and R. pulchrum Sweet, 1831. This research will supply a rich genetic resource for R. mariesii and related species of the Ericaeae.
Affiliation(s)
- Zhiliang Li
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
- Zhiwei Huang
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
- Xuchun Wan
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
- Jiaojun Yu
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
- Hongjin Dong
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
- Jialiang Zhang
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
- Chunyu Zhang
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
- College of Plant Science & Technology, Huazhong Agricultural University, Wuhan, 430070, Hubei Province, China
- Shuzhen Wang
- College of Biology and Agricultural Resources, Huanggang Normal University, Huanggang, 438000, Hubei Province, China
6
Cervantes S, Kesälahti R, Kumpula TA, Mattila TM, Helanterä H, Pyhäjärvi T. Strong Purifying Selection in Haploid Tissue-Specific Genes of Scots Pine Supports the Masking Theory. Mol Biol Evol 2023; 40:msad183. [PMID: 37565532 PMCID: PMC10457172 DOI: 10.1093/molbev/msad183] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/06/2023] [Revised: 06/16/2023] [Accepted: 08/10/2023] [Indexed: 08/12/2023]
Abstract
The masking theory states that genes expressed in a haploid stage will be under more efficient selection. In contrast, selection will be less efficient in genes expressed in a diploid stage, where the fitness effects of recessive deleterious or beneficial mutations can be hidden from selection in heterozygous form. This difference can influence several evolutionary processes such as the maintenance of genetic variation, adaptation rate, and genetic load. Masking theory expectations have been confirmed in single-cell haploid and diploid organisms. However, in multicellular organisms, such as plants, the effects of haploid selection are not clear-cut. In plants, the great majority of studies indicating haploid selection have been carried out using male haploid tissues in angiosperms. Hence, evidence in these systems is confounded with the effects of sexual selection and intraspecific competition. Evidence from other plant groups is scarce, and results show no support for the masking theory. Here, we have used a gymnosperm Scots pine megagametophyte, a maternally derived seed haploid tissue, and four diploid tissues to test the strength of purifying selection on a set of genes with tissue-specific expression. By using targeted resequencing data of those genes, we obtained estimates of genetic diversity, the site frequency spectrum of 0-fold and 4-fold sites, and inferred the distribution of fitness effects of new mutations in haploid and diploid tissue-specific genes. Our results show that purifying selection is stronger for tissue-specific genes expressed in the haploid megagametophyte tissue and that this signal of strong selection is not an artifact driven by high expression levels.
Affiliation(s)
- Sandra Cervantes
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Robert Kesälahti
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Timo A Kumpula
- Biocenter Oulu, University of Oulu, Oulu, Finland
- Laboratory of Cancer Genetics and Tumor Biology, Research Unit of Translational Medicine, University of Oulu, Oulu, Finland
- Tiina M Mattila
- Human Evolution, Department of Organismal Biology, Uppsala University, Uppsala, Sweden
- Heikki Helanterä
- Department of Ecology and Genetics, University of Oulu, Oulu, Finland
- Tanja Pyhäjärvi
- Department of Forest Sciences, University of Helsinki, Helsinki, Finland
7
Morton BR. Context and Mutation in Gymnosperm Chloroplast DNA. Genes (Basel) 2023; 14:1492. [PMID: 37510396 PMCID: PMC10378972 DOI: 10.3390/genes14071492] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 06/25/2023] [Revised: 07/15/2023] [Accepted: 07/18/2023] [Indexed: 07/30/2023]
Abstract
Mutations and subsequent repair processes are known to be strongly context-dependent in the flowering-plant chloroplast genome. At least six flanking bases, three on each side, can have an influence on the relative rates of different types of mutation at any given site. This analysis examines context and substitution at noncoding and fourfold degenerate coding sites in gymnosperm DNA. The sequences are analyzed in sets of three, allowing the inference of the substitution direction and the generation of context-dependent rate matrices. The size of the dataset limits the analysis to the tetranucleotide context of the sites, but the evidence shows that there are significant contextual effects, with patterns that are similar to those observed in angiosperms. These effects most likely represent an influence on the underlying mutation/repair dynamics. The data extend the plastome lineages that feature very complex patterns of mutation, which can have significant effects on the evolutionary dynamics of the chloroplast genome.
Affiliation(s)
- Brian R Morton
- Department of Biology, Barnard College, Columbia University, 3009 Broadway, New York, NY 10027, USA
8
Bi D, Han S, Zhou J, Zhao M, Zhang S, Kan X. Codon Usage Analyses Reveal the Evolutionary Patterns among Plastid Genes of Saxifragales at a Larger-Sampling Scale. Genes (Basel) 2023; 14:genes14030694. [PMID: 36980966 PMCID: PMC10048229 DOI: 10.3390/genes14030694] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 02/15/2023] [Revised: 03/07/2023] [Accepted: 03/10/2023] [Indexed: 03/14/2023]
Abstract
Saxifragales is a 15-family order of early-divergent Eudicots with a rich morphological diversity and an ancient rapid radiation. Codon usage bias (CUB) analyses have emerged as an essential tool for understanding the evolutionary dynamics in genes. Thus far, codon usage patterns had been reported in only four separate genera within Saxifragales. This study provides a comprehensive assessment of codon usage based on 50 plastid genes, covering 11 constituent families at a larger sampling scale. Our results first showed a high preference for AT bases and AT-ending codons. We then used effective number of codons (ENC) to assess a range of codon bias levels in the plastid genes. We also detected highly informative intrafamilial differences of ENC in three families. Subsequently, parity rule 2 (PR2) plot analyses revealed both family-unique and order-shared bias patterns. Most importantly, the ENC plots and neutrality analyses collectively supported the dominant roles of selection in the CUB of Saxifragales plastid genes. Notably, the phylogenetic affinities inferred by both ML and BI methods were consistent with each other, and they all comprised two primary clades and four subclades. These findings significantly enhance our understanding of the evolutionary processes of the Saxifrage order, and could potentially inspire more CUB analyses at higher taxonomic levels.
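The neutrality analysis named in this abstract is commonly carried out by regressing GC content at the first two codon positions (GC12) on GC content at the third position (GC3) across genes; a slope near 1 is usually read as mutation-dominated, a slope near 0 as selection-dominated. The sketch below shows that calculation under simple assumptions: in-frame sequences with at least one codon, no special handling of Met/Trp or stop codons, and illustrative function names that are not taken from the study.

```python
def gc_by_position(cds: str) -> tuple[float, float]:
    """Return (GC12, GC3) for one in-frame coding sequence.

    GC12 is the mean GC fraction of codon positions 1 and 2; GC3 is the
    GC fraction of position 3. Assumes at least one complete codon.
    """
    cds = cds.upper()
    n = len(cds) - len(cds) % 3  # ignore a trailing partial codon
    fracs = []
    for pos in range(3):
        bases = cds[pos:n:3]
        fracs.append(sum(b in "GC" for b in bases) / len(bases))
    return (fracs[0] + fracs[1]) / 2, fracs[2]


def neutrality_slope(genes: list[str]) -> float:
    """Least-squares slope of GC12 (y) regressed on GC3 (x) across genes.

    Raises ZeroDivisionError if all genes share the same GC3 value.
    """
    points = [gc_by_position(g) for g in genes]
    ys = [p[0] for p in points]
    xs = [p[1] for p in points]
    mx = sum(xs) / len(xs)
    my = sum(ys) / len(ys)
    sxx = sum((x - mx) ** 2 for x in xs)
    sxy = sum((x - mx) * (y - my) for x, y in zip(xs, ys))
    return sxy / sxx
```

A shallow slope from a calculation like this is the kind of evidence that would be read as supporting a dominant role for selection, as the abstract reports.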
Affiliation(s)
- De Bi
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
- Shiyun Han
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Jun Zhou
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
- Maojin Zhao
- Suzhou Polytechnic Institute of Agriculture, Suzhou 215000, China
- Sijia Zhang
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Xianzhao Kan
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu 241000, China
- Correspondence: Tel.: +86-139-5537-2268
9
Evidence for Strand Asymmetry in Different Plastid Genomes. Genes (Basel) 2023; 14:genes14020320. [PMID: 36833247 PMCID: PMC9956171 DOI: 10.3390/genes14020320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 12/02/2022] [Revised: 01/06/2023] [Accepted: 01/10/2023] [Indexed: 01/28/2023]
Abstract
A common genome composition pattern in eubacteria is an asymmetry between the leading and lagging strands resulting in opposite skew patterns in the two replichores that lie between the origin and terminus of replication. Although this pattern has been reported for a couple of isolated plastid genomes, it is not clear how widespread it is overall in this chromosome. Using a random walk approach, we examine plastid genomes outside of the land plants, which are excluded since they are known not to initiate replication at a single site, for such a pattern of asymmetry. Although it is not a common feature, we find that it is detectable in the plastid genome of species from several diverse lineages. The euglenozoa in particular show a strong skew pattern as do several rhodophytes. There is a weaker pattern in some chlorophytes but it is not apparent in other lineages. The ramifications of this for analyses of plastid evolution are discussed.
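The abstract does not spell out its random walk scheme, so the following is only a generic sketch of one widely used variant: a cumulative GC skew walk that steps up for each G and down for each C. In bacterial chromosomes with a single origin, the minimum and maximum of such a walk are often taken as rough indicators of the origin and terminus of replication. The function names and the choice of G/C steps are assumptions made for illustration.

```python
from itertools import accumulate


def gc_skew_walk(seq: str) -> list[int]:
    """Cumulative walk along `seq`: +1 for G, -1 for C, 0 otherwise."""
    step = {"G": 1, "C": -1}
    return list(accumulate(step.get(b, 0) for b in seq.upper()))


def walk_extremes(walk: list[int]) -> tuple[int, int]:
    """Return the 0-based indices of the first minimum and first maximum."""
    lo = min(range(len(walk)), key=walk.__getitem__)
    hi = max(range(len(walk)), key=walk.__getitem__)
    return lo, hi
```

A genome with opposite skew in its two replichores produces a V-shaped or inverted-V walk, whereas a genome without strand asymmetry produces a walk with no clear turning points.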
10
Do Noncoding and Coding Sites in Angiosperm Chloroplast DNA Have Different Mutation Processes? Genes (Basel) 2023; 14:genes14010148. [PMID: 36672890 PMCID: PMC9858945 DOI: 10.3390/genes14010148] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 11/30/2022] [Revised: 12/30/2022] [Accepted: 01/03/2023] [Indexed: 01/09/2023]
Abstract
Fourfold degenerate sites within coding regions and intergenic sites have both been used as estimates of neutral evolution. In chloroplast DNA, the pattern of substitution at intergenic sites is strongly dependent on the composition of the surrounding hexanucleotide composed of the three base pairs on each side, which suggests that the mutation process is highly context-dependent in this genome. This study examines the context-dependency of substitutions at fourfold degenerate sites in protein-coding regions and compares the pattern to what has been observed at intergenic sites. Overall, there is strong similarity between the two types of sites, but there are some intriguing differences. One of these is that substitutions of G and C are significantly higher at fourfold degenerate sites across a range of contexts. In fact, A → T and T → A substitutions are the only substitution types that occur at a lower rate at fourfold degenerate sites. The data are not consistent with selective constraints being responsible for the difference in substitution patterns between intergenic and fourfold degenerate sites. Rather, it is suggested that the difference may be a result of different epigenetic modifications that result in slightly different mutation patterns in coding and intergenic DNA.
11
Shi SL, Liu YQ, Xia RX, Qin L. Comprehensive Analysis of Codon Usage in Quercus Chloroplast Genome and Focus on psbA Gene. Genes (Basel) 2022; 13:2156. [PMID: 36421830 PMCID: PMC9690922 DOI: 10.3390/genes13112156] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Received: 09/21/2022] [Revised: 11/13/2022] [Accepted: 11/15/2022] [Indexed: 10/27/2023]
Abstract
Quercus (oak) is an economically and ecologically important tree genus worldwide, and it is the essential feed for oak silkworms. Chloroplasts play an important role in green plants, but the codon usage of oak chloroplast genomes has not been fully studied. We examined the codon usage of the oak chloroplast genomes in detail to facilitate the understanding of their biology and evolution. We downloaded all the protein-coding genes of 26 non-redundant chloroplast reference genomes, removed short ones and those containing internal stop codons, and finally retained 50 genes shared by all genomes for comparative analyses. The base composition, codon bias, and codon preference are not significantly different between genomes but are significantly different among genes within these genomes. Oak chloroplast genomes prefer T/A-ending codons and avoid C/G-ending codons, and the psbA gene has the same preference except for the codons encoding the amino acid Phe. Complex factors such as context-dependent mutations are the major factors affecting codon usage in these genomes, while selection plays an important role for the psbA gene. Our study provided an important understanding of codon usage in the oak chloroplast genomes and found that the psbA gene has nearly the same codon usage preference as other genes in the oak chloroplasts.
Affiliation(s)
- Run-Xi Xia
- College of Bioscience and Biotechnology, Shenyang Agricultural University, Shenyang 110866, China
12
Morton BR. Substitution rate heterogeneity across hexanucleotide contexts in noncoding chloroplast DNA. G3 GENES|GENOMES|GENETICS 2022; 12:6608088. [PMID: 35699494 PMCID: PMC9339276 DOI: 10.1093/g3journal/jkac150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/27/2022] [Accepted: 06/07/2022] [Indexed: 11/13/2022]
Abstract
Substitutions between closely related noncoding chloroplast DNA sequences are studied with respect to the composition of the 3 bases on each side of the substitution, that is, the hexanucleotide context. There is about 100-fold variation in rate among the contexts, particularly for substitutions of A and T. Rate heterogeneity of transitions differs from that of transversions, resulting in a more than 200-fold variation in the transition:transversion bias. The data are consistent with a CpG effect, and it is shown that both the A + T content and the arrangement of purines/pyrimidines along the same DNA strand are correlated with rate variation. Expected equilibrium A + T content ranges from 36.4% to 82.8% across contexts, while G–C skew ranges from −77.4 to 72.2 and A–T skew ranges from −63.9 to 68.2. The predicted equilibria are associated with specific features of the content of the hexanucleotide context, and also show close agreement with the observed context-dependent compositions. Finally, by controlling for the content of nucleotides closer to the substitution site, it is shown that both the third and fourth nucleotide removed on each side of the substitution directly influence substitution dynamics at that site. Overall, the results demonstrate that noncoding sites in different contexts are evolving along very different evolutionary trajectories and that substitution dynamics are far more complex than typically assumed. This has important implications for a number of types of sequence analysis, particularly analyses of natural selection, and the context-dependent substitution matrices developed here can be applied in future analyses.
Affiliation(s)
- Brian R Morton
- Department of Biology, Barnard College, Columbia University, New York, NY 10027, USA
13
Han S, Bi D, Yi R, Ding H, Wu L, Kan X. Plastome evolution of Aeonium and Monanthes (Crassulaceae): insights into the variation of plastomic tRNAs, and the patterns of codon usage and aversion. PLANTA 2022; 256:35. [PMID: 35809200 DOI: 10.1007/s00425-022-03950-y] [Citation(s) in RCA: 13] [Impact Index Per Article: 4.3] [Received: 03/26/2022] [Accepted: 06/24/2022] [Indexed: 06/15/2023]
Abstract
This study reported 13 new plastomes from Aeonium and Monanthes, and observed new markers for phylogeny and DNA barcoding, such as novel tRNA structures and codon usage bias and aversion. The Macaronesian clade of Crassulaceae consists of three genera: Aichryson, with about 15 species; Monanthes, with about 10 species; and Aeonium, with about 40 species. Within this clade, Aeonium, known as "the botanical equivalent of Darwin's finches", is regarded as an excellent model plant for researching adaptive evolution. In contrast to the well-resolved relationships among the three genera of the Macaronesian clade, the internal branching patterns within the genus Aeonium are largely unclear. In this study, we first reported 13 new plastomes from genus Aeonium and the closely related genus Monanthes. We further performed comprehensive analyses of the plastomes, with a focus on the secondary structures of pttRNAs and the patterns of codon usage and aversion. With a typical circular and quadripartite structure, the 13 plastomes ranged from 149,900 to 151,030 bp in size, and the unique pattern in IR junctions might become a family-specific marker for Crassulaceae species. Surprisingly, the π values of plastomes from Monanthes were almost twice those from Aeonium. Most importantly, we strongly suggest that highly polymorphic regions, novel putative pttRNA structures, and patterns of codon usage bias and aversion derived from plastomes might have phylogenetic implications, and could act as new markers for DNA barcoding of plants. The results of phylogenetic analyses strongly supported a clear internal branching pattern in the Macaronesian clade (represented by Aeonium and Monanthes), with higher nodal support values. The findings reported here will provide new insights into the variation of pttRNAs, and the patterns of codon usage and aversion of the family Crassulaceae.
Affiliation(s)
- Shiyun Han
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China
- De Bi
- Suzhou Polytechnic Institute of Agriculture, Suzhou, 215000, Jiangsu, China
- Ran Yi
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China
- Hengwu Ding
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China
- Longhua Wu
- CAS Key Laboratory of Soil Environment and Pollution Remediation, Institute of Soil Science, Chinese Academy of Sciences, Nanjing, 210008, Jiangsu, China
- Xianzhao Kan
- The Institute of Bioinformatics, College of Life Sciences, Anhui Normal University, Wuhu, 241000, Anhui, China
- Anhui Provincial Key Laboratory of the Conservation and Exploitation of Biological Resources, Wuhu, 241000, Anhui, China