1
|
Kaczanowski S. Detection of positive selection acting on protein surfaces at the whole-genome scale in the human malaria parasite Plasmodium falciparum. INFECTION, GENETICS AND EVOLUTION : JOURNAL OF MOLECULAR EPIDEMIOLOGY AND EVOLUTIONARY GENETICS IN INFECTIOUS DISEASES 2023; 107:105397. [PMID: 36572055 DOI: 10.1016/j.meegid.2022.105397] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2022] [Revised: 09/20/2022] [Accepted: 12/21/2022] [Indexed: 12/24/2022]
Abstract
The host-parasite evolutionary arms race is a fundamental process with medical implications. During this process, the host develops parasite resistance, and the parasite develops host immune evasion strategies. Thus, this process accelerates relevant protein evolution. This study test hypothesizes that proteins subject to sequence evolution structural constraints play a crucial role and that these constraints hinder the modification of such proteins in this process. These hypotheses were tested using Plasmodium falciparum model and evaluated protein structures predicted for the entire proteome by the AlphaFold method. Based on dN/dS test results and P. falciparum and P. reichenowi comparisons, the presented approach identified proteins subject to purifying selection acting on the whole sequence and buried residues (dN < dS) and positive selection on nonburied residues. Of the 26 proteins, some known antigens (ring-exported protein 3, RAP protein, erythrocyte binding antigen-140, and protein P47) targeted by the host immune system are promising vaccine candidates. The set also contained 11 enzymes, including FIKK kinase, which modifies host proteins. This set was compared with genes for which the dN/dS test suggested that positive selection acts on the whole gene (i.e., dN > dS). The present study found that such genes encode enzymes and antigenic vaccine candidates less frequently than genes for which evolution is not subject to selection constraints and positive selection acts on only exposed residues. The analysis was repeated comparing P. falciparum with P. alderi, which is more distantly related. The study discusses the potential implications of the presented methodology for rational vaccine design and the parasitology and evolutionary biology fields.
Collapse
Affiliation(s)
- Szymon Kaczanowski
- Department of Bioinformatics, Institute of Biochemistry and Biophysics, Polish Academy of Sciences, Warsaw, Poland.
| |
Collapse
|
2
|
Villada JC, Brustolini OJB, Batista da Silveira W. Integrated analysis of individual codon contribution to protein biosynthesis reveals a new approach to improving the basis of rational gene design. DNA Res 2017; 24:419-434. [PMID: 28449100 PMCID: PMC5737324 DOI: 10.1093/dnares/dsx014] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2016] [Revised: 03/22/2017] [Accepted: 03/23/2017] [Indexed: 01/21/2023] Open
Abstract
Gene codon optimization may be impaired by the misinterpretation of frequency and optimality of codons. Although recent studies have revealed the effects of codon usage bias (CUB) on protein biosynthesis, an integrated perspective of the biological role of individual codons remains unknown. Unlike other previous studies, we show, through an integrated framework that attributes of codons such as frequency, optimality and positional dependency should be combined to unveil individual codon contribution for protein biosynthesis. We designed a codon quantification method for assessing CUB as a function of position within genes with a novel constraint: the relativity of position-dependent codon usage shaped by coding sequence length. Thus, we propose a new way of identifying the enrichment, depletion and non-uniform positional distribution of codons in different regions of yeast genes. We clustered codons that shared attributes of frequency and optimality. The cluster of non-optimal codons with rare occurrence displayed two remarkable characteristics: higher codon decoding time than frequent-non-optimal cluster and enrichment at the 5'-end region, where optimal codons with the highest frequency are depleted. Interestingly, frequent codons with non-optimal adaptation to tRNAs are uniformly distributed in the Saccharomyces cerevisiae genes, suggesting their determinant role as a speed regulator in protein elongation.
Collapse
Affiliation(s)
- Juan C. Villada
- Department of Microbiology, Universidade Federal de Viçosa, Viçosa 36570-900, Brazil
| | | | | |
Collapse
|
3
|
O'Connell MJ, Doyle AM, Juenger TE, Donoghue MTA, Keshavaiah C, Tuteja R, Spillane C. In Arabidopsis thaliana codon volatility scores reflect GC3 composition rather than selective pressure. BMC Res Notes 2012; 5:359. [PMID: 22805311 PMCID: PMC3502101 DOI: 10.1186/1756-0500-5-359] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/13/2012] [Accepted: 07/17/2012] [Indexed: 11/10/2022] Open
Abstract
BACKGROUND Synonymous codon usage bias has typically been correlated with, and attributed to translational efficiency. However, there are other pressures on genomic sequence composition that can affect codon usage patterns such as mutational biases. This study provides an analysis of the codon usage patterns in Arabidopsis thaliana in relation to gene expression levels, codon volatility, mutational biases and selective pressures. RESULTS We have performed synonymous codon usage and codon volatility analyses for all genes in the A. thaliana genome. In contrast to reports for species from other kingdoms, we find that neither codon usage nor volatility are correlated with selection pressure (as measured by dN/dS), nor with gene expression levels on a genome wide level. Our results show that codon volatility and usage are not synonymous, rather that they are correlated with the abundance of G and C at the third codon position (GC3). CONCLUSIONS Our results indicate that while the A. thaliana genome shows evidence for synonymous codon usage bias, this is not related to the expression levels of its constituent genes. Neither codon volatility nor codon usage are correlated with expression levels or selective pressures but, because they are directly related to the composition of G and C at the third codon position, they are the result of mutational bias. Therefore, in A. thaliana codon volatility and usage do not result from selection for translation efficiency or protein functional shift as measured by positive selection.
Collapse
Affiliation(s)
- Mary J O'Connell
- Bioinformatics and Molecular Evolution Group, School of Biotechnology,Dublin City University, Dublin 9, Ireland
| | | | | | | | | | | | | |
Collapse
|
4
|
Abstract
To detect positive Darwinian selection it is thought essential to compare two sequences. Despite its defects, "the comparative method rules." However, genes evolving rapidly under positive selection conflict more with internal forces (the genome phenotype) than genes evolving slowly under negative selection. In particular, there is conflict with stem-loop potential. The conflict between protein-encoding potential (primary information) and stem-loop potential (secondary information) permits detection of positive selection in a single sequence. The degree to which secondary information is compromised provides a measure of the speed of transmission of primary information. Thus, the sovereignty of the comparative method is challenged not only by its own defects, but also by the availability of a single-sequence method. However, while of limited utility for positive selection, the comparative method casts new light on Darwin's great question — the origin of species. Comparison of rates of synonymous and non-synonymous mutation suggests that branching into new species begins with synonymous mutations.
Collapse
Affiliation(s)
- DONALD R. FORSDYKE
- Department of Biochemistry, Queen's University, Kingston, Ontario, Canada K7L3N6, Canada
| |
Collapse
|
5
|
Archetti M. Genetic robustness at the codon level as a measure of selection. Gene 2009; 443:64-9. [PMID: 19477246 DOI: 10.1016/j.gene.2009.05.009] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2008] [Revised: 05/15/2009] [Accepted: 05/19/2009] [Indexed: 10/20/2022]
Abstract
Selection at the DNA level is usually detected by analysing substitution rates from multiple-species comparisons. It has been suggested that measures of genetic robustness at the codon level, which can be measured by analysing a single coding sequence, can be used to estimate selection, but the validity of these measures has been questioned. Here I test the efficiency of different measures of genetic robustness at the codon level to estimate the level of selection acting on a gene. I find that volatility and other measures of robustness are correlated with dN/dS, and that this is not simply the effect of a preference for translationally optimal codons. I discuss the possible implications and the possible problems of these methods based on single-sequence codon usage analysis.
Collapse
|
6
|
Azuaje F, Ramirez JL, Da Silveira JF. An exploration of the genetic robustness landscape of surface protein families in the human protozoan parasite Trypanosoma cruzi. IEEE Trans Nanobioscience 2007; 6:223-8. [PMID: 17926780 DOI: 10.1109/tnb.2007.903482] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
The ability of genes to be robust to mutations at the codon level has been suggested as a key factor for understanding adaptation features. It has been proposed that genes relevant to host-parasite interactions will tend to exhibit high volatility or "antirobust" patterns, which may be related to the capacity of the parasite to evade the host immune system. We compared two superfamilies of surface proteins, trans-sialidase (TS)-like proteins and putative surface protein dispersed gene family-1 (DGF-1), in the parasite Trypanosoma cruzi in terms of a measure of gene volatility. We proposed alternative codon robustness indicators based on cross entropy and impurity of amino acids encoded by point-mutations, which were compared to a volatility estimator previously published. This allowed us to present a more detailed description of the differences between families. A significant difference was observed in terms of these scores, with the TS-MVar1 and the DGF-1 families showing the highest and lowest gene volatility values respectively. The cross entropy and impurity estimators suggest that the MVar1 levels of volatility are linearly correlated with their capacity to generate diverse sets of amino acids as a consequence of potential mutations. This study indicates the feasibility of applying different measures of genetic robustness to detect variations between potential drug targets at the protein level.
Collapse
Affiliation(s)
- Francisco Azuaje
- Computer Science Research Institute, University of Ulster, Jordanstown, BT37 OQB, UK.
| | | | | |
Collapse
|
7
|
Plotkin JB, Dushoff J, Desai MM, Fraser HB. Codon usage and selection on proteins. J Mol Evol 2006; 63:635-53. [PMID: 17043750 DOI: 10.1007/s00239-005-0233-x] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/03/2005] [Accepted: 06/12/2006] [Indexed: 10/24/2022]
Abstract
Selection pressures on proteins are usually measured by comparing homologous nucleotide sequences (Zuckerkandl and Pauling 1965). Recently we introduced a novel method, termed volatility, to estimate selection pressures on proteins on the basis of their synonymous codon usage (Plotkin and Dushoff 2003; Plotkin et al. 2004). Here we provide a theoretical foundation for this approach. Under the Fisher-Wright model, we derive the expected frequencies of synonymous codons as a function of the strength of selection on amino acids, the mutation rate, and the effective population size. We analyze the conditions under which we can expect to draw inferences from biased codon usage, and we estimate the time scales required to establish and maintain such a signal. We find that synonymous codon usage can reliably distinguish between negative selection and neutrality only for organisms, such as some microbes, that experience large effective population sizes or periods of elevated mutation rates. The power of volatility to detect positive selection is also modest--requiring approximately 100 selected sites--but it depends less strongly on population size. We show that phenomena such as transient hyper-mutators can improve the power of volatility to detect selection, even when the neutral site heterozygosity is low. We also discuss several confounding factors, neglected by the Fisher-Wright model, that may limit the applicability of volatility in practice.
Collapse
Affiliation(s)
- Joshua B Plotkin
- Department of Biology, University of Pennsylvania, Philadelphia, PA 19104, USA.
| | | | | | | |
Collapse
|
8
|
Hershberg U, Shlomchik MJ. Differences in potential for amino acid change after mutation reveals distinct strategies for kappa and lambda light-chain variation. Proc Natl Acad Sci U S A 2006; 103:15963-8. [PMID: 17038496 PMCID: PMC1635110 DOI: 10.1073/pnas.0607581103] [Citation(s) in RCA: 32] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
B cells generate varied yet functional clones under high rates of mutation of their V genes. It has been proposed that as a result of the opposing demands of diversification and preservation of integrity, the V genes of heavy and light chains have evolved to overexpress codons prone to amino acid change in their complementarity determining regions (CDR) compared with the framework (FW) regions. We have analyzed the germ-line V genes of heavy and light chains (both kappa and lambda), comparing codons of CDR and FW of the germ-line V regions both to each other and to control regions. We found that in both germ-line heavy chains and lambda chains, CDR codons are prone to replacement mutations, whereas in the FW, the opposite is true. Furthermore, the difference between CDR and FW in heavy chains and lambda chains is based on codons that are prone to nonconservative changes of amino acid. In contrast, in germ-line kappa chains, the codons in both CDR and FW are more prone to replacement mutations. We also demonstrated that negative selection during immune responses is more sensitive to nonconservative amino acid substitutions than overall amino acid change, demonstrating the applicability of our analysis to real-time process of selection in the immune system. The differences in germ-line kappa and lambda light chains' potential reaction to mutation suggests that via these two differently evolved light-chain types, the B cell repertoire encompasses two different strategies to balance diversity and stability in an immune response.
Collapse
Affiliation(s)
- Uri Hershberg
- Department of Laboratory Medicine and Section of Immunobiology, Yale University School of Medicine, 1 Gilbert Street, New Haven, CT 06510
| | - Mark J. Shlomchik
- Department of Laboratory Medicine and Section of Immunobiology, Yale University School of Medicine, 1 Gilbert Street, New Haven, CT 06510
- *To whom correspondence should be addressed. E-mail:
| |
Collapse
|
9
|
Kidgell C, Volkman SK, Daily J, Borevitz JO, Plouffe D, Zhou Y, Johnson JR, Le Roch KG, Sarr O, Ndir O, Mboup S, Batalov S, Wirth DF, Winzeler EA. A systematic map of genetic variation in Plasmodium falciparum. PLoS Pathog 2006; 2:e57. [PMID: 16789840 PMCID: PMC1480597 DOI: 10.1371/journal.ppat.0020057] [Citation(s) in RCA: 153] [Impact Index Per Article: 8.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/10/2006] [Accepted: 04/28/2006] [Indexed: 11/25/2022] Open
Abstract
Discovering novel genes involved in immune evasion and drug resistance in the human malaria parasite, Plasmodium falciparum, is of critical importance to global health. Such knowledge may assist in the development of new effective vaccines and in the appropriate use of antimalarial drugs. By performing a full-genome scan of allelic variability in 14 field and laboratory strains of P. falciparum, we comprehensively identified ≈500 genes evolving at higher than neutral rates. The majority of the most variable genes have paralogs within the P. falciparum genome and may be subject to a different evolutionary clock than those without. The group of 211 variable genes without paralogs contains most known immunogens and a few drug targets, consistent with the idea that the human immune system and drug use is driving parasite evolution. We also reveal gene-amplification events including one surrounding pfmdr1, the P. falciparum multidrug-resistance gene, and a previously uncharacterized amplification centered around the P. falciparum GTP cyclohydrolase gene, the first enzyme in the folate biosynthesis pathway. Although GTP cyclohydrolase is not the known target of any current drugs, downstream members of the pathway are targeted by several widely used antimalarials. We speculate that an amplification of the GTP cyclohydrolase enzyme in the folate biosynthesis pathway may increase flux through this pathway and facilitate parasite resistance to antifolate drugs. Variability in the genome of the human malaria parasite, Plasmodium falciparum, is key to the parasite's ability to cause disease and overcome therapeutic interventions such as drugs and vaccines. Elucidating the extent of genetic variation in the malaria parasite will therefore be central to decreasing the malaria disease burden. The authors performed a full-genome scan of variability in different strains of P. falciparum and observed a nonrandom distribution of variation. In particular, those genes that are predicted to have roles in evading the host immune response or antimalarial drugs show significantly higher levels of variation. In addition, the authors speculate that a previously unreported genome amplification in the folate biosynthesis pathway correlates with resistance to the antimalarial drug sulfadoxine. Such data enable hypotheses to be made about the function of many of the unknown elements in the parasite's genome, which may permit the identification of new targets that can be investigated for incorporation into a malaria vaccine and may aid in the understanding of how the parasite withstands drug pressure.
Collapse
Affiliation(s)
- Claire Kidgell
- Department of Cell Biology, The Scripps Research Institute, La Jolla, California, United States of America
| | - Sarah K Volkman
- Department of Immunology and Infectious Diseases, Harvard School of Public Health, Boston, Massachusetts, United States of America
| | - Johanna Daily
- Department of Immunology and Infectious Diseases, Harvard School of Public Health, Boston, Massachusetts, United States of America
| | - Justin O Borevitz
- Plant Biology Laboratory, Salk Institute for Biological Studies, La Jolla, California, United States of America
| | - David Plouffe
- Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
| | - Yingyao Zhou
- Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
| | - Jeffrey R Johnson
- Department of Cell Biology, The Scripps Research Institute, La Jolla, California, United States of America
| | - Karine G. Le Roch
- Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
| | - Ousmane Sarr
- Faculty of Medicine and Pharmacy, Cheikh Anta Diop University, Dakar, Senegal
| | - Omar Ndir
- Faculty of Medicine and Pharmacy, Cheikh Anta Diop University, Dakar, Senegal
| | - Soulyemane Mboup
- Faculty of Medicine and Pharmacy, Cheikh Anta Diop University, Dakar, Senegal
| | - Serge Batalov
- Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
| | - Dyann F Wirth
- Department of Immunology and Infectious Diseases, Harvard School of Public Health, Boston, Massachusetts, United States of America
| | - Elizabeth A Winzeler
- Department of Cell Biology, The Scripps Research Institute, La Jolla, California, United States of America
- Genomics Institute of the Novartis Research Foundation, San Diego, California, United States of America
- * To whom correspondence should be addressed. E-mail:
| |
Collapse
|
10
|
Abstract
Synonymous codons are neutral at the protein level, therefore natural selection at the protein level should have no effect on their frequencies. Synonymous codons, however, differ in their capacity to reduce the effects of errors: after mutation, certain codons keep on coding for the same amino acid or for amino acids with similar properties, while other synonymous codons produce very different amino acids. Therefore, the impact of errors on a coding sequence (genetic robustness) can be measured by analysing its codon usage. I analyse the codon usage of sequenced nuclear and cytoplasmic genomes and I show that there is an extensive variation in genetic robustness at the DNA sequence level, both among genomes and among genes of the same genome. I also show theoretically that robustness can be adaptive, that is natural selection may lead to a preference for codons that reduce the impact of errors. If selection occurs only among the mutants of a codon (e.g. among the progeny before the adult phase), however, the codons that are more sensitive to the effects of mutations may increase in frequency because they manage to get rid more easily of deleterious mutations. I also suggest other possible explanations for the evolution of genetic robustness at the codon level.
Collapse
Affiliation(s)
- M Archetti
- Department of Zoology, Oxford University, Oxford, UK.
| |
Collapse
|
11
|
Plotkin JB, Dushoff J, Desai MM, Fraser HB. Estimating selection pressures from limited comparative data. Mol Biol Evol 2006; 23:1457-9. [PMID: 16754640 DOI: 10.1093/molbev/msl021] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
We recently introduced a novel method for estimating selection pressures on proteins, termed "volatility," which requires only a single genome sequence. Some criticisms that have been levied against this approach are valid, but many others are based on misconceptions of volatility, or they apply equally to comparative methods of estimating selection. Here, we introduce a simple regression technique for estimating selection pressures on all proteins in a genome, on the basis of limited comparative data. The regression technique does not depend on an underlying population-genetic mechanism. This new approach to estimating selection across a genome should be more powerful and more widely applicable than volatility itself.
Collapse
|
12
|
Kartavtsev YP, Lee JS. Analysis of nucleotide diversity at the cytochrome b and cytochrome oxidase 1 genes at the population, species, and genus levels. RUSS J GENET+ 2006. [DOI: 10.1134/s1022795406040016] [Citation(s) in RCA: 53] [Impact Index Per Article: 2.9] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/23/2022]
|
13
|
Tang H, Wu CI. A new method for estimating nonsynonymous substitutions and its applications to detecting positive selection. Mol Biol Evol 2005; 23:372-9. [PMID: 16237204 DOI: 10.1093/molbev/msj043] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
The standard methods for computing the number of nonsynonymous substitutions (Ka) lump all amino acid changes into one single class, even though their rates of substitution vary by at least 10-fold (Tang et al., 2004). Classifying these changes by their physicochemical properties has not been suitably effective in isolating the fastest evolving classes of changes. We now propose to use the Universal index U of Tang et al. (2004) to classify the 75 elementary amino acid changes (codons differing by 1 bp) by their evolutionary exchangeability. Let Ki denote the Ka value of each class (i = 1, ..., 75 from the most to the least exchangeable). The cumulative Ki for the top 10 classes, denoted Kh (for high-exchangeability types), has two important properties: (1) Kh usually accounts for 25%-30% of total amino acid changes and (2) when the observed number of amino acid substitutions is large, Kh is predictably twice the value of Ka. This shall be referred to as the twofold approximation. The new method for estimating Kh is applied to the comparisons between human and macaque and between mouse and rat. The twofold approximation holds well in these data sets, and the signature of positive selection can be more easily discerned using the Kh statistic than using Ka. Many genes with Ka/Ks > 0.5 can now be shown to have Kh/Ks > 1 and to have evolved adaptively, at least for the high-exchangeability group of amino acid changes.
Collapse
Affiliation(s)
- Hua Tang
- Department of Ecology and Evolution, University of Chicago, USA
| | | |
Collapse
|
14
|
Meyers LA, Ancel FD, Lachmann M. Evolution of genetic potential. PLoS Comput Biol 2005; 1:236-43. [PMID: 16158095 PMCID: PMC1193991 DOI: 10.1371/journal.pcbi.0010032] [Citation(s) in RCA: 46] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2005] [Accepted: 07/22/2005] [Indexed: 11/25/2022] Open
Abstract
Organisms employ a multitude of strategies to cope with the dynamical environments in which they live. Homeostasis and physiological plasticity buffer changes within the lifetime of an organism, while stochastic developmental programs and hypermutability track changes on longer time-scales. An alternative long-term mechanism is "genetic potential"--a heightened sensitivity to the effects of mutation that facilitates rapid evolution to novel states. Using a transparent mathematical model, we illustrate the concept of genetic potential and show that as environmental variability decreases, the evolving population reaches three distinct steady state conditions: (1) organismal flexibility, (2) genetic potential, and (3) genetic robustness. As a specific example of this concept we examine fluctuating selection for hydrophobicity in a single amino acid. We see the same three stages, suggesting that environmental fluctuations can produce allele distributions that are distinct not only from those found under constant conditions, but also from the transient allele distributions that arise under isolated selective sweeps.
Collapse
Affiliation(s)
- Lauren Ancel Meyers
- Section of Integrative Biology, Institute for Cellular and Molecular Biology, University of Texas, Austin, Texas, USA.
| | | | | |
Collapse
|
15
|
Pillai SK, Kosakovsky Pond SL, Woelk CH, Richman DD, Smith DM. Codon volatility does not reflect selective pressure on the HIV-1 genome. Virology 2005; 336:137-43. [PMID: 15892955 DOI: 10.1016/j.virol.2005.03.014] [Citation(s) in RCA: 11] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/11/2005] [Revised: 03/14/2005] [Accepted: 03/16/2005] [Indexed: 10/25/2022]
Abstract
Codon volatility is defined as the proportion of a codon's point-mutation neighbors that encode different amino acids. The cumulative volatility of a gene in relation to its associated genome was recently reported to be an indicator of selection pressure. We used this approach to measure selection on all available full-length HIV-1 subtype B genomes in the Los Alamos HIV Sequence Database, and compared these estimates against those obtained via established likelihood- and distance-based comparative methods. Volatility failed to correlate with the results of any of the comparative methods demonstrating that it is not a reliable indicator of selection pressure.
Collapse
Affiliation(s)
- Satish K Pillai
- University of California, San Diego, Stein Clinical Research Building, Room 325, 9500 Gilman Drive 0679, La Jolla 92093, USA
| | | | | | | | | |
Collapse
|
16
|
Abstract
It has been suggested that volatility, the proportion of mutations which change an amino acid, can be used to infer the level of natural selection acting upon a gene. This conjecture is supported by a correlation between volatility and the rate of nonsynonymous substitution (dN), or the ratio of nonsynonymous and synonymous substitution rates, in a variety of organisms. These organisms include yeast, in which the correlations are quite strong. Here we show that these correlations are a by-product of a correlation between synonymous codon bias toward translationally optimal codons and dN. Although this analysis suggests that volatility is not a good measure of the selection, we suggest that it might be possible to infer something about the level of natural selection, from a single genome sequence, using translational codon bias.
Collapse
Affiliation(s)
- Nina Stoletzki
- Section of Evolutionary Biology, Department Biology II, Ludwig-Maximilians-University Munich, Planegg-Martinsried, Germany.
| | | | | | | |
Collapse
|
17
|
|