1
Farookhi H, Xia X. Differential Selection for Translation Efficiency Shapes Translation Machineries in Bacterial Species. Microorganisms 2024; 12:768. [PMID: 38674712 PMCID: PMC11052298 DOI: 10.3390/microorganisms12040768] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Received: 03/05/2024] [Revised: 04/01/2024] [Accepted: 04/09/2024] [Indexed: 04/28/2024] Open
Abstract
Different bacterial species have dramatically different generation times, from 20-30 min in Escherichia coli to about two weeks in Mycobacterium leprae. The translation machinery in a cell needs to synthesize all proteins for a new cell in each generation. The three subprocesses of translation, i.e., initiation, elongation, and termination, are expected to be under stronger selection pressure to optimize in short-generation bacteria (SGB) such as Vibrio natriegens than in the long-generation Mycobacterium leprae. The initiation efficiency depends on the start codon decoded by the initiation tRNA, the optimal Shine-Dalgarno (SD) decoded by the anti-SD (aSD) sequence on small subunit rRNA, and the secondary structure that may embed the initiation signals and prevent them from being decoded. The elongation efficiency depends on the tRNA pool and codon usage. The termination efficiency in bacteria depends mainly on the nature of the stop codon and the nucleotide immediately downstream of the stop codon. By contrasting SGB with long-generation bacteria (LGB), we predict (1) SGB to have more ribosome RNA operons to produce ribosomes, and more tRNA genes for carrying amino acids to ribosomes, (2) SGB to have a higher percentage of genes using AUG as the start codon and UAA as the stop codon than LGB, (3) SGB to exhibit better codon and anticodon adaptation than LGB, and (4) SGB to have a weaker secondary structure near the translation initiation signals than LGB. These differences between SGB and LGB should be more pronounced in highly expressed genes than the rest of the genes. We present empirical evidence in support of these predictions.
Affiliation(s)
- Heba Farookhi
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
- Xuhua Xia
- Department of Biology, University of Ottawa, Ottawa, ON K1N 6N5, Canada;
- Ottawa Institute of Systems Biology, University of Ottawa, Ottawa, ON K1H 8M5, Canada

2
Ho AT, Hurst LD. Stop codon usage as a window into genome evolution: mutation, selection, biased gene conversion and the TAG paradox. Genome Biol Evol 2022; 14:6648529. [PMID: 35867377 PMCID: PMC9348620 DOI: 10.1093/gbe/evac115] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Accepted: 07/17/2022] [Indexed: 11/16/2022] Open
Abstract
Protein coding genes terminate with one of three stop codons (TAA, TGA, or TAG) that, like synonymous codons, are not employed equally. With TGA and TAG having identical nucleotide content, analysis of their differential usage provides an unusual window into the forces operating on what are ostensibly functionally identical residues. Across genomes and between isochores within the human genome, TGA usage increases with G + C content but, with a common G + C → A + T mutation bias, this cannot be explained by mutation bias-drift equilibrium. Increased usage of TGA in G + C-rich genomes or genomic regions is also unlikely to reflect selection for the optimal stop codon, as TAA appears to be universally optimal, probably because it has the lowest read-through rate. Despite TAA being favored by selection and mutation bias, as with codon usage bias G + C pressure is the prime determinant of between-species TGA usage trends. In species with strong G + C-biased gene conversion (gBGC), such as mammals and birds, the high usage and conservation of TGA is best explained by an A + T → G + C repair bias. How to explain TGA enrichment in other G + C-rich genomes is less clear. Enigmatically, across bacterial and archaeal species and between human isochores TAG usage is mostly unresponsive to G + C pressure. This unresponsiveness we dub the TAG paradox as currently no mutational, selective, or gBGC model provides a well-supported explanation. That TAG does increase with G + C usage across eukaryotes makes the usage elsewhere yet more enigmatic. We suggest resolution of the TAG paradox may provide insights into either an unknown but common selective preference (probably at the DNA/RNA level) or an unrecognized complexity to the action of gBGC.
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, UK

3
Ho AT, Hurst LD. Unusual mammalian usage of TGA stop codons reveals that sequence conservation need not imply purifying selection. PLoS Biol 2022; 20:e3001588. [PMID: 35550630 PMCID: PMC9129041 DOI: 10.1371/journal.pbio.3001588] [Citation(s) in RCA: 5] [Impact Index Per Article: 2.5] [Received: 02/17/2022] [Revised: 05/24/2022] [Accepted: 04/20/2022] [Indexed: 11/18/2022] Open
Abstract
The assumption that conservation of sequence implies the action of purifying selection is central to diverse methodologies to infer functional importance. GC-biased gene conversion (gBGC), a meiotic mismatch repair bias strongly favouring GC over AT, can in principle mimic the action of selection, this being thought to be especially important in mammals. As mutation is GC→AT biased, to demonstrate that gBGC does indeed cause false signals requires evidence that an AT-rich residue is selectively optimal compared to its more GC-rich allele, while showing also that the GC-rich alternative is conserved. We propose that mammalian stop codon evolution provides a robust test case. Although in most taxa TAA is the optimal stop codon, TGA is both abundant and conserved in mammalian genomes. We show that this mammalian exceptionalism is well explained by gBGC mimicking purifying selection and that TAA is the selectively optimal codon. Supportive of gBGC, we observe (i) TGA usage trends are consistent at the focal stop codon and elsewhere (in UTR sequences); (ii) that higher TGA usage and higher TAA→TGA substitution rates are predicted by a high recombination rate; and (iii) across species the difference in TAA <-> TGA substitution rates between GC-rich and GC-poor genes is largest in genomes that possess higher between-gene GC variation. TAA optimality is supported both by enrichment in highly expressed genes and trends associated with effective population size. High TGA usage and high TAA→TGA rates in mammals are thus consistent with gBGC’s predicted ability to “drive” deleterious mutations and supports the hypothesis that sequence conservation need not be indicative of purifying selection. A general trend for GC-rich trinucleotides to reside at frequencies far above their mutational equilibrium in high recombining domains supports the generality of these results.
Affiliation(s)
- Alexander Thomas Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom

4
Ho AT, Hurst LD. Variation in Release Factor Abundance Is Not Needed to Explain Trends in Bacterial Stop Codon Usage. Mol Biol Evol 2022; 39:msab326. [PMID: 34751397 PMCID: PMC8789281 DOI: 10.1093/molbev/msab326] [Citation(s) in RCA: 2] [Impact Index Per Article: 1.0] [Indexed: 12/22/2022] Open
Abstract
In bacteria, stop codons are recognized by one of two class I release factors: RF1 recognizes TAG, RF2 recognizes TGA, and TAA is recognized by both. Variation across bacteria in the relative abundance of RF1 and RF2 is thus hypothesized to select for different TGA/TAG usage. This has been supported by correlations between TAG:TGA ratios and RF1:RF2 ratios across multiple bacterial species, potentially also explaining why TAG usage is approximately constant despite extensive variation in GC content. It is, however, possible that stop codon trends are determined by other forces and that RF ratios adapt to stop codon usage, rather than vice versa. Here, we determine which direction of the causal arrow is the more parsimonious. Our results support the notion that RF1/RF2 ratios become adapted to stop codon usage as the same trends, notably the anomalous TAG behavior, are seen in contexts where RF1:RF2 ratios cannot be, or are unlikely to be, causative, that is, at 3' untranslated sites never used for translation termination, in intragenomic analyses, and across archaeal species (that possess only one RF1). We conclude that specifics of RF biology are unlikely to fully explain TGA/TAG relative usage. We discuss why the causal relationships for the evolution of synonymous stop codon usage might be different from those affecting synonymous sense codon usage, noting that transitions between TGA and TAG require two point mutations, one of which is likely to be deleterious.
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Laurence D Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom

5
Ho AT, Hurst LD. Effective Population Size Predicts Local Rates but Not Local Mitigation of Read-through Errors. Mol Biol Evol 2021; 38:244-262. [PMID: 32797190 PMCID: PMC7783166 DOI: 10.1093/molbev/msaa210] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Indexed: 12/15/2022] Open
Abstract
In correctly predicting that selection efficiency is positively correlated with the effective population size (Ne), the nearly neutral theory provides a coherent understanding of between-species variation in numerous genomic parameters, including heritable error (germline mutation) rates. Does the same theory also explain variation in phenotypic error rates and in abundance of error mitigation mechanisms? Translational read-through provides a model to investigate both issues as it is common, mostly nonadaptive, and has a good proxy for rate (TAA being the least leaky stop codon) and potential error mitigation via "fail-safe" 3' additional stop codons (ASCs). Prior theory of translational read-through has suggested that when population sizes are high, weak selection for local mitigation can be effective, thus predicting a positive correlation between ASC enrichment and Ne. Contra to prediction, we find that ASC enrichment is not correlated with Ne. ASC enrichment, although highly phylogenetically patchy, is, however, more common both in unicellular species and in genes expressed in unicellular modes in multicellular species. By contrast, Ne does positively correlate with TAA enrichment. These results imply that local phenotypic error rates, not local mitigation rates, are consistent with a drift barrier/nearly neutral model.
Affiliation(s)
- Alexander T Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Laurence D Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom

6
Ho AT, Hurst LD. In eubacteria, unlike eukaryotes, there is no evidence for selection favouring fail-safe 3' additional stop codons. PLoS Genet 2019; 15:e1008386. [PMID: 31527909 PMCID: PMC6764699 DOI: 10.1371/journal.pgen.1008386] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.0] [Received: 06/11/2019] [Revised: 09/27/2019] [Accepted: 08/27/2019] [Indexed: 12/23/2022] Open
Abstract
Errors throughout gene expression are likely deleterious, hence genomes are under selection to ameliorate their consequences. Additional stop codons (ASCs) are in-frame nonsense ‘codons’ downstream of the primary stop which may be read by translational machinery should the primary stop have been accidentally read through. Prior evidence in several eukaryotes suggests that ASCs are selected to prevent potentially-deleterious consequences of read-through. We extend this evidence showing that enrichment of ASCs is common but not universal for single cell eukaryotes. By contrast, there is limited evidence as to whether the same is true in other taxa. Here, we provide the first systematic test of the hypothesis that ASCs act as a fail-safe mechanism in eubacteria, a group with high read-through rates. Contra to the predictions of the hypothesis we find: there is paucity, not enrichment, of ASCs downstream; substitutions that degrade stops are more frequent in-frame than out-of-frame in 3’ sequence; highly expressed genes are no more likely to have ASCs than lowly expressed genes; usage of the leakiest primary stop (TGA) in highly expressed genes does not predict ASC enrichment even compared to usage of non-leaky stops (TAA) in lowly expressed genes, beyond downstream codon +1. Any effect at the codon immediately proximal to the primary stop can be accounted for by a preference for a T/U residue immediately following the stop, although if anything, TT- and TC- starting codons are preferred. We conclude that there is no compelling evidence for ASC selection in eubacteria. This presents an unusual case in which the same error could be solved by the same mechanism in eukaryotes and prokaryotes but is not. We discuss two possible explanations: that, owing to the absence of nonsense mediated decay, bacteria may solve read-through via gene truncation and in eukaryotes certain prion states cause raised read-through rates.
In all organisms, gene expression is error-prone. One such error, translational read-through, occurs where the primary stop codon of an expressed gene is missed by the translational machinery. Failure to terminate is likely to be costly, hence genomes are under selection to prevent this from happening. One proposed error-proofing strategy involves in-frame proximal additional stop codons (ASCs) which may act as a ‘fail-safe’ mechanism by providing another opportunity for translation to terminate. There is evidence for ASC enrichment in several eukaryotes. We extend this evidence showing it to be common but not universal in single celled eukaryotes. However, the situation in bacteria is poorly understood, despite bacteria having high read-through rates. Here, we test the fail-safe hypothesis within a broad range of bacteria. To our surprise, we find that not only are ASCs not enriched, but they may even be selected against. This provides evidence for an unusual circumstance where eukaryotes and prokaryotes could solve the same problem the same way but don’t. What are we to make of this? We suggest that if read-through is the problem, ASCs are not necessarily the expected solution. Owing to the absence of nonsense-mediated decay, a process that makes gene truncation in eukaryotes less viable, we propose bacteria may rescue a leaky stop by mutation that creates a new stop upstream. Alternatively, raised read-through rates in some particular conditions in eukaryotes might explain the difference.
Affiliation(s)
- Alexander T. Ho
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom
- Laurence D. Hurst
- Milner Centre for Evolution, University of Bath, Bath, United Kingdom

7
Abrahams L, Hurst LD. Refining the Ambush Hypothesis: Evidence That GC- and AT-Rich Bacteria Employ Different Frameshift Defence Strategies. Genome Biol Evol 2018; 10:1153-1173. [PMID: 29617761 PMCID: PMC5909447 DOI: 10.1093/gbe/evy075] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Accepted: 03/30/2018] [Indexed: 12/13/2022] Open
Abstract
Stop codons are frequently selected for beyond their regular termination function for error control. The “ambush hypothesis” proposes out-of-frame stop codons (OSCs) terminating frameshifted translations are selected for. Although early indirect evidence was partially supportive, recent evidence suggests OSC frequencies are not exceptional when considering underlying nucleotide content. However, prior null tests fail to control amino acid/codon usages or possible local mutational biases. We therefore return to the issue using bacterial genomes, considering several tests defining and testing against a null. We employ simulation approaches preserving amino acid order but shuffling synonymous codons or preserving codons while shuffling amino acid order. Additionally, we compare codon usage in amino acid pairs, where one codon can but the next, otherwise identical codon, cannot encode an OSC. OSC frequencies exceed expectations typically in AT-rich genomes, the +1 frame and for TGA/TAA but not TAG. With this complex evidence, simply rejecting or accepting the ambush hypothesis is not warranted. We propose a refined post hoc model, whereby AT-rich genomes have more accidental frameshifts, handled by RF2–RF3 complexes (associated with TGA/TAA) and are mostly +1 (or −2) slips. Supporting this, excesses positively correlate with in silico predicted frameshift probabilities. Thus, we propose a more viable framework, whereby genomes broadly adopt one of the two strategies to combat frameshifts: preventing frameshifting (GC-rich) or permitting frameshifts but minimizing impacts when most are caught early (AT-rich). Our refined framework holds promise yet some features, such as the bias of out-of-frame sense codons, remain unexplained.
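As a rough illustration of what an out-of-frame stop codon (OSC) count involves, the Python sketch below tallies stop trinucleotides in the two shifted reading frames of a single coding sequence. It is not the authors' method: the simulation-based null models described in the abstract (shuffling synonymous codons or amino acid order) are not implemented here. The function name `count_out_of_frame_stops`, the offset labels, and the example sequence are hypothetical and chosen only for illustration.

```python
from typing import Dict

# The three standard stop codons, written in DNA form.
STOPS = {"TAA", "TAG", "TGA"}


def count_out_of_frame_stops(cds: str) -> Dict[int, int]:
    """Count stop trinucleotides in the frames starting at offsets 1 and 2.

    Assumption: `cds` starts in frame (offset 0 is the annotated reading
    frame). Offset 1 is the frame reached after a +1 slip; offset 2 is the
    frame reached after a +2 (equivalently -1) slip.
    """
    seq = cds.upper().replace("U", "T")  # accept RNA or DNA input
    counts = {}
    for offset in (1, 2):
        # Step through complete trinucleotides in the shifted frame.
        counts[offset] = sum(
            1
            for i in range(offset, len(seq) - 2, 3)
            if seq[i:i + 3] in STOPS
        )
    return counts


# Hypothetical toy sequence: ATG ATA GCT GAA TAA in frame.
print(count_out_of_frame_stops("ATGATAGCTGAATAA"))  # {1: 2, 2: 1}
```

Raw counts like these only become evidence for or against the ambush hypothesis once compared with an expectation that controls for nucleotide content, amino acid usage, and codon usage, which is the point the abstract makes about prior null tests.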
Affiliation(s)
- Liam Abrahams
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, United Kingdom
- Laurence D Hurst
- Department of Biology and Biochemistry, The Milner Centre for Evolution, University of Bath, United Kingdom

8
Abstract
Codon usage depends on mutation bias, tRNA-mediated selection, and the need for high efficiency and accuracy in translation. One codon in a synonymous codon family is often strongly over-used, especially in highly expressed genes, which often leads to a high dN/dS ratio because dS is very small. Many different codon usage indices have been proposed to measure codon usage and codon adaptation. Sense codons could be misread by release factors and stop codons by tRNAs, which also contributes to codon usage in rare cases. This chapter outlines the conceptual framework on codon evolution, illustrates codon-specific and gene-specific codon usage indices, and presents their applications. A new index for codon adaptation that accounts for background mutation bias (Index of Translation Elongation) is presented and contrasted with the codon adaptation index (CAI), which does not consider background mutation bias. They are used to re-analyze data from a recent paper claiming that translation elongation efficiency matters little in protein production. The reanalysis disproves the claim.

9
Wong HE, Huang CJ, Zhang Z. Amino acid misincorporation in recombinant proteins. Biotechnol Adv 2017; 36:168-181. [PMID: 29107148 DOI: 10.1016/j.biotechadv.2017.10.006] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.6] [Received: 06/13/2017] [Revised: 09/12/2017] [Accepted: 10/24/2017] [Indexed: 11/26/2022]
Abstract
Proteins provide the molecular basis for cellular structure, catalytic activity, signal transduction, and molecular transport in biological systems. Recombinant protein expression is widely used to prepare and manufacture novel proteins that serve as the foundation of many biopharmaceutical products. However, protein translation bioprocesses are inherently prone to low-level errors. These sequence variants caused by amino acid misincorporation have been observed in both native and recombinant proteins. Protein sequence variants impact product quality, and their presence can be exacerbated through cellular stress, overexpression, and nutrient starvation. Therefore, the cell line selection process, which is used in the biopharmaceutical industry, is not only directed towards maximizing productivity, but also focuses on selecting clones which yield low sequence variant levels, thereby proactively avoiding potentially inauspicious patient safety and efficacy outcomes. Here, we summarize a number of hallmark studies aimed at understanding the mechanisms of amino acid misincorporation, as well as exacerbating factors, and mitigation strategies. We also describe key advances in analytical technologies in the identification and quantification of sequence variants, and some practical considerations when using LC-MS/MS for detecting sequence variants.
Affiliation(s)
- H Edward Wong
- Process Development, Amgen Inc., 1 Amgen Center Drive, Thousand Oaks, CA 91320, United States
- Chung-Jr Huang
- Process Development, Amgen Inc., 1 Amgen Center Drive, Thousand Oaks, CA 91320, United States
- Zhongqi Zhang
- Process Development, Amgen Inc., 1 Amgen Center Drive, Thousand Oaks, CA 91320, United States.

10
Wei Y, Xia X. The Role of +4U as an Extended Translation Termination Signal in Bacteria. Genetics 2017; 205:539-549. [PMID: 27903612 PMCID: PMC5289835 DOI: 10.1534/genetics.116.193961] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.0] [Received: 07/17/2016] [Accepted: 11/05/2016] [Indexed: 12/19/2022] Open
Abstract
Termination efficiency of stop codons depends on the first 3' flanking (+4) base in bacteria and eukaryotes. In both Escherichia coli and Saccharomyces cerevisiae, termination read-through is reduced in the presence of +4U; however, the molecular mechanism underlying +4U function is poorly understood. Here, we perform comparative genomics analysis on 25 bacterial species (covering Actinobacteria, Bacteroidetes, Cyanobacteria, Deinococcus-Thermus, Firmicutes, Proteobacteria, and Spirochaetae) with bioinformatics approaches to examine the influence of +4U in bacterial translation termination by contrasting highly- and lowly-expressed genes (HEGs and LEGs, respectively). We estimated gene expression using the recently formulated Index of Translation Elongation, ITE, and identified stop codon near-cognate transfer RNAs (tRNAs) from well-annotated genomes. We show that +4U was consistently overrepresented in UAA-ending HEGs relative to LEGs. The result is consistent with the interpretation that +4U enhances termination mainly for UAA. Usage of +4U decreases in GC-rich species where most stop codons are UGA and UAG, with few UAA-ending genes, which is expected if UAA usage in HEGs drives up +4U usage. In HEGs, +4U usage increases significantly with abundance of UAA nc_tRNAs (near-cognate tRNAs that decode codons differing from UAA by a single nucleotide), particularly those with a mismatch at the first stop codon site. UAA is always the preferred stop codon in HEGs, and our results suggest that UAAU is the most efficient translation termination signal in bacteria.
Affiliation(s)
- Yulong Wei
- Department of Biology, University of Ottawa, Ontario K1N 6N5, Canada
- Xuhua Xia
- Department of Biology, University of Ottawa, Ontario K1N 6N5, Canada
- Ottawa Institute of Systems Biology, Ontario K1H 8M5, Canada

11
Wei Y, Wang J, Xia X. Coevolution between Stop Codon Usage and Release Factors in Bacterial Species. Mol Biol Evol 2016; 33:2357-67. [PMID: 27297468 PMCID: PMC4989110 DOI: 10.1093/molbev/msw107] [Citation(s) in RCA: 27] [Impact Index Per Article: 3.4] [Indexed: 12/11/2022] Open
Abstract
Three stop codons in bacteria represent different translation termination signals, and their usage is expected to depend on their differences in translation termination efficiency, mutation bias, and relative abundance of release factors (RF1 decoding UAA and UAG, and RF2 decoding UAA and UGA). In 14 bacterial species (covering Proteobacteria, Firmicutes, Cyanobacteria, Actinobacteria and Spirochetes) with cellular RF1 and RF2 quantified, UAA is consistently over-represented in highly expressed genes (HEGs) relative to lowly expressed genes (LEGs), whereas UGA usage is the opposite even in species where RF2 is far more abundant than RF1. UGA usage relative to UAG increases significantly with P_RF2 [= RF2/(RF1 + RF2)] as expected from adaptation between stop codons and their decoders. P_RF2 is > 0.5 over a wide range of AT content (measured by P_AT3 as the proportion of AT at third codon sites), but decreases rapidly toward zero at the high range of P_AT3. This explains why bacterial lineages with high P_AT3 often have UGA reassigned because of low RF2. There is no indication that UAG is a minor stop codon in bacteria as claimed in a recent publication. The claim is invalid because of the failure to apply the two key criteria in identifying a minor codon: (1) it is least preferred by HEGs (or most preferred by LEGs) and (2) it corresponds to the least abundant decoder. Our results suggest a more plausible explanation for why UAA usage increases, and UGA usage decreases, with P_AT3, but UAG usage remains low over the entire P_AT3 range.
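The two proportions defined in this abstract, RF2/(RF1 + RF2) and the proportion of A or T at third codon sites, can be computed along the following lines. This is a minimal Python sketch under stated assumptions, not the authors' pipeline: the function names `p_rf2` and `p_at3` are hypothetical, the release factor abundances are assumed to be in any common unit, and every complete codon of each sequence (including the stop codon) is counted, which may differ from the published procedure.

```python
from typing import Iterable


def p_rf2(rf1: float, rf2: float) -> float:
    """Proportion of RF2 among the two class I release factors."""
    total = rf1 + rf2
    if total <= 0:
        raise ValueError("RF1 + RF2 must be positive")
    return rf2 / total


def p_at3(coding_sequences: Iterable[str]) -> float:
    """Proportion of A or T (U) at third codon positions.

    Assumption: each sequence starts in frame; trailing bases that do not
    form a complete codon, and ambiguous bases such as N, are ignored.
    """
    at = total = 0
    for cds in coding_sequences:
        seq = cds.upper().replace("U", "T")  # accept RNA or DNA input
        usable = len(seq) - len(seq) % 3     # drop any incomplete last codon
        for base in seq[2:usable:3]:         # every third-position base
            if base in "ACGT":
                total += 1
                if base in "AT":
                    at += 1
    if total == 0:
        raise ValueError("no unambiguous third codon positions found")
    return at / total


# Hypothetical values for illustration only.
print(p_rf2(1.0, 3.0))        # 0.75
print(p_at3(["ATGAAATAA"]))   # third positions are G, A, A
```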
Affiliation(s)
- Yulong Wei
- Department of Biology, University of Ottawa, Ottawa, ON, Canada
- Juan Wang
- Department of Biology, University of Ottawa, Ottawa, ON, Canada
- Xuhua Xia
- Department of Biology, University of Ottawa, Ottawa, ON, Canada
- Ottawa Institute of Systems Biology, Ottawa, ON, Canada

12
Borisov OV, Alvarez M, Carroll JA, Brown PW. Sequence Variants and Sequence Variant Analysis in Biotherapeutic Proteins. ACS SYMPOSIUM SERIES 2015. [DOI: 10.1021/bk-2015-1201.ch002] [Citation(s) in RCA: 12] [Impact Index Per Article: 1.3] [Indexed: 02/02/2023]
Affiliation(s)
- Oleg V. Borisov
- Novavax, Inc., Gaithersburg, Maryland 20878, United States
- Roche Group Member, Genentech, Inc., South San Francisco, California 94080, United States
- Pfizer Worldwide Research & Development, Chesterfield, Missouri 63017, United States
- Melissa Alvarez
- Novavax, Inc., Gaithersburg, Maryland 20878, United States
- Roche Group Member, Genentech, Inc., South San Francisco, California 94080, United States
- Pfizer Worldwide Research & Development, Chesterfield, Missouri 63017, United States
- James A. Carroll
- Novavax, Inc., Gaithersburg, Maryland 20878, United States
- Roche Group Member, Genentech, Inc., South San Francisco, California 94080, United States
- Pfizer Worldwide Research & Development, Chesterfield, Missouri 63017, United States
- Paul W. Brown
- Novavax, Inc., Gaithersburg, Maryland 20878, United States
- Roche Group Member, Genentech, Inc., South San Francisco, California 94080, United States
- Pfizer Worldwide Research & Development, Chesterfield, Missouri 63017, United States

13
Zhang T, Huang Y, Chamberlain S, Romeo T, Zhu-Shimoni J, Hewitt D, Zhu M, Katta V, Mauger B, Kao YH. Identification of a single base-pair mutation of TAA (Stop codon) → GAA (Glu) that causes light chain extension in a CHO cell derived IgG1. MAbs 2012; 4:694-700. [PMID: 23018810 PMCID: PMC3502236 DOI: 10.4161/mabs.22232] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.1] [Indexed: 12/04/2022] Open
Abstract
We describe here the identification of a stop codon TAA (Stop) → GAA (Glu) = Stop221E mutation on the light chain of a recombinant IgG1 antibody expressed in a Chinese hamster ovary (CHO) cell line. The extended light chain variants, which were caused by translation beyond the mutated stop codon to the next alternative in-frame stop codon, were observed by mass spectra analysis. The abnormal peptide peaks present in tryptic and chymotryptic LC–MS peptide mapping were confirmed by N-terminal sequencing as C-terminal light chain extension peptides. Furthermore, LC-MS/MS of Glu-C peptide mapping confirmed the stop221E mutation, which is consistent with a single base-pair mutation in TAA (stop codon) to GAA (Glu). The light chain variants were approximately 13.6% of wild type light chain as estimated by RP-HPLC analysis. DNA sequencing techniques determined a single base pair stop codon mutation, instead of a stop codon read-through, as the cause of this light chain extension. To our knowledge, the stop codon mutation has not been reported for IgGs expressed in CHO cells. These results demonstrate orthogonal techniques should be implemented to characterize recombinant proteins and select appropriate cell lines for production of therapeutic proteins because modifications could occur at unexpected locations.
Affiliation(s)
- Taylor Zhang
- Protein Analytical Chemistry, Genentech, South San Francisco, CA, USA.

14
Yang Y, Strahan A, Li C, Shen A, Liu H, Ouyang J, Katta V, Francissen K, Zhang B. Detecting low level sequence variants in recombinant monoclonal antibodies. MAbs 2010; 2:285-98. [PMID: 20400866 DOI: 10.4161/mabs.2.3.11718] [Citation(s) in RCA: 64] [Impact Index Per Article: 4.6] [Indexed: 11/19/2022] Open
Abstract
A systematic analytical approach combining tryptic and chymotryptic peptide mapping with a Mascot Error Tolerant Search (ETS) has been developed to detect and identify low level protein sequence variants, i.e., amino acid substitutions, in recombinant monoclonal antibodies. The reversed-phase HPLC separation with ultraviolet (UV) detection and mass spectral acquisition parameters of the peptide mapping methods were optimized by using a series of model samples that contained low levels (0.5-5.0%) of recombinant humanized anti-HER2 antibody (rhumAb HER2) along with another unrelated recombinant humanized monoclonal antibody (rhumAb A). This systematic approach's application in protein sequence variant analysis depends upon time and sensitivity constraints. An example of using this approach as a rapid screening assay is described in the first case study. For stable CHO clone selection for an early stage antibody project, comparison of peptide map UV profiles from the top four clone-derived rhumAb B samples quickly detected two sequence variants (M83R at 5% and P274T at 42% protein levels) from two clones among the four. The second case study described in this work demonstrates how this approach can be applied to late stage antibody projects. A sequence variant, L413Q, present at 0.3% relative to the expected sequence of rhumAb C was identified by a Mascot-ETS for one out of four top producers. The incorporation of this systematic sequence variant analysis into clone selection and the peptide mapping procedure described herein have practical applications for the biotechnology industry, including possible detection of polymorphisms in endogenous proteins.
Affiliation(s)
- Yi Yang
- Protein Analytical Chemistry, Genentech, Inc., South San Francisco, CA, USA
|
15
|
González B, Ceciliani F, Galizzi A. Growth at low temperature suppresses readthrough of the UGA stop codon during the expression of Bacillus subtilis flgM gene in Escherichia coli. J Biotechnol 2003; 101:173-80. [PMID: 12568746 DOI: 10.1016/s0168-1656(02)00340-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.7] [Indexed: 10/27/2022]
Abstract
The efficient production of recombinant proteins in Escherichia coli requires proper termination of translation to ensure the synthesis of only the desired product. During the recombinant production of Bacillus subtilis flgM in E. coli, we detected an additional polypeptide with a molecular mass higher than expected, corresponding to a product of translational readthrough of the UGA stop codon. In this paper we show that the readthrough was abolished when the synthesis of the recombinant protein was carried out at 25 °C. Possible causes that reduce the proportion of readthrough protein species relative to the correctly terminated product are discussed.
Affiliation(s)
- Beatriz González
- Laboratory of Bioreactors, Plant Division, Genetic Engineering and Biotechnology Center, PO Box 6162, CP 10600, La Habana, Cuba.
|
16
|
Jacobson FS, Hanson JT, Wong PY, Mulkerrin M, Deveney J, Reilly D, Wong SC. Role of high-performance liquid chromatographic protein analysis in developing fermentation processes for recombinant human growth hormone, relaxin, antibody fragments and lymphotoxin. J Chromatogr A 1997; 763:31-48. [PMID: 9129313 DOI: 10.1016/s0021-9673(96)01010-2] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.2] [Indexed: 02/04/2023]
Abstract
Development of efficient and reliable fermentation processes for protein pharmaceuticals is aided by the availability of accurate quantitative and qualitative product analyses. We have developed a variety of single and dual column chromatographic separations that meet the needs of process development, and we provide examples of how the resulting data have been used to optimize the culture process. For single column methods, reversed-phase chromatography has been the most versatile, permitting the reliable quantitation of many yeast, Chinese hamster ovary (CHO) cell and Escherichia coli-expressed products in the matrix of culture broth or cell extract. Analysis of secreted human growth hormone synthesized in E. coli, along with clipped and unprocessed forms, is discussed. Another reversed-phase assay for direct analysis of a peptide product (B-chain relaxin) and its degradation products secreted into E. coli fermentation medium has allowed the purification of the responsible protease. Cation-exchange chromatography has proven extremely useful for the direct analysis of antibody fragments synthesized in E. coli, allowing the separation and quantitation of the desired Fab' and Fab'2, as well as the unwanted products of glutathione addition and translational read-through. Assay development is often complicated by the presence of host proteins with chromatographic behavior that is similar to that of the product. Commercial instrumentation now permits the facile development of multidimensional chromatographic assays. We show examples of coupled receptor affinity-reversed-phase assays for a mistranslation product and for covalent multimers of E. coli-synthesized lymphotoxin.
Affiliation(s)
- F S Jacobson
- Department of Fermentation and Cell Culture Process Development, Genentech, Inc., South San Francisco, CA 94080, USA
|