51
|
Spielman SJ. Relative Model Fit Does Not Predict Topological Accuracy in Single-Gene Protein Phylogenetics. Mol Biol Evol 2021; 37:2110-2123. [PMID: 32191313 PMCID: PMC7306691 DOI: 10.1093/molbev/msaa075] [Citation(s) in RCA: 18] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
It is regarded as best practice in phylogenetic reconstruction to perform relative model selection to determine an appropriate evolutionary model for the data. This procedure ranks a set of candidate models according to their goodness of fit to the data, commonly using an information theoretic criterion. Users then specify the best-ranking model for inference. Although it is often assumed that better-fitting models translate to increase accuracy, recent studies have shown that the specific model employed may not substantially affect inferences. We examine whether there is a systematic relationship between relative model fit and topological inference accuracy in protein phylogenetics, using simulations and real sequences. Simulations employed site-heterogeneous mechanistic codon models that are distinct from protein-level phylogenetic inference models, allowing us to investigate how protein models performs when they are misspecified to the data, as will be the case for any real sequence analysis. We broadly find that phylogenies inferred across models with vastly different fits to the data produce highly consistent topologies. We additionally find that all models infer similar proportions of false-positive splits, raising the possibility that all available models of protein evolution are similarly misspecified. Moreover, we find that the parameter-rich GTR (general time reversible) model, whose amino acid exchangeabilities are free parameters, performs similarly to models with fixed exchangeabilities, although the inference precision associated with GTR models was not examined. We conclude that, although relative model selection may not hinder phylogenetic analysis on protein data, it may not offer specific predictable improvements and is not a reliable proxy for accuracy.
Collapse
|
52
|
Mattenberger F, Vila-Nistal M, Geller R. Increased RNA virus population diversity improves adaptability. Sci Rep 2021; 11:6824. [PMID: 33767337 PMCID: PMC7994910 DOI: 10.1038/s41598-021-86375-z] [Citation(s) in RCA: 14] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/30/2020] [Accepted: 03/15/2021] [Indexed: 11/20/2022] Open
Abstract
The replication machinery of most RNA viruses lacks proofreading mechanisms. As a result, RNA virus populations harbor a large amount of genetic diversity that confers them the ability to rapidly adapt to changes in their environment. In this work, we investigate whether further increasing the initial population diversity of a model RNA virus can improve adaptation to a single selection pressure, thermal inactivation. For this, we experimentally increased the diversity of coxsackievirus B3 (CVB3) populations across the capsid region. We then compared the ability of these high diversity CVB3 populations to achieve resistance to thermal inactivation relative to standard CVB3 populations in an experimental evolution setting. We find that viral populations with high diversity are better able to achieve resistance to thermal inactivation at both the temperature employed during experimental evolution as well as at a more extreme temperature. Moreover, we identify mutations in the CVB3 capsid that confer resistance to thermal inactivation, finding significant mutational epistasis. Our results indicate that even naturally diverse RNA virus populations can benefit from experimental augmentation of population diversity for optimal adaptation and support the use of such viral populations in directed evolution efforts that aim to select viruses with desired characteristics.
Collapse
Affiliation(s)
- Florian Mattenberger
- Institute for Integrative Systems Biology, I2SysBio (Universitat de València-CSIC), C. Catedràtic José Beltrán 2, 46980, Paterna, Spain
| | - Marina Vila-Nistal
- Department of Physiology, Genetics and Microbiology, Universidad de Alicante, C. San Vicente del Raspeig s/n, 03690, Alicante, Spain
| | - Ron Geller
- Institute for Integrative Systems Biology, I2SysBio (Universitat de València-CSIC), C. Catedràtic José Beltrán 2, 46980, Paterna, Spain.
| |
Collapse
|
53
|
Mattenberger F, Latorre V, Tirosh O, Stern A, Geller R. Globally defining the effects of mutations in a picornavirus capsid. eLife 2021; 10:64256. [PMID: 33432927 PMCID: PMC7861617 DOI: 10.7554/elife.64256] [Citation(s) in RCA: 11] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/22/2020] [Accepted: 01/11/2021] [Indexed: 02/07/2023] Open
Abstract
The capsids of non-enveloped viruses are highly multimeric and multifunctional protein assemblies that play key roles in viral biology and pathogenesis. Despite their importance, a comprehensive understanding of how mutations affect viral fitness across different structural and functional attributes of the capsid is lacking. To address this limitation, we globally define the effects of mutations across the capsid of a human picornavirus. Using this resource, we identify structural and sequence determinants that accurately predict mutational fitness effects, refine evolutionary analyses, and define the sequence specificity of key capsid-encoded motifs. Furthermore, capitalizing on the derived sequence requirements for capsid-encoded protease cleavage sites, we implement a bioinformatic approach for identifying novel host proteins targeted by viral proteases. Our findings represent the most comprehensive investigation of mutational fitness effects in a picornavirus capsid to date and illuminate important aspects of viral biology, evolution, and host interactions. A virus is made up of genetic material that is encased with a protective protein coat called the capsid. The capsid also helps the virus to infect host cells by binding to the host receptor proteins and releasing its genetic material. Inside the cell, the virus hitchhikes the infected cell’s machinery to grow or replicate its own genetic material. Viral capsids are the main target of the host’s defence system, and therefore, continuously change in an attempt to escape the immune system by introducing alterations (known as mutations) into the genes encoding viral capsid proteins. Mutations occur randomly, and so while some changes to the viral capsid might confer an advantage, others may have no effect at all, or even weaken the virus. To better understand the effect of capsid mutations on the virus’ ability to infect host cells, Mattenberger et al. studied the Coxsackievirus B3, which is linked to heart problems and acute heart failure in humans. The researchers analysed around 90% of possible amino acid mutations (over 14,800 mutations) and correlated each mutation to how it influenced the virus’ ability to replicate in human cells grown in the laboratory. Based on these results, Mattenberger et al. developed a computer model to predict how a particular mutation might affect the virus. The analysis also identified specific amino acid sequences of capsid proteins that are essential for certain tasks, such as building the capsid. It also included an analysis of sequences in the capsid that allow it to be recognized by another viral protein, which cuts the capsid proteins into the right size from a larger precursor. By looking for similar sequences in human genes, the researchers identified several ones that the virus may attack and inactivate to support its own replication. These findings may help identify potential drug targets to develop new antiviral therapies. For example, proteins of the capsid that are less likely to mutate will provide a better target as they lower the possibility of the virus to become resistant to the treatment. They also highlight new proteins in human cells that could potentially block the virus in cells.
Collapse
Affiliation(s)
- Florian Mattenberger
- Institute for Integrative Systems Biology, I2SysBio (Universitat de València-CSIC), Paterna, Spain
| | - Victor Latorre
- Institute for Integrative Systems Biology, I2SysBio (Universitat de València-CSIC), Paterna, Spain
| | - Omer Tirosh
- The Shmunis School of Biomedicine and Cancer Research, Tel-Aviv University, Tel-Aviv, Israel
| | - Adi Stern
- The Shmunis School of Biomedicine and Cancer Research, Tel-Aviv University, Tel-Aviv, Israel
| | - Ron Geller
- Institute for Integrative Systems Biology, I2SysBio (Universitat de València-CSIC), Paterna, Spain
| |
Collapse
|
54
|
Puller V, Sagulenko P, Neher RA. Efficient inference, potential, and limitations of site-specific substitution models. Virus Evol 2020; 6:veaa066. [PMID: 33343922 PMCID: PMC7733610 DOI: 10.1093/ve/veaa066] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022] Open
Abstract
Natural selection imposes a complex filter on which variants persist in a population resulting in evolutionary patterns that vary greatly along the genome. Some sites evolve close to neutrally, while others are highly conserved, allow only specific states, or only change in concert with other sites. On one hand, such constraints on sequence evolution can be to infer biological function, one the other hand they need to be accounted for in phylogenetic reconstruction. Phylogenetic models often account for this complexity by partitioning sites into a small number of discrete classes with different rates and/or state preferences. Appropriate model complexity is typically determined by model selection procedures. Here, we present an efficient algorithm to estimate more complex models that allow for different preferences at every site and explore the accuracy at which such models can be estimated from simulated data. Our iterative approximate maximum likelihood scheme uses information in the data efficiently and accurately estimates site-specific preferences from large data sets with moderately diverged sequences and known topology. However, the joint estimation of site-specific rates, and site-specific preferences, and phylogenetic branch length can suffer from identifiability problems, while ignoring variation in preferences across sites results in branch length underestimates. Site-specific preferences estimated from large HIV pol alignments show qualitative concordance with intra-host estimates of fitness costs. Analysis of these substitution models suggests near saturation of divergence after a few hundred years. Such saturation can explain the inability to infer deep divergence times of HIV and SIVs using molecular clock approaches and time-dependent rate estimates.
Collapse
Affiliation(s)
- Vadim Puller
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 61, Basel, Switzerland
| | - Pavel Sagulenko
- Max Planck Institute for Developmental Biology, Max-Planck-Ring 5, 72076 Tübingen, Germany
| | - Richard A Neher
- Biozentrum, University of Basel, Klingelbergstrasse 50/70, 4056 Basel, Switzerland.,SIB Swiss Institute of Bioinformatics, Klingelbergstrasse 61, Basel, Switzerland
| |
Collapse
|
55
|
Munro D, Singh M. DeMaSk: a deep mutational scanning substitution matrix and its use for variant impact prediction. Bioinformatics 2020; 36:5322-5329. [PMID: 33325500 PMCID: PMC8016454 DOI: 10.1093/bioinformatics/btaa1030] [Citation(s) in RCA: 23] [Impact Index Per Article: 4.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/21/2020] [Revised: 10/16/2020] [Accepted: 11/30/2020] [Indexed: 01/27/2023] Open
Abstract
Motivation Accurately predicting the quantitative impact of a substitution on a protein’s molecular function would be a great aid in understanding the effects of observed genetic variants across populations. While this remains a challenging task, new approaches can leverage data from the increasing numbers of comprehensive deep mutational scanning (DMS) studies that systematically mutate proteins and measure fitness. Results We introduce DeMaSk, an intuitive and interpretable method based only upon DMS datasets and sequence homologs that predicts the impact of missense mutations within any protein. DeMaSk first infers a directional amino acid substitution matrix from DMS datasets and then fits a linear model that combines these substitution scores with measures of per-position evolutionary conservation and variant frequency across homologs. Despite its simplicity, DeMaSk has state-of-the-art performance in predicting the impact of amino acid substitutions, and can easily and rapidly be applied to any protein sequence. Availability and implementation https://demask.princeton.edu generates fitness impact predictions and visualizations for any user-submitted protein sequence. Supplementary information Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Daniel Munro
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, 08544, USA
| | - Mona Singh
- Lewis-Sigler Institute for Integrative Genomics, Princeton University, Princeton, 08544, USA.,Department of Computer Science, Princeton University, Princeton, 08544, USA
| |
Collapse
|
56
|
A facile method of mapping HIV-1 neutralizing epitopes using chemically masked cysteines and deep sequencing. Proc Natl Acad Sci U S A 2020; 117:29584-29594. [PMID: 33168755 DOI: 10.1073/pnas.2010256117] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/18/2022] Open
Abstract
Identification of specific epitopes targeted by neutralizing antibodies is essential to advance epitope-based vaccine design strategies. We report a facile methodology for rapid epitope mapping of neutralizing antibodies (NAbs) against HIV-1 Envelope (Env) at single-residue resolution, using Cys labeling, viral neutralization assays, and deep sequencing. This was achieved by the generation of a library of Cys mutations in Env glycoprotein on the viral surface, covalent labeling of the Cys residues using a Cys-reactive label that masks epitope residues, followed by infection of the labeled mutant virions in mammalian cells in the presence of NAbs. Env gene sequencing from NAb-resistant viruses was used to accurately delineate epitopes for the NAbs VRC01, PGT128, and PGT151. These agreed well with corresponding experimentally determined structural epitopes previously inferred from NAb:Env structures. HIV-1 infection is associated with complex and polyclonal antibody responses, typically composed of multiple antibody specificities. Deconvoluting the epitope specificities in a polyclonal response is a challenging task. We therefore extended our methodology to map multiple specificities of epitopes targeted in polyclonal sera, elicited in immunized animals as well as in an HIV-1-infected elite neutralizer capable of neutralizing tier 3 pseudoviruses with high titers. The method can be readily extended to other viruses for which convenient reverse genetics or lentiviral surface display systems are available.
Collapse
|
57
|
tenOever BR. Synthetic Virology: Building Viruses to Better Understand Them. Cold Spring Harb Perspect Med 2020; 10:cshperspect.a038703. [PMID: 31871242 DOI: 10.1101/cshperspect.a038703] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/26/2022]
Abstract
Generally comprised of less than a dozen components, RNA viruses can be viewed as well-designed genetic circuits optimized to replicate and spread within a given host. Understanding the molecular design that enables this activity not only allows one to disrupt these circuits to study their biology, but it provides a reprogramming framework to achieve novel outputs. Recent advances have enabled a "learning by building" approach to better understand virus biology and create valuable tools. Below is a summary of how modifying the preexisting genetic framework of influenza A virus has been used to track viral movement, understand virus replication, and identify host factors that engage this viral circuitry.
Collapse
Affiliation(s)
- Benjamin R tenOever
- Department of Microbiology, Icahn School of Medicine at Mount Sinai, New York, New York 10029, USA
| |
Collapse
|
58
|
Starr TN, Greaney AJ, Hilton SK, Ellis D, Crawford KHD, Dingens AS, Navarro MJ, Bowen JE, Tortorici MA, Walls AC, King NP, Veesler D, Bloom JD. Deep Mutational Scanning of SARS-CoV-2 Receptor Binding Domain Reveals Constraints on Folding and ACE2 Binding. Cell 2020; 182:1295-1310.e20. [PMID: 32841599 PMCID: PMC7418704 DOI: 10.1016/j.cell.2020.08.012] [Citation(s) in RCA: 1467] [Impact Index Per Article: 293.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/17/2020] [Revised: 07/31/2020] [Accepted: 08/06/2020] [Indexed: 02/07/2023]
Abstract
The receptor binding domain (RBD) of the SARS-CoV-2 spike glycoprotein mediates viral attachment to ACE2 receptor and is a major determinant of host range and a dominant target of neutralizing antibodies. Here, we experimentally measure how all amino acid mutations to the RBD affect expression of folded protein and its affinity for ACE2. Most mutations are deleterious for RBD expression and ACE2 binding, and we identify constrained regions on the RBD's surface that may be desirable targets for vaccines and antibody-based therapeutics. But a substantial number of mutations are well tolerated or even enhance ACE2 binding, including at ACE2 interface residues that vary across SARS-related coronaviruses. However, we find no evidence that these ACE2-affinity-enhancing mutations have been selected in current SARS-CoV-2 pandemic isolates. We present an interactive visualization and open analysis pipeline to facilitate use of our dataset for vaccine design and functional annotation of mutations observed during viral surveillance.
Collapse
Affiliation(s)
- Tyler N Starr
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Allison J Greaney
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Medical Scientist Training Program, University of Washington, Seattle, WA 98195, USA
| | - Sarah K Hilton
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Daniel Ellis
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA; Department of Biochemistry, University of Washington, Seattle, WA 98195, USA; Graduate Program in Molecular and Cellular Biology, University of Washington, Seattle, WA 98195, USA
| | - Katharine H D Crawford
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Medical Scientist Training Program, University of Washington, Seattle, WA 98195, USA
| | - Adam S Dingens
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
| | - Mary Jane Navarro
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - John E Bowen
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | | | - Alexandra C Walls
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Neil P King
- Institute for Protein Design, University of Washington, Seattle, WA 98195, USA; Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - David Veesler
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Jesse D Bloom
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA; Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA; Howard Hughes Medical Institute, Seattle, WA 98109, USA.
| |
Collapse
|
59
|
Starr TN, Greaney AJ, Hilton SK, Crawford KH, Navarro MJ, Bowen JE, Tortorici MA, Walls AC, Veesler D, Bloom JD. Deep mutational scanning of SARS-CoV-2 receptor binding domain reveals constraints on folding and ACE2 binding. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2020:2020.06.17.157982. [PMID: 32587970 PMCID: PMC7310626 DOI: 10.1101/2020.06.17.157982] [Citation(s) in RCA: 27] [Impact Index Per Article: 5.4] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/16/2022]
Abstract
The receptor binding domain (RBD) of the SARS-CoV-2 spike glycoprotein mediates viral attachment to ACE2 receptor, and is a major determinant of host range and a dominant target of neutralizing antibodies. Here we experimentally measure how all amino-acid mutations to the RBD affect expression of folded protein and its affinity for ACE2. Most mutations are deleterious for RBD expression and ACE2 binding, and we identify constrained regions on the RBD's surface that may be desirable targets for vaccines and antibody-based therapeutics. But a substantial number of mutations are well tolerated or even enhance ACE2 binding, including at ACE2 interface residues that vary across SARS-related coronaviruses. However, we find no evidence that these ACE2-affinity enhancing mutations have been selected in current SARS-CoV-2 pandemic isolates. We present an interactive visualization and open analysis pipeline to facilitate use of our dataset for vaccine design and functional annotation of mutations observed during viral surveillance.
Collapse
Affiliation(s)
- Tyler N. Starr
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Co-first authors
| | - Allison J. Greaney
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Medical Scientist Training Program, University of Washington, Seattle, WA 98195, USA
- Co-first authors
| | - Sarah K. Hilton
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
| | - Katharine H.D. Crawford
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Medical Scientist Training Program, University of Washington, Seattle, WA 98195, USA
| | - Mary Jane Navarro
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - John E. Bowen
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | | | - Alexandra C. Walls
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - David Veesler
- Department of Biochemistry, University of Washington, Seattle, WA 98195, USA
| | - Jesse D. Bloom
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA 98109, USA
- Department of Genome Sciences, University of Washington, Seattle, WA 98195, USA
- Howard Hughes Medical Institute, Seattle, WA 98109, USA
- Lead Contact
| |
Collapse
|
60
|
Hasle N, Cooke A, Srivatsan S, Huang H, Stephany JJ, Krieger Z, Jackson D, Tang W, Pendyala S, Monnat RJ, Trapnell C, Hatch EM, Fowler DM. High-throughput, microscope-based sorting to dissect cellular heterogeneity. Mol Syst Biol 2020; 16:e9442. [PMID: 32500953 PMCID: PMC7273721 DOI: 10.15252/msb.20209442] [Citation(s) in RCA: 45] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/07/2020] [Revised: 04/23/2020] [Accepted: 04/29/2020] [Indexed: 12/24/2022] Open
Abstract
Microscopy is a powerful tool for characterizing complex cellular phenotypes, but linking these phenotypes to genotype or RNA expression at scale remains challenging. Here, we present Visual Cell Sorting, a method that physically separates hundreds of thousands of live cells based on their visual phenotype. Automated imaging and phenotypic analysis directs selective illumination of Dendra2, a photoconvertible fluorescent protein expressed in live cells; these photoactivated cells are then isolated using fluorescence-activated cell sorting. First, we use Visual Cell Sorting to assess hundreds of nuclear localization sequence variants in a pooled format, identifying variants that improve nuclear localization and enabling annotation of nuclear localization sequences in thousands of human proteins. Second, we recover cells that retain normal nuclear morphologies after paclitaxel treatment, and then derive their single-cell transcriptomes to identify pathways associated with paclitaxel resistance in cancers. Unlike alternative methods, Visual Cell Sorting depends on inexpensive reagents and commercially available hardware. As such, it can be readily deployed to uncover the relationships between visual cellular phenotypes and internal states, including genotypes and gene expression programs.
Collapse
Affiliation(s)
- Nicholas Hasle
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
| | | | - Sanjay Srivatsan
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
| | - Heather Huang
- Divisions of Basic Sciences and Human BiologyFred Hutchinson Cancer Research CenterSeattleWAUSA
| | - Jason J Stephany
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
| | - Zachary Krieger
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
| | - Dana Jackson
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
| | - Weiliang Tang
- Department of PathologyUniversity of WashingtonSeattleWAUSA
| | - Sriram Pendyala
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
| | - Raymond J Monnat
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
- Department of PathologyUniversity of WashingtonSeattleWAUSA
| | - Cole Trapnell
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
| | - Emily M Hatch
- Divisions of Basic Sciences and Human BiologyFred Hutchinson Cancer Research CenterSeattleWAUSA
| | - Douglas M Fowler
- Department of Genome SciencesUniversity of WashingtonSeattleWAUSA
- Department of BioengineeringUniversity of WashingtonSeattleWAUSA
| |
Collapse
|
61
|
Abstract
Influenza viruses rapidly diversify within individual human infections. Several recent studies have deep-sequenced clinical influenza infections to identify viral variation within hosts, but it remains unclear how within-host mutations fare at the between-host scale. Here, we compare the genetic variation of H3N2 influenza within and between hosts to link viral evolutionary dynamics across scales. Synonymous sites evolve at similar rates at both scales, indicating that global evolution at these putatively neutral sites results from the accumulation of within-host variation. However, nonsynonymous mutations are depleted between hosts compared to within hosts, suggesting that selection purges many of the protein-altering changes that arise within hosts. The exception is at antigenic sites, where selection detectably favors nonsynonymous mutations at the global scale, but not within hosts. These results suggest that selection against deleterious mutations and selection for antigenic change are the main forces that act on within-host variants of influenza virus as they transmit and circulate between hosts.
Collapse
Affiliation(s)
- Katherine S Xue
- Department of Genome Sciences, University of Washington, Foege Building S-250, Box 3550653720 15th Ave NE, Seattle WA 98195-5065, USA.,Basic Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA.,Department of Biology, Stanford University, Stanford, CA, USA
| | - Jesse D Bloom
- Department of Genome Sciences, University of Washington, Foege Building S-250, Box 3550653720 15th Ave NE, Seattle WA 98195-5065, USA.,Basic Sciences Division, Fred Hutchinson Cancer Research Center, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA.,Howard Hughes Medical Institute, 1100 Fairview Ave N, Seattle, WA 98109-1024, USA
| |
Collapse
|
62
|
Simons JF, Lim YW, Carter KP, Wagner EK, Wayham N, Adler AS, Johnson DS. Affinity maturation of antibodies by combinatorial codon mutagenesis versus error-prone PCR. MAbs 2020; 12:1803646. [PMID: 32744131 PMCID: PMC7531523 DOI: 10.1080/19420862.2020.1803646] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/13/2020] [Revised: 06/22/2020] [Accepted: 07/27/2020] [Indexed: 01/08/2023] Open
Abstract
IN VITRO affinity maturation of therapeutic monoclonal antibodies is commonly applied to achieve desired properties, such as improved binding kinetics and affinity. Currently there are no universally accepted protocols for generation of variegated antibody libraries or selection thereof. Here, we performed affinity maturation using a yeast-based single-chain variable fragment (scFv) expression system to compare two mutagenesis methods: random mutagenesis across the entire V(D)J region by error-prone PCR, and a novel combinatorial mutagenesis process limited to the complementarity-determining regions (CDRs). We applied both methods of mutagenesis to four human antibodies against well-known immuno-oncology target proteins. Detailed sequence analysis showed an even mutational distribution across the entire length of the scFv for the error-prone PCR method and an almost exclusive targeting of the CDRs for the combinatorial method. Though there were distinct mutagenesis profiles for each target antibody and mutagenesis method, we found that both methods improved scFv affinity with similar efficiency. When a subset of the affinity-matured antibodies was expressed as full-length immunoglobulin, the measured affinity constants were mostly comparable to those of the respective scFv, but the full-length antibodies were inferior to their scFv counterparts for one of the targets. Furthermore, we found that improved affinity for the full-length antibody did not always translate into enhanced binding to cell-surface expressed antigen or improved immune checkpoint blocking ability, suggesting that screening with full-length antibody or antigen-binding fragment formats might be advantageous and the subject of a future study.
Collapse
|
63
|
A high-affinity human PD-1/PD-L2 complex informs avenues for small-molecule immune checkpoint drug discovery. Proc Natl Acad Sci U S A 2019; 116:24500-24506. [PMID: 31727844 PMCID: PMC6900541 DOI: 10.1073/pnas.1916916116] [Citation(s) in RCA: 45] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Immune checkpoint blockade of programmed death-1 (PD-1) by monoclonal antibody drugs has transformed the treatment of cancer. Small-molecule PD-1 drugs have the potential to offer increased efficacy, safety, and global access. Despite substantial efforts, such small-molecule drugs have been out of reach. We identify a prominent pocket on the ligand-binding surface of human PD-1 that appears to be an attractive small-molecule drug target. The pocket forms when PD-1 is bound to one of its ligands, PD-L2. Our high-resolution crystal structure of the human PD-1/PD-L2 complex facilitates virtual drug-screening efforts and opens additional avenues for the design and discovery of small-molecule PD-1 inhibitors. Our work provides a strategy that may enable discovery of small-molecule inhibitors of other “undruggable” protein–protein interactions. Immune checkpoint blockade of programmed death-1 (PD-1) by monoclonal antibody drugs has delivered breakthroughs in the treatment of cancer. Nonetheless, small-molecule PD-1 inhibitors could lead to increases in treatment efficacy, safety, and global access. While the ligand-binding surface of apo-PD-1 is relatively flat, it harbors a striking pocket in the murine PD-1/PD-L2 structure. An analogous pocket in human PD-1 may serve as a small-molecule drug target, but the structure of the human complex is unknown. Because the CC′ and FG loops in murine PD-1 adopt new conformations upon binding PD-L2, we hypothesized that mutations in these two loops could be coupled to pocket formation and alter PD-1’s affinity for PD-L2. Here, we conducted deep mutational scanning in these loops and used yeast surface display to select for enhanced PD-L2 binding. A PD-1 variant with three substitutions binds PD-L2 with an affinity two orders of magnitude higher than that of the wild-type protein, permitting crystallization of the complex. We determined the X-ray crystal structures of the human triple-mutant PD-1/PD-L2 complex and the apo triple-mutant PD-1 variant at 2.0 Å and 1.2 Å resolution, respectively. Binding of PD-L2 is accompanied by formation of a prominent pocket in human PD-1, as well as substantial conformational changes in the CC′ and FG loops. The structure of the apo triple-mutant PD-1 shows that the CC′ loop adopts the ligand-bound conformation, providing support for allostery between the loop and pocket. This human PD-1/PD-L2 structure provide critical insights for the design and discovery of small-molecule PD-1 inhibitors.
Collapse
|
64
|
Deep Mutational Scanning Comprehensively Maps How Zika Envelope Protein Mutations Affect Viral Growth and Antibody Escape. J Virol 2019; 93:JVI.01291-19. [PMID: 31511387 DOI: 10.1128/jvi.01291-19] [Citation(s) in RCA: 22] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/04/2019] [Accepted: 09/06/2019] [Indexed: 12/11/2022] Open
Abstract
Functional constraints on viral proteins are often assessed by examining sequence conservation among natural strains, but this approach is relatively ineffective for Zika virus because all known sequences are highly similar. Here, we take an alternative approach to map functional constraints on Zika virus's envelope (E) protein by using deep mutational scanning to measure how all amino acid mutations to the E protein affect viral growth in cell culture. The resulting sequence-function map is consistent with existing knowledge about E protein structure and function but also provides insight into mutation-level constraints in many regions of the protein that have not been well characterized in prior functional work. In addition, we extend our approach to completely map how mutations affect viral neutralization by two monoclonal antibodies, thereby precisely defining their functional epitopes. Overall, our study provides a valuable resource for understanding the effects of mutations to this important viral protein and also offers a roadmap for future work to map functional and antigenic selection to Zika virus at high resolution.IMPORTANCE Zika virus has recently been shown to be associated with severe birth defects. The virus's E protein mediates its ability to infect cells and is also the primary target of the antibodies that are elicited by natural infection and vaccines that are being developed against the virus. Therefore, determining the effects of mutations to this protein is important for understanding its function, its susceptibility to vaccine-mediated immunity, and its potential for future evolution. We completely mapped how amino acid mutations to the E protein affected the virus's ability to grow in cells in the laboratory and escape from several antibodies. The resulting maps relate changes in the E protein's sequence to changes in viral function and therefore provide a valuable complement to existing maps of the physical structure of the protein.
Collapse
|
65
|
Esposito D, Weile J, Shendure J, Starita LM, Papenfuss AT, Roth FP, Fowler DM, Rubin AF. MaveDB: an open-source platform to distribute and interpret data from multiplexed assays of variant effect. Genome Biol 2019; 20:223. [PMID: 31679514 PMCID: PMC6827219 DOI: 10.1186/s13059-019-1845-6] [Citation(s) in RCA: 155] [Impact Index Per Article: 25.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/31/2019] [Accepted: 10/01/2019] [Indexed: 11/10/2022] Open
Abstract
Multiplex assays of variant effect (MAVEs), such as deep mutational scans and massively parallel reporter assays, test thousands of sequence variants in a single experiment. Despite the importance of MAVE data for basic and clinical research, there is no standard resource for their discovery and distribution. Here, we present MaveDB ( https://www.mavedb.org ), a public repository for large-scale measurements of sequence variant impact, designed for interoperability with applications to interpret these datasets. We also describe the first such application, MaveVis, which retrieves, visualizes, and contextualizes variant effect maps. Together, the database and applications will empower the community to mine these powerful datasets.
Collapse
Affiliation(s)
- Daniel Esposito
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
| | - Jochen Weile
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
- Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Jay Shendure
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
- Howard Hughes Medical Institute, University of Washington, Seattle, WA, USA
| | - Lea M Starita
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
- Brotman Baty Institute for Precision Medicine, Seattle, WA, USA
| | - Anthony T Papenfuss
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia
- Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia
- Sir Peter MacCallum Department of Oncology, University of Melbourne, Melbourne, VIC, Australia
- Department of Mathematics and Statistics, University of Melbourne, Melbourne, VIC, Australia
| | - Frederick P Roth
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada.
- Lunenfeld-Tanenbaum Research Institute, Sinai Health System, Toronto, ON, Canada.
- Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.
- Department of Computer Science, University of Toronto, Toronto, ON, Canada.
- Canadian Institute for Advanced Research, Toronto, ON, Canada.
| | - Douglas M Fowler
- Department of Genome Sciences, University of Washington, Seattle, WA, USA.
- Canadian Institute for Advanced Research, Toronto, ON, Canada.
- Department of Bioengineering, University of Washington, Seattle, WA, USA.
| | - Alan F Rubin
- Bioinformatics Division, The Walter and Eliza Hall Institute of Medical Research, Parkville, VIC, Australia.
- Department of Medical Biology, University of Melbourne, Melbourne, VIC, Australia.
- Bioinformatics and Cancer Genomics Laboratory, Peter MacCallum Cancer Centre, Melbourne, VIC, Australia.
| |
Collapse
|
66
|
Species-Specific Host-Virus Interactions: Implications for Viral Host Range and Virulence. Trends Microbiol 2019; 28:46-56. [PMID: 31597598 DOI: 10.1016/j.tim.2019.08.007] [Citation(s) in RCA: 84] [Impact Index Per Article: 14.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2019] [Revised: 08/11/2019] [Accepted: 08/19/2019] [Indexed: 01/09/2023]
Abstract
A growing number of studies indicate that host species-specific and virus strain-specific interactions of viral molecules with the host innate immune system play a pivotal role in determining virus host range and virulence. Because interacting proteins are likely constrained in their evolution, mutations that are selected to improve virus replication in one species may, by chance, alter the ability of a viral antagonist to inhibit immune responses in hosts the virus has not yet encountered. Based on recent findings of host-species interactions of poxvirus, herpesvirus, and influenza virus proteins, we propose a model for viral fitness and host range which considers the full interactome between a specific host species and a virus, resulting from the combination of all interactions, positive and negative, that influence whether a virus can productively infect a cell and cause disease in different hosts.
Collapse
|
67
|
Ferrada E. Gene Families, Epistasis and the Amino Acid Preferences of Protein Homologs. Evol Bioinform Online 2019; 15:1176934319870485. [PMID: 31452598 PMCID: PMC6698995 DOI: 10.1177/1176934319870485] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2019] [Accepted: 07/27/2019] [Indexed: 11/16/2022] Open
Abstract
In order to preserve structure and function, proteins tend to preferentially conserve amino acids at particular sites along the sequence. Because mutations can affect structure and function, the question arises whether the preference of a protein site for a particular amino acid varies between protein homologs, and to what extent that variation depends on sequence divergence. Answering these questions can help in the development of models of sequence evolution, as well as provide insights on the dependence of the fitness effects of mutations on the genetic background of sequences, a phenomenon known as epistasis. Here, I comment on recent computational work providing a systematic analysis of the extent to which the amino acid preferences of proteins depend on the background mutations of protein homologs.
Collapse
Affiliation(s)
- Evandro Ferrada
- Center for Genomics and Bioinformatics, Faculty of Science, Universidad Mayor, Santiago, Chile
| |
Collapse
|
68
|
Lee JM, Eguia R, Zost SJ, Choudhary S, Wilson PC, Bedford T, Stevens-Ayers T, Boeckh M, Hurt AC, Lakdawala SS, Hensley SE, Bloom JD. Mapping person-to-person variation in viral mutations that escape polyclonal serum targeting influenza hemagglutinin. eLife 2019; 8:e49324. [PMID: 31452511 PMCID: PMC6711711 DOI: 10.7554/elife.49324] [Citation(s) in RCA: 72] [Impact Index Per Article: 12.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2019] [Accepted: 07/27/2019] [Indexed: 12/11/2022] Open
Abstract
A longstanding question is how influenza virus evolves to escape human immunity, which is polyclonal and can target many distinct epitopes. Here, we map how all amino-acid mutations to influenza's major surface protein affect viral neutralization by polyclonal human sera. The serum of some individuals is so focused that it selects single mutations that reduce viral neutralization by over an order of magnitude. However, different viral mutations escape the sera of different individuals. This individual-to-individual variation in viral escape mutations is not present among ferrets that have been infected just once with a defined viral strain. Our results show how different single mutations help influenza virus escape the immunity of different members of the human population, a phenomenon that could shape viral evolution and disease susceptibility.
Collapse
Affiliation(s)
- Juhye M Lee
- Basic Sciences DivisionFred Hutchinson Cancer Research CenterSeattleUnited States
- Department of Genome SciencesUniversity of WashingtonSeattleUnited States
| | - Rachel Eguia
- Basic Sciences DivisionFred Hutchinson Cancer Research CenterSeattleUnited States
| | - Seth J Zost
- Department of MicrobiologyPerelman School of Medicine, University of PennsylvaniaPhiladelphiaUnited States
| | - Saket Choudhary
- Department of Biological SciencesUniversity of Southern CaliforniaLos AngelesUnited States
| | - Patrick C Wilson
- Department of MedicineSection of Rheumatology, University of ChicagoChicagoUnited States
| | - Trevor Bedford
- Vaccine and Infectious Disease DivisionFred Hutchinson Cancer Research CenterSeattleUnited States
| | - Terry Stevens-Ayers
- Vaccine and Infectious Disease DivisionFred Hutchinson Cancer Research CenterSeattleUnited States
| | - Michael Boeckh
- Vaccine and Infectious Disease DivisionFred Hutchinson Cancer Research CenterSeattleUnited States
| | - Aeron C Hurt
- WHO Collaborating Centre for Reference and Research on Influenza, Peter Doherty Institute for Infection and ImmunityMelbourneAustralia
| | - Seema S Lakdawala
- Department of Microbiology and Molecular GeneticsSchool of Medicine, University of PittsburghPittsburghUnited States
| | - Scott E Hensley
- Department of MicrobiologyPerelman School of Medicine, University of PennsylvaniaPhiladelphiaUnited States
| | - Jesse D Bloom
- Basic Sciences DivisionFred Hutchinson Cancer Research CenterSeattleUnited States
- Department of Genome SciencesUniversity of WashingtonSeattleUnited States
- Howard Hughes Medical InstituteSeattleUnited States
| |
Collapse
|
69
|
Russell AB, Elshina E, Kowalsky JR, Te Velthuis AJW, Bloom JD. Single-Cell Virus Sequencing of Influenza Infections That Trigger Innate Immunity. J Virol 2019; 93:e00500-19. [PMID: 31068418 PMCID: PMC6600203 DOI: 10.1128/jvi.00500-19] [Citation(s) in RCA: 85] [Impact Index Per Article: 14.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2019] [Accepted: 04/29/2019] [Indexed: 12/21/2022] Open
Abstract
Influenza virus-infected cells vary widely in their expression of viral genes and only occasionally activate innate immunity. Here, we develop a new method to assess how the genetic variation in viral populations contributes to this heterogeneity. We do this by determining the transcriptome and full-length sequences of all viral genes in single cells infected with a nominally "pure" stock of influenza virus. Most cells are infected by virions with defects, some of which increase the frequency of innate-immune activation. These immunostimulatory defects are diverse and include mutations that perturb the function of the viral polymerase protein PB1, large internal deletions in viral genes, and failure to express the virus's interferon antagonist NS1. However, immune activation remains stochastic in cells infected by virions with these defects and occasionally is triggered even by virions that express unmutated copies of all genes. Our work shows that the diverse spectrum of defects in influenza virus populations contributes to-but does not completely explain-the heterogeneity in viral gene expression and immune activation in single infected cells.IMPORTANCE Because influenza virus has a high mutation rate, many cells are infected by mutated virions. But so far, it has been impossible to fully characterize the sequence of the virion infecting any given cell, since conventional techniques such as flow cytometry and single-cell transcriptome sequencing (scRNA-seq) only detect if a protein or transcript is present, not its sequence. Here we develop a new approach that uses long-read PacBio sequencing to determine the sequences of virions infecting single cells. We show that viral genetic variation explains some but not all of the cell-to-cell variability in viral gene expression and innate immune induction. Overall, our study provides the first complete picture of how viral mutations affect the course of infection in single cells.
Collapse
Affiliation(s)
- Alistair B Russell
- Basic Sciences and Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Elizaveta Elshina
- Division of Virology, Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| | - Jacob R Kowalsky
- Basic Sciences and Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Aartjan J W Te Velthuis
- Division of Virology, Department of Pathology, University of Cambridge, Cambridge, United Kingdom
| | - Jesse D Bloom
- Basic Sciences and Computational Biology, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Department of Genome Sciences, University of Washington, Seattle, Washington, USA
- Howard Hughes Medical Institute, Seattle, Washington, USA
| |
Collapse
|
70
|
Hom N, Gentles L, Bloom JD, Lee KK. Deep Mutational Scan of the Highly Conserved Influenza A Virus M1 Matrix Protein Reveals Substantial Intrinsic Mutational Tolerance. J Virol 2019; 93:e00161-19. [PMID: 31019050 PMCID: PMC6580950 DOI: 10.1128/jvi.00161-19] [Citation(s) in RCA: 21] [Impact Index Per Article: 3.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2019] [Accepted: 04/09/2019] [Indexed: 12/30/2022] Open
Abstract
Influenza A virus matrix protein M1 is involved in multiple stages of the viral infectious cycle. Despite its functional importance, our present understanding of this essential viral protein is limited. The roles of a small subset of specific amino acids have been reported, but a more comprehensive understanding of the relationship between M1 sequence, structure, and virus fitness remains elusive. In this study, we used deep mutational scanning to measure the effect of every amino acid substitution in M1 on viral replication in cell culture. The map of amino acid mutational tolerance we have generated allows us to identify sites that are functionally constrained in cell culture as well as sites that are less constrained. Several sites that exhibit low tolerance to mutation have been found to be critical for M1 function and production of viable virions. Surprisingly, significant portions of the M1 sequence, especially in the C-terminal domain, whose structure is undetermined, were found to be highly tolerant of amino acid variation, despite having extremely low levels of sequence diversity among natural influenza virus strains. This unexpected discrepancy indicates that not all sites in M1 that exhibit high sequence conservation in nature are under strong constraint during selection for viral replication in cell culture.IMPORTANCE The M1 matrix protein is critical for many stages of the influenza virus infection cycle. Currently, we have an incomplete understanding of this highly conserved protein's function and structure. Key regions of M1, particularly in the C terminus of the protein, remain poorly characterized. In this study, we used deep mutational scanning to determine the extent of M1's tolerance to mutation. Surprisingly, nearly two-thirds of the M1 sequence exhibits a high tolerance for substitutions, contrary to the extremely low sequence diversity observed across naturally occurring M1 isolates. Sites with low mutational tolerance were also identified, suggesting that they likely play critical functional roles and are under selective pressure. These results reveal the intrinsic mutational tolerance throughout M1 and shape future inquiries probing the functions of this essential influenza A virus protein.
Collapse
Affiliation(s)
- Nancy Hom
- Department of Medicinal Chemistry, University of Washington, Seattle, Washington, USA
| | - Lauren Gentles
- Department of Microbiology, University of Washington, Seattle, Washington, USA
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
| | - Jesse D Bloom
- Department of Microbiology, University of Washington, Seattle, Washington, USA
- Division of Basic Sciences, Fred Hutchinson Cancer Research Center, Seattle, Washington, USA
- Howard Hughes Medical Institute, Chevy Chase, Maryland, USA
| | - Kelly K Lee
- Department of Medicinal Chemistry, University of Washington, Seattle, Washington, USA
- Department of Microbiology, University of Washington, Seattle, Washington, USA
| |
Collapse
|
71
|
Qiu X, Duvvuri VR, Bahl J. Computational Approaches and Challenges to Developing Universal Influenza Vaccines. Vaccines (Basel) 2019; 7:E45. [PMID: 31141933 PMCID: PMC6631137 DOI: 10.3390/vaccines7020045] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/10/2019] [Revised: 05/15/2019] [Accepted: 05/23/2019] [Indexed: 12/25/2022] Open
Abstract
The traditional design of effective vaccines for rapidly-evolving pathogens, such as influenza A virus, has failed to provide broad spectrum and long-lasting protection. With low cost whole genome sequencing technology and powerful computing capabilities, novel computational approaches have demonstrated the potential to facilitate the design of a universal influenza vaccine. However, few studies have integrated computational optimization in the design and discovery of new vaccines. Understanding the potential of computational vaccine design is necessary before these approaches can be implemented on a broad scale. This review summarizes some promising computational approaches under current development, including computationally optimized broadly reactive antigens with consensus sequences, phylogenetic model-based ancestral sequence reconstruction, and immunomics to compute conserved cross-reactive T-cell epitopes. Interactions between virus-host-environment determine the evolvability of the influenza population. We propose that with the development of novel technologies that allow the integration of data sources such as protein structural modeling, host antibody repertoire analysis and advanced phylodynamic modeling, computational approaches will be crucial for the development of a long-lasting universal influenza vaccine. Taken together, computational approaches are powerful and promising tools for the development of a universal influenza vaccine with durable and broad protection.
Collapse
Affiliation(s)
- Xueting Qiu
- Center for Ecology of Infectious Diseases, Department of Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA.
| | - Venkata R Duvvuri
- Center for Ecology of Infectious Diseases, Department of Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA.
| | - Justin Bahl
- Center for Ecology of Infectious Diseases, Department of Infectious Diseases, College of Veterinary Medicine, University of Georgia, Athens, GA 30602, USA.
- Department of Epidemiology and Biostatistics, College of Public Health, University of Georgia, Athens, GA 30606, USA.
- Duke-NUS Graduate Medical School, Singapore 169857, Singapore.
| |
Collapse
|
72
|
Characterization of substitutions in the neuraminidase of A(H7N9) influenza viruses selected following serial passage in the presence of different neuraminidase inhibitors. Antiviral Res 2019; 168:68-75. [PMID: 31132385 DOI: 10.1016/j.antiviral.2019.05.009] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/09/2018] [Revised: 05/23/2019] [Accepted: 05/24/2019] [Indexed: 02/06/2023]
Abstract
Avian A(H7N9) infections in humans have been reported in China since 2013 and are of public health concern due to their severity and pandemic potential. Oseltamivir and peramivir are neuraminidase inhibitors (NAIs) routinely used for the treatment of A(H7N9) infections, but variants with reduced sensitivity to these drugs can emerge in patients during treatment. Zanamivir and laninamivir are NAIs that are used less frequently. Herein, we performed in vitro serial passaging experiments with recombinant viruses, containing the neuraminidase (NA) from influenza A/Anhui/1/13 (H7N9) virus, in the presence of each NAI, to determine whether variants with reduced sensitivity would emerge. NA substitutions were characterized for their effect on the NA enzymatic activity and surface expression of the A/Anhui/1/13 (Anhui/1) NA, as well as NAs originating from contemporary A(H7N9) viruses of the Yangtze River Delta and Pearl River Delta lineages. In vitro passage in the presence of oseltamivir, peramivir and laninamivir selected for substitutions associated with reduced sensitivity (E119D, R292K and R152K), whereas passage in the presence of zanamivir did not select for any viruses with reduced sensitivity. All the NA substitutions significantly reduced activity, but not the expression of the Anhui/1 NA. In contemporary N9 NAs, all substitutions tested significantly reduced NA enzyme function in the Yangtze River lineage background, but not in the Pearl River Delta lineage background. Overall, these findings suggest that zanamivir may be less likely than the other NAIs to select for resistance in A(H7N9) viruses and that the impact of substitutions that reduce NAI susceptibility or enzyme function may be less in A(H7N9) viruses from the Pearl River lineage.
Collapse
|
73
|
Kinney JB, McCandlish DM. Massively Parallel Assays and Quantitative Sequence-Function Relationships. Annu Rev Genomics Hum Genet 2019; 20:99-127. [PMID: 31091417 DOI: 10.1146/annurev-genom-083118-014845] [Citation(s) in RCA: 96] [Impact Index Per Article: 16.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Abstract
Over the last decade, a rich variety of massively parallel assays have revolutionized our understanding of how biological sequences encode quantitative molecular phenotypes. These assays include deep mutational scanning, high-throughput SELEX, and massively parallel reporter assays. Here, we review these experimental methods and how the data they produce can be used to quantitatively model sequence-function relationships. In doing so, we touch on a diverse range of topics, including the identification of clinically relevant genomic variants, the modeling of transcription factor binding to DNA, the functional and evolutionary landscapes of proteins, and cis-regulatory mechanisms in both transcription and mRNA splicing. We further describe a unified conceptual framework and a core set of mathematical modeling strategies that studies in these diverse areas can make use of. Finally, we highlight key aspects of experimental design and mathematical modeling that are important for the results of such studies to be interpretable and reproducible.
Collapse
Affiliation(s)
- Justin B Kinney
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA; ,
| | - David M McCandlish
- Simons Center for Quantitative Biology, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York 11724, USA; ,
| |
Collapse
|
74
|
Soh YS, Moncla LH, Eguia R, Bedford T, Bloom JD. Comprehensive mapping of adaptation of the avian influenza polymerase protein PB2 to humans. eLife 2019; 8:45079. [PMID: 31038123 PMCID: PMC6491042 DOI: 10.7554/elife.45079] [Citation(s) in RCA: 37] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/11/2019] [Accepted: 03/31/2019] [Indexed: 12/11/2022] Open
Abstract
Viruses like influenza are infamous for their ability to adapt to new hosts. Retrospective studies of natural zoonoses and passaging in the lab have identified a modest number of host-adaptive mutations. However, it is unclear if these mutations represent all ways that influenza can adapt to a new host. Here we take a prospective approach to this question by completely mapping amino-acid mutations to the avian influenza virus polymerase protein PB2 that enhance growth in human cells. We identify numerous previously uncharacterized human-adaptive mutations. These mutations cluster on PB2’s surface, highlighting potential interfaces with host factors. Some previously uncharacterized adaptive mutations occur in avian-to-human transmission of H7N9 influenza, showing their importance for natural virus evolution. But other adaptive mutations do not occur in nature because they are inaccessible via single-nucleotide mutations. Overall, our work shows how selection at key molecular surfaces combines with evolutionary accessibility to shape viral host adaptation. Viruses copy themselves by hijacking the cells of an infected host, but this comes with some limitations. Cells from different species have different molecular machinery and so viruses often have to specialize to a narrow group of species. This specialization consists largely of fine-tuning the way that viral proteins interact with host proteins. For instance, in bird flu viruses, a protein known as PB2 does not interact well with the machinery in human cells. Because PB2 proteins form part of the viral polymerase (the structure that copies the viral genome), this prevents bird flu viruses from replicating efficiently in humans. Sometimes however, changes in the PB2 protein allow bird flu viruses to better replicate in humans, potentially leading to deadly flu pandemics. To understand exactly how this happens, researchers have previously used two approaches: examining the changes that have happened in past flu viruses, and monitoring the evolution of bird flu viruses grown in human cells in the lab. However, these approaches can only look at a small number of the many possible genetic changes to the virus. This makes it hard to anticipate the new ways that flu might adapt to human cells in the future. To overcome this problem, Soh et al. systematically created all of the single changes to the bird flu PB2, altering every element of the protein sequence one-by-one. They then tested which of the changes to PB2 helped the virus grow better in human cells. The modifications that made the viruses thrive were on the surface of the protein, suggesting that they might improve interaction with the cell machinery of the host. Some changes have been found in bird flu viruses that have recently jumped into humans in nature, although fortunately none of these viruses have yet spread widely to cause a pandemic. Many factors affect the evolution of viruses, and their ability to infect new species. Understanding which changes in proteins help these microbes adapt to new hosts is an important element that scientists could consider to assess future risks of pandemics.
Collapse
Affiliation(s)
- Yq Shirleen Soh
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, United States.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Louise H Moncla
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Rachel Eguia
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Trevor Bedford
- Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Jesse D Bloom
- Basic Sciences Division, Fred Hutchinson Cancer Research Center, Seattle, United States.,Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Howard Hughes Medical Institute, Seattle, United States
| |
Collapse
|
75
|
Determinants of Zika virus host tropism uncovered by deep mutational scanning. Nat Microbiol 2019; 4:876-887. [PMID: 30886357 DOI: 10.1038/s41564-019-0399-4] [Citation(s) in RCA: 43] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2018] [Accepted: 02/01/2019] [Indexed: 01/01/2023]
Abstract
Arboviruses cycle between, and replicate in, both invertebrate and vertebrate hosts, which for Zika virus (ZIKV) involves Aedes mosquitoes and primates1. The viral determinants required for replication in such obligate hosts are under strong purifying selection during natural virus evolution, making it challenging to resolve which determinants are optimal for viral fitness in each host. Herein we describe a deep mutational scanning (DMS) strategy2-5 whereby a viral cDNA library was constructed containing all codon substitutions in the C-terminal 204 amino acids of ZIKV envelope protein (E). The cDNA library was transfected into C6/36 (Aedes) and Vero (primate) cells, with subsequent deep sequencing and computational analyses of recovered viruses showing that substitutions K316Q and S461G, or Q350L and T397S, conferred substantial replicative advantages in mosquito and primate cells, respectively. A 316Q/461G virus was constructed and shown to be replication-defective in mammalian cells due to severely compromised virus particle formation and secretion. The 316Q/461G virus was also highly attenuated in human brain organoids, and illustrated utility as a vaccine in mice. This approach can thus imitate evolutionary selection in a matter of days and identify amino acids key to the regulation of virus replication in specific host environments.
Collapse
|
76
|
Gauthier L, Di Franco R, Serohijos AWR. SodaPop: a forward simulation suite for the evolutionary dynamics of asexual populations on protein fitness landscapes. Bioinformatics 2019; 35:4053-4062. [DOI: 10.1093/bioinformatics/btz175] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2018] [Revised: 01/21/2019] [Accepted: 03/12/2019] [Indexed: 11/14/2022] Open
Abstract
Abstract
Motivation
Protein evolution is determined by forces at multiple levels of biological organization. Random mutations have an immediate effect on the biophysical properties, structure and function of proteins. These same mutations also affect the fitness of the organism. However, the evolutionary fate of mutations, whether they succeed to fixation or are purged, also depends on population size and dynamics. There is an emerging interest, both theoretically and experimentally, to integrate these two factors in protein evolution. Although there are several tools available for simulating protein evolution, most of them focus on either the biophysical or the population-level determinants, but not both. Hence, there is a need for a publicly available computational tool to explore both the effects of protein biophysics and population dynamics on protein evolution.
Results
To address this need, we developed SodaPop, a computational suite to simulate protein evolution in the context of the population dynamics of asexual populations. SodaPop accepts as input several fitness landscapes based on protein biochemistry or other user-defined fitness functions. The user can also provide as input experimental fitness landscapes derived from deep mutational scanning approaches or theoretical landscapes derived from physical force field estimates. Here, we demonstrate the broad utility of SodaPop with different applications describing the interplay of selection for protein properties and population dynamics. SodaPop is designed such that population geneticists can explore the influence of protein biochemistry on patterns of genetic variation, and that biochemists and biophysicists can explore the role of population size and demography on protein evolution.
Availability and implementation
Source code and binaries are freely available at https://github.com/louisgt/SodaPop under the GNU GPLv3 license. The software is implemented in C++ and supported on Linux, Mac OS/X and Windows.
Supplementary information
Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Louis Gauthier
- Département de Biochimie, Université de Montréal, Montréal, QC, Canada
- Centre Robert-Cedergren en Bioinformatique et Génomique, Université de Montréal, Montréal, QC, Canada
| | - Rémicia Di Franco
- Département de Biochimie, Université de Montréal, Montréal, QC, Canada
- Centre Robert-Cedergren en Bioinformatique et Génomique, Université de Montréal, Montréal, QC, Canada
- Enseirb-Matmeca, Bordeaux Institute of Technology, Talence, France
| | - Adrian W R Serohijos
- Département de Biochimie, Université de Montréal, Montréal, QC, Canada
- Centre Robert-Cedergren en Bioinformatique et Génomique, Université de Montréal, Montréal, QC, Canada
| |
Collapse
|
77
|
Ferrada E. The Site-Specific Amino Acid Preferences of Homologous Proteins Depend on Sequence Divergence. Genome Biol Evol 2019; 11:121-135. [PMID: 30496400 PMCID: PMC6326188 DOI: 10.1093/gbe/evy261] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 11/26/2018] [Indexed: 12/20/2022] Open
Abstract
The propensity of protein sites to be occupied by any of the 20 amino acids is known as site-specific amino acid preferences (SSAP). Under the assumption that SSAP are conserved among homologs, they can be used to parameterize evolutionary models for the reconstruction of accurate phylogenetic trees. However, simulations and experimental studies have not been able to fully assess the relative conservation of SSAP as a function of sequence divergence between protein homologs. Here, we implement a computational procedure to predict the SSAP of proteins based on the effect of changes in thermodynamic stability upon mutation. An advantage of this computational approach is that it allows us to interrogate a large and unbiased sample of homologous proteins, over the entire spectrum of sequence divergence, and under selection for the same molecular trait. We show that computational predictions have reproducibilities that resemble those obtained in experimental replicates, and can largely recapitulate the SSAP observed in a large-scale mutagenesis experiment. Our results support recent experimental reports on the conservation of SSAP of related homologs, with a slowly increasing fraction of up to 15% of different sites at sequence distances lower than 40%. However, even under the sole contribution of thermodynamic stability, our conservative approach identifies up to 30% of significant different sites between divergent homologs. We show that this relation holds for homologs of diverse sizes and structural classes. Analyses of residue contact networks suggest that an important determinant of these differences is the increasing accumulation of structural deviations that results from sequence divergence.
Collapse
Affiliation(s)
- Evandro Ferrada
- Center for Genomics and Bioinformatics, Faculty of Science, Universidad Mayor, Camino La Pirámide 5750, Huechuraba, 8580745, Santiago, Chile
| |
Collapse
|
78
|
Hilton SK, Bloom JD. Modeling site-specific amino-acid preferences deepens phylogenetic estimates of viral sequence divergence. Virus Evol 2018; 4:vey033. [PMID: 30425841 PMCID: PMC6220371 DOI: 10.1093/ve/vey033] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023] Open
Abstract
Molecular phylogenetics is often used to estimate the time since the divergence of modern gene sequences. For highly diverged sequences, such phylogenetic techniques sometimes estimate surprisingly recent divergence times. In the case of viruses, independent evidence indicates that the estimates of deep divergence times from molecular phylogenetics are sometimes too recent. This discrepancy is caused in part by inadequate models of purifying selection leading to branch-length underestimation. Here we examine the effect on branch-length estimation of using models that incorporate experimental measurements of purifying selection. We find that models informed by experimentally measured site-specific amino-acid preferences estimate longer deep branches on phylogenies of influenza virus hemagglutinin. This lengthening of branches is due to more realistic stationary states of the models, and is mostly independent of the branch-length extension from modeling site-to-site variation in amino-acid substitution rate. The branch-length extension from experimentally informed site-specific models is similar to that achieved by other approaches that allow the stationary state to vary across sites. However, the improvements from all of these site-specific but time homogeneous and site independent models are limited by the fact that a protein’s amino-acid preferences gradually shift as it evolves. Overall, our work underscores the importance of modeling site-specific amino-acid preferences when estimating deep divergence times—but also shows the inherent limitations of approaches that fail to account for how these preferences shift over time.
Collapse
Affiliation(s)
- Sarah K Hilton
- Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center.,Department of Genome Sciences, University of Washington, USA
| | - Jesse D Bloom
- Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center.,Department of Genome Sciences, University of Washington, USA.,Howard Hughes Medical Institute, Seattle, WA, USA
| |
Collapse
|
79
|
Phillips AM, Ponomarenko AI, Chen K, Ashenberg O, Miao J, McHugh SM, Butty VL, Whittaker CA, Moore CL, Bloom JD, Lin YS, Shoulders MD. Destabilized adaptive influenza variants critical for innate immune system escape are potentiated by host chaperones. PLoS Biol 2018; 16:e3000008. [PMID: 30222731 PMCID: PMC6160216 DOI: 10.1371/journal.pbio.3000008] [Citation(s) in RCA: 29] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/30/2018] [Revised: 09/27/2018] [Accepted: 08/30/2018] [Indexed: 11/24/2022] Open
Abstract
The threat of viral pandemics demands a comprehensive understanding of evolution at the host-pathogen interface. Here, we show that the accessibility of adaptive mutations in influenza nucleoprotein at fever-like temperatures is mediated by host chaperones. Particularly noteworthy, we observe that the Pro283 nucleoprotein variant, which (1) is conserved across human influenza strains, (2) confers resistance to the Myxovirus resistance protein A (MxA) restriction factor, and (3) critically contributed to adaptation to humans in the 1918 pandemic influenza strain, is rendered unfit by heat shock factor 1 inhibition-mediated host chaperone depletion at febrile temperatures. This fitness loss is due to biophysical defects that chaperones are unavailable to address when heat shock factor 1 is inhibited. Thus, influenza subverts host chaperones to uncouple the biophysically deleterious consequences of viral protein variants from the benefits of immune escape. In summary, host proteostasis plays a central role in shaping influenza adaptation, with implications for the evolution of other viruses, for viral host switching, and for antiviral drug development.
Collapse
Affiliation(s)
- Angela M. Phillips
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Anna I. Ponomarenko
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Kenny Chen
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Orr Ashenberg
- Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Jiayuan Miao
- Department of Chemistry, Tufts University, Medford, Massachusetts, United States of America
| | - Sean M. McHugh
- Department of Chemistry, Tufts University, Medford, Massachusetts, United States of America
| | - Vincent L. Butty
- BioMicro Center, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Charles A. Whittaker
- BioMicro Center, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Christopher L. Moore
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| | - Jesse D. Bloom
- Fred Hutchinson Cancer Research Center, Seattle, Washington, United States of America
| | - Yu-Shan Lin
- Department of Chemistry, Tufts University, Medford, Massachusetts, United States of America
| | - Matthew D. Shoulders
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, Massachusetts, United States of America
| |
Collapse
|
80
|
Using the Mutation-Selection Framework to Characterize Selection on Protein Sequences. Genes (Basel) 2018; 9:genes9080409. [PMID: 30104502 PMCID: PMC6115872 DOI: 10.3390/genes9080409] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/19/2018] [Revised: 08/02/2018] [Accepted: 08/09/2018] [Indexed: 12/13/2022] Open
Abstract
When mutational pressure is weak, the generative process of protein evolution involves explicit probabilities of mutations of different types coupled to their conditional probabilities of fixation dependent on selection. Establishing this mechanistic modeling framework for the detection of selection has been a goal in the field of molecular evolution. Building on a mathematical framework proposed more than a decade ago, numerous methods have been introduced in an attempt to detect and measure selection on protein sequences. In this review, we discuss the structure of the original model, subsequent advances, and the series of assumptions that these models operate under.
Collapse
|
81
|
Deep mutational scanning of hemagglutinin helps predict evolutionary fates of human H3N2 influenza variants. Proc Natl Acad Sci U S A 2018; 115:E8276-E8285. [PMID: 30104379 PMCID: PMC6126756 DOI: 10.1073/pnas.1806133115] [Citation(s) in RCA: 124] [Impact Index Per Article: 17.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023] Open
Abstract
A key goal in the study of influenza virus evolution is to forecast which viral strains will persist and which ones will die out. Here we experimentally measure the effects of all amino acid mutations to the hemagglutinin protein from a human H3N2 influenza strain on viral growth in cell culture. We show that these measurements have utility for distinguishing among viral strains that do and do not succeed in nature. Overall, our work suggests that new high-throughput experimental approaches may be useful for understanding virus evolution in nature. Human influenza virus rapidly accumulates mutations in its major surface protein hemagglutinin (HA). The evolutionary success of influenza virus lineages depends on how these mutations affect HA’s functionality and antigenicity. Here we experimentally measure the effects on viral growth in cell culture of all single amino acid mutations to the HA from a recent human H3N2 influenza virus strain. We show that mutations that are measured to be more favorable for viral growth are enriched in evolutionarily successful H3N2 viral lineages relative to mutations that are measured to be less favorable for viral growth. Therefore, despite the well-known caveats about cell-culture measurements of viral fitness, such measurements can still be informative for understanding evolution in nature. We also compare our measurements for H3 HA to similar data previously generated for a distantly related H1 HA and find substantial differences in which amino acids are preferred at many sites. For instance, the H3 HA has less disparity in mutational tolerance between the head and stalk domains than the H1 HA. Overall, our work suggests that experimental measurements of mutational effects can be leveraged to help understand the evolutionary fates of viral lineages in nature—but only when the measurements are made on a viral strain similar to the ones being studied in nature.
Collapse
|
82
|
Lyons DM, Lauring AS. Mutation and Epistasis in Influenza Virus Evolution. Viruses 2018; 10:E407. [PMID: 30081492 PMCID: PMC6115771 DOI: 10.3390/v10080407] [Citation(s) in RCA: 64] [Impact Index Per Article: 9.1] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2018] [Revised: 07/30/2018] [Accepted: 07/30/2018] [Indexed: 12/25/2022] Open
Abstract
Influenza remains a persistent public health challenge, because the rapid evolution of influenza viruses has led to marginal vaccine efficacy, antiviral resistance, and the annual emergence of novel strains. This evolvability is driven, in part, by the virus's capacity to generate diversity through mutation and reassortment. Because many new traits require multiple mutations and mutations are frequently combined by reassortment, epistatic interactions between mutations play an important role in influenza virus evolution. While mutation and epistasis are fundamental to the adaptability of influenza viruses, they also constrain the evolutionary process in important ways. Here, we review recent work on mutational effects and epistasis in influenza viruses.
Collapse
Affiliation(s)
- Daniel M Lyons
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
| | - Adam S Lauring
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109, USA.
- Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI 48109, USA.
- Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI 48109, USA.
| |
Collapse
|
83
|
Multiplexed assays of variant effects contribute to a growing genotype-phenotype atlas. Hum Genet 2018; 137:665-678. [PMID: 30073413 PMCID: PMC6153521 DOI: 10.1007/s00439-018-1916-x] [Citation(s) in RCA: 89] [Impact Index Per Article: 12.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/21/2018] [Accepted: 07/21/2018] [Indexed: 12/12/2022]
Abstract
Given the constantly improving cost and speed of genome sequencing, it is reasonable to expect that personal genomes will soon be known for many millions of humans. This stands in stark contrast with our limited ability to interpret the sequence variants which we find. Although it is, perhaps, easiest to interpret variants in coding regions, knowledge of functional impact is unknown for the vast majority of missense variants. While many computational approaches can predict the impact of coding variants, they are given a little weight in the current guidelines for interpreting clinical variants. Laboratory assays produce comparatively more trustworthy results, but until recently did not scale to the space of all possible mutations. The development of deep mutational scanning and other multiplexed assays of variant effect has now brought feasibility of this endeavour within view. Here, we review progress in this field over the last decade, break down the different approaches into their components, and compare methodological differences.
Collapse
|
84
|
Diefenbacher M, Sun J, Brooke CB. The parts are greater than the whole: the role of semi-infectious particles in influenza A virus biology. Curr Opin Virol 2018; 33:42-46. [PMID: 30053722 DOI: 10.1016/j.coviro.2018.07.002] [Citation(s) in RCA: 24] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/14/2018] [Accepted: 07/05/2018] [Indexed: 12/21/2022]
Abstract
The influenza A virus (IAV) genome is incorporated into newly produced virions through a tightly orchestrated process that is one of the best studied examples of genome packaging by a segmented virus. Despite the remarkable selectivity and efficiency of this process, it is clear that the vast majority of IAV virions are unable to express the full set of essential viral gene products and are thus incapable of productive replication in the absence of complementation. Here, we attempt to reconcile the widespread production of these semi-infectious particles (SIPs) with the high efficiency and selectivity of IAV genome packaging. We also cover what is known and what remains unknown about the consequences of SIP production for the replication and evolution of viral populations.
Collapse
Affiliation(s)
| | - Jiayi Sun
- Department of Microbiology, University of Illinois, Urbana, IL 61801, USA
| | - Christopher B Brooke
- Department of Microbiology, University of Illinois, Urbana, IL 61801, USA; Carl R. Woese Institute for Genomic Biology, University of Illinois, Urbana, IL 61801, USA.
| |
Collapse
|
85
|
Lyons DM, Lauring AS. Evidence for the Selective Basis of Transition-to-Transversion Substitution Bias in Two RNA Viruses. Mol Biol Evol 2018; 34:3205-3215. [PMID: 29029187 PMCID: PMC5850290 DOI: 10.1093/molbev/msx251] [Citation(s) in RCA: 76] [Impact Index Per Article: 10.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/21/2022] Open
Abstract
The substitution rates of transitions are higher than expected by chance relative to those of transversions. Many have argued that selection disfavors transversions, as nonsynonymous transversions are less likely to conserve biochemical properties of the original amino acid. Only recently has it become feasible to directly test this selective hypothesis by comparing the fitness effects of a large number of transition and transversion mutations. For example, a recent study of six viruses and one beta-lactamase gene did not find evidence supporting the selective hypothesis. Here, we analyze the relative fitness effects of transition and transversion mutations from our recently published genome-wide study of mutational fitness effects in influenza virus. In contrast to prior work, we find that transversions are significantly more detrimental than transitions. Using what we believe to be an improved statistical framework, we also identify a similar trend in two HIV data sets. We further demonstrate a fitness difference in transition and transversion mutations using four deep mutational scanning data sets of influenza virus and HIV, which provided adequate statistical power. We find that three of the most commonly cited radical/conservative amino acid categories are predictive of fitness, supporting their utility in studies of positive selection and codon usage bias. We conclude that selection is a major contributor to the transition:transversion substitution bias in viruses and that this effect is only partially explained by the greater likelihood of transversion mutations to cause radical as opposed to conservative amino acid changes.
Collapse
Affiliation(s)
- Daniel M Lyons
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI
| | - Adam S Lauring
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI.,Department of Microbiology and Immunology, University of Michigan, Ann Arbor, MI.,Division of Infectious Diseases, Department of Internal Medicine, University of Michigan, Ann Arbor, MI
| |
Collapse
|
86
|
Multinucleotide mutations cause false inferences of lineage-specific positive selection. Nat Ecol Evol 2018; 2:1280-1288. [PMID: 29967485 PMCID: PMC6093625 DOI: 10.1038/s41559-018-0584-5] [Citation(s) in RCA: 88] [Impact Index Per Article: 12.6] [Reference Citation Analysis] [Abstract] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/26/2017] [Accepted: 05/18/2018] [Indexed: 11/08/2022]
Abstract
Phylogenetic tests of adaptive evolution, such as the widely used branch-site test, assume that nucleotide substitutions occur singly and independently. But recent research has shown that errors at adjacent sites often occur during DNA replication, and the resulting multinucleotide mutations (MNMs) are overwhelmingly likely to be nonsynonymous. We evaluated whether the branch-site test (BST) might misinterpret sequence patterns produced by MNMs as false support for positive selection. We analyzed two genome-scale datasets– one from mammals and one from flies – and found that codons with multiple differences account for virtually all the support for lineage-specific positive selection in the BST. Simulations under conditions derived from these alignments but without positive selection show that realistic rates of MNMs cause a strong and systematic bias towards false inferences of selection. This bias is sufficient under empirically derived conditions to produce false positive inferences as often as the branch-site test infers positive selection from the empirical data. Although some genes with BST-positive results may have evolved adaptively, the test cannot distinguish sequence patterns produced by authentic positive selection from those caused by neutral fixation of MNMs. Many published inferences of adaptive evolution using this technique may therefore be artifacts of model violation caused by unincorporated neutral mutational processes. We introduce a model that incorporates MNMs and may help to ameliorate this bias.
Collapse
|
87
|
Abstract
The evolution of viral pathogens is shaped by strong selective forces that are exerted during jumps to new hosts, confrontations with host immune responses and antiviral drugs, and numerous other processes. However, while undeniably strong and frequent, adaptive evolution is largely confined to small parts of information-packed viral genomes, and the majority of observed variation is effectively neutral. The predictions and implications of the neutral theory have proven immensely useful in this context, with applications spanning understanding within-host population structure, tracing the origins and spread of viral pathogens, predicting evolutionary dynamics, and modeling the emergence of drug resistance. We highlight the multiple ways in which the neutral theory has had an impact, which has been accelerated in the age of high-throughput, high-resolution genomics.
Collapse
Affiliation(s)
- Simon D W Frost
- Department of Veterinary Medicine, University of Cambridge, Cambridge,
United Kingdom
- The Alan Turing Institute, London, United Kingdom
| | - Brittany Rife Magalis
- Institute for Genomics and Evolutionary Medicine, Temple University,
Philadelphia, PA
| | | |
Collapse
|
88
|
Abstract
The deterministic force of natural selection and stochastic influence of drift shape RNA virus evolution. New deep-sequencing and microfluidics technologies allow us to quantify the effect of mutations and trace the evolution of viral populations with single-genome and single-nucleotide resolution. Such experiments can reveal the topography of the genotype-fitness landscapes that shape the path of viral evolution. By combining historical analyses, like phylogenetic approaches, with high-throughput and high-resolution evolutionary experiments, we can observe parallel patterns of evolution that drive important phenotypic transitions. These developments provide a framework for quantifying and anticipating potential evolutionary events. Here, we examine emerging technologies that can map the selective landscapes of viruses, focusing on their application to pathogenic viruses. We identify areas where these technologies can bolster our ability to study the evolution of viruses and to anticipate and possibly intervene in evolutionary events and prevent viral disease.
Collapse
Affiliation(s)
- Patrick T Dolan
- Department of Biology, Stanford University, E200 Clark Center, 318 Campus Drive, Stanford, CA 94305, USA; Department of Microbiology and Immunology, University of California, San Francisco, 600 16th Street, GH-S572, UCSF Box 2280, San Francisco, CA 94143-2280, USA
| | - Zachary J Whitfield
- Department of Microbiology and Immunology, University of California, San Francisco, 600 16th Street, GH-S572, UCSF Box 2280, San Francisco, CA 94143-2280, USA
| | - Raul Andino
- Department of Microbiology and Immunology, University of California, San Francisco, 600 16th Street, GH-S572, UCSF Box 2280, San Francisco, CA 94143-2280, USA.
| |
Collapse
|
89
|
How single mutations affect viral escape from broad and narrow antibodies to H1 influenza hemagglutinin. Nat Commun 2018; 9:1386. [PMID: 29643370 PMCID: PMC5895760 DOI: 10.1038/s41467-018-03665-3] [Citation(s) in RCA: 118] [Impact Index Per Article: 16.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/22/2018] [Accepted: 02/28/2018] [Indexed: 01/19/2023] Open
Abstract
Influenza virus can escape most antibodies with single mutations. However, rare antibodies broadly neutralize many viral strains. It is unclear how easily influenza virus might escape such antibodies if there was strong pressure to do so. Here, we map all single amino-acid mutations that increase resistance to broad antibodies to H1 hemagglutinin. Our approach not only identifies antigenic mutations but also quantifies their effect sizes. All antibodies select mutations, but the effect sizes vary widely. The virus can escape a broad antibody to hemagglutinin's receptor-binding site the same way it escapes narrow strain-specific antibodies: via single mutations with huge effects. In contrast, broad antibodies to hemagglutinin's stalk only select mutations with small effects. Therefore, among the antibodies we examine, breadth is an imperfect indicator of the potential for viral escape via single mutations. Antibodies targeting the H1 hemagglutinin stalk are quantifiably harder to escape than the other antibodies tested here.
Collapse
|
90
|
Haddox HK, Dingens AS, Hilton SK, Overbaugh J, Bloom JD. Mapping mutational effects along the evolutionary landscape of HIV envelope. eLife 2018; 7:34420. [PMID: 29590010 PMCID: PMC5910023 DOI: 10.7554/elife.34420] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2017] [Accepted: 03/15/2018] [Indexed: 01/04/2023] Open
Abstract
The immediate evolutionary space accessible to HIV is largely determined by how single amino acid mutations affect fitness. These mutational effects can shift as the virus evolves. However, the prevalence of such shifts in mutational effects remains unclear. Here, we quantify the effects on viral growth of all amino acid mutations to two HIV envelope (Env) proteins that differ at >100 residues. Most mutations similarly affect both Envs, but the amino acid preferences of a minority of sites have clearly shifted. These shifted sites usually prefer a specific amino acid in one Env, but tolerate many amino acids in the other. Surprisingly, shifts are only slightly enriched at sites that have substituted between the Envs—and many occur at residues that do not even contact substitutions. Therefore, long-range epistasis can unpredictably shift Env’s mutational tolerance during HIV evolution, although the amino acid preferences of most sites are conserved between moderately diverged viral strains. The virus that causes AIDS, or HIV, has a protein called Env on its surface, which is essential for the virus to infect cells. Env can also be recognized by the immune system, which then targets the virus for destruction or blocks it from infecting cells. Unfortunately, Env evolves very quickly, which means that HIV can evade our defenses. However, there are limits to how much this protein can change, since it still needs to perform its essential role in helping viruses enter cells. In the century since HIV first appeared in human populations, the virus has evolved considerably. There are now many HIV strains that infect people, and they bear Env proteins with substantially different sequences. However, it is not clear if these changes in sequence have resulted in Envs from distinct strains being able to tolerate different mutations. To examine this question, Haddox et al. compared how the Envs from two strains of HIV react to modifications in their sequences. They created all possible individual mutations in the proteins, and the resulting collections of mutated viruses were then tested for their ability to infect cells in the laboratory. Most mutations had similar effects in both Env proteins. This allowed Haddox et al. to identify portions of the protein that easily accommodate changes, and portions that must remain unchanged for viruses to remain infectious—at least in the laboratory. Some of these mutations are under different types of pressures when the virus faces the immune system, and those were identified using computational approaches. However, some mutations were tolerated differently by the two Env proteins. Therefore, viral strains differ in how their Env proteins can evolve. The parts of Env that showed differences in mutational tolerance between the strains were not necessarily the parts that differ in sequence. This shows that changes in sequence in one part of the protein can modify how other portions evolve. It remains to be determined whether changes in tolerance to mutations translate into differences in how the virus can escape immunity. This is an important question given that the rapid evolution of Env is a major obstacle to creating a vaccine for HIV.
Collapse
Affiliation(s)
- Hugh K Haddox
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Molecular and Cellular Biology PhD program, University of Washington, Seattle, United States
| | - Adam S Dingens
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Molecular and Cellular Biology PhD program, University of Washington, Seattle, United States
| | - Sarah K Hilton
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Department of Genome Sciences, University of Washington, Seattle, United States
| | - Julie Overbaugh
- Human Biology Division, Fred Hutchinson Cancer Research Center, Seattle, United States.,Epidemiology Program, Fred Hutchinson Cancer Research Center, Seattle, United States
| | - Jesse D Bloom
- Basic Sciences Division and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, United States.,Department of Genome Sciences, University of Washington, Seattle, United States
| |
Collapse
|
91
|
Abstract
The rapid global evolution of influenza virus begins with mutations that arise de novo in individual infections, but little is known about how evolution occurs within hosts. We review recent progress in understanding how and why influenza viruses evolve within human hosts. Advances in deep sequencing make it possible to measure within-host genetic diversity in both acute and chronic influenza infections. Factors like antigenic selection, antiviral treatment, tissue specificity, spatial structure, and multiplicity of infection may affect how influenza viruses evolve within human hosts. Studies of within-host evolution can contribute to our understanding of the evolutionary and epidemiological factors that shape influenza virus's global evolution.
Collapse
Affiliation(s)
- Katherine S Xue
- Department of Genome Sciences, University of Washington, Seattle, WA, USA; Division of Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Louise H Moncla
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Trevor Bedford
- Vaccine and Infectious Disease Division, Fred Hutchinson Cancer Research Center, Seattle, WA, USA
| | - Jesse D Bloom
- Department of Genome Sciences, University of Washington, Seattle, WA, USA; Division of Basic Sciences and Computational Biology Program, Fred Hutchinson Cancer Research Center, Seattle, WA, USA.
| |
Collapse
|
92
|
Russell AB, Trapnell C, Bloom JD. Extreme heterogeneity of influenza virus infection in single cells. eLife 2018; 7:e32303. [PMID: 29451492 PMCID: PMC5826275 DOI: 10.7554/elife.32303] [Citation(s) in RCA: 175] [Impact Index Per Article: 25.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2017] [Accepted: 01/31/2018] [Indexed: 12/13/2022] Open
Abstract
Viral infection can dramatically alter a cell's transcriptome. However, these changes have mostly been studied by bulk measurements on many cells. Here we use single-cell mRNA sequencing to examine the transcriptional consequences of influenza virus infection. We find extremely wide cell-to-cell variation in the productivity of viral transcription - viral transcripts comprise less than a percent of total mRNA in many infected cells, but a few cells derive over half their mRNA from virus. Some infected cells fail to express at least one viral gene, but this gene absence only partially explains variation in viral transcriptional load. Despite variation in viral load, the relative abundances of viral mRNAs are fairly consistent across infected cells. Activation of innate immune pathways is rare, but some cellular genes co-vary in abundance with the amount of viral mRNA. Overall, our results highlight the complexity of viral infection at the level of single cells.
Collapse
Affiliation(s)
- Alistair B Russell
- Basic Sciences Division and Computational Biology ProgramFred Hutchinson Cancer Research CenterSeattleUnited States
| | - Cole Trapnell
- Department of Genome SciencesUniversity of WashingtonSeattleUnited States
| | - Jesse D Bloom
- Basic Sciences Division and Computational Biology ProgramFred Hutchinson Cancer Research CenterSeattleUnited States
- Department of Genome SciencesUniversity of WashingtonSeattleUnited States
| |
Collapse
|
93
|
Wu NC, Xie J, Zheng T, Nycholat CM, Grande G, Paulson JC, Lerner RA, Wilson IA. Diversity of Functionally Permissive Sequences in the Receptor-Binding Site of Influenza Hemagglutinin. Cell Host Microbe 2018; 21:742-753.e8. [PMID: 28618270 DOI: 10.1016/j.chom.2017.05.011] [Citation(s) in RCA: 53] [Impact Index Per Article: 7.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/31/2017] [Revised: 04/24/2017] [Accepted: 05/27/2017] [Indexed: 12/21/2022]
Abstract
Influenza A virus hemagglutinin (HA) initiates viral entry by engaging host receptor sialylated glycans via its receptor-binding site (RBS). The amino acid sequence of the RBS naturally varies across avian and human influenza virus subtypes and is also evolvable. However, functional sequence diversity in the RBS has not been fully explored. Here, we performed a large-scale mutational analysis of the RBS of A/WSN/33 (H1N1) and A/Hong Kong/1/1968 (H3N2) HAs. Many replication-competent mutants not yet observed in nature were identified, including some that could escape from an RBS-targeted broadly neutralizing antibody. This functional sequence diversity is made possible by pervasive epistasis in the RBS 220-loop and can be buffered by avidity in viral receptor binding. Overall, our study reveals that the HA RBS can accommodate a much greater range of sequence diversity than previously thought, which has significant implications for the complex evolutionary interrelationships between receptor specificity and immune escape.
Collapse
Affiliation(s)
- Nicholas C Wu
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Jia Xie
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Tianqing Zheng
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Corwin M Nycholat
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Geramie Grande
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - James C Paulson
- Department of Molecular Medicine, The Scripps Research Institute, La Jolla, CA 92037, USA; Department of Immunology and Microbiology, The Scripps Research Institute, La Jolla, CA 92037, USA
| | - Richard A Lerner
- Department of Chemistry, The Scripps Research Institute, La Jolla, CA 92037, USA.
| | - Ian A Wilson
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037, USA; The Skaggs Institute for Chemical Biology, The Scripps Research Institute, La Jolla, CA 92037, USA.
| |
Collapse
|
94
|
Plesa C, Sidore AM, Lubock NB, Zhang D, Kosuri S. Multiplexed gene synthesis in emulsions for exploring protein functional landscapes. Science 2018; 359:343-347. [PMID: 29301959 PMCID: PMC6261299 DOI: 10.1126/science.aao5167] [Citation(s) in RCA: 80] [Impact Index Per Article: 11.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2017] [Accepted: 12/18/2017] [Indexed: 12/14/2022]
Abstract
Improving our ability to construct and functionally characterize DNA sequences would broadly accelerate progress in biology. Here, we introduce DropSynth, a scalable, low-cost method to build thousands of defined gene-length constructs in a pooled (multiplexed) manner. DropSynth uses a library of barcoded beads that pull down the oligonucleotides necessary for a gene's assembly, which are then processed and assembled in water-in-oil emulsions. We used DropSynth to successfully build more than 7000 synthetic genes that encode phylogenetically diverse homologs of two essential genes in Escherichia coli We tested the ability of phosphopantetheine adenylyltransferase homologs to complement a knockout E. coli strain in multiplex, revealing core functional motifs and reasons underlying homolog incompatibility. DropSynth coupled with multiplexed functional assays allows us to rationally explore sequence-function relationships at an unprecedented scale.
Collapse
Affiliation(s)
- Calin Plesa
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA
| | - Angus M. Sidore
- Department of Chemical and Biomolecular Engineering, University of California, Los Angeles, Los Angeles, California, USA
| | - Nathan B. Lubock
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA
| | - Di Zhang
- Genomics and Computational Biology Graduate Group, Perelman School of Medicine, University of Pennsylvania, Philadelphia, Pennsylvania, USA
| | - Sriram Kosuri
- Department of Chemistry and Biochemistry, University of California, Los Angeles, Los Angeles, California, USA
- UCLA-DOE Institute for Genomics and Proteomics, Molecular Biology Institute, Quantitative and Computational Biology Institute, Eli and Edythe Broad Center of Regenerative Medicine and Stem Cell Research, Jonsson Comprehensive Cancer Center, University of California, Los Angeles, Los Angeles, California, USA
| |
Collapse
|
95
|
Kang Z, Ding W, Jin P, Du G, Chen J. Combinatorial Evolution of DNA with RECODE. Methods Mol Biol 2018; 1772:205-212. [PMID: 29754230 DOI: 10.1007/978-1-4939-7795-6_11] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 06/08/2023]
Abstract
In past decades, DNA engineering protocols have led to the rapid development of synthetic biology. To engineer the natural proteins, many directed evolution methods based on molecular biology have been presented for generating genetic diversity or obtaining specific properties. Here, we provide a simple (PCR operation), efficient (larger amount of products), and powerful (multiple point mutations, deletions, insertions, and combinatorial multipoint mutagenesis) RECODE method, which is capable of reediting the target DNA flexibly to restructure regulatory regions and remodel enzymes by using the combined function of the thermostable DNA polymerase and DNA ligase in one pot. RECODE is expected to be an applicable choice to create diverse mutant libraries for rapid evolution and optimization of enzymes and synthetic pathways.
Collapse
Affiliation(s)
- Zhen Kang
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China.
- Synergetic Innovation Center of Food Safety and Nutrition, Jiangnan University, Wuxi, Jiangsu, China.
- The Key Laboratory of Carbohydrate Chemistry and Biotechnology, Ministry of Education, Jiangnan University, Wuxi, China.
| | - Wenwen Ding
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China
| | - Peng Jin
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China
| | - Guocheng Du
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China
- Synergetic Innovation Center of Food Safety and Nutrition, Jiangnan University, Wuxi, Jiangsu, China
| | - Jian Chen
- The Key Laboratory of Industrial Biotechnology, Ministry of Education, School of Biotechnology, Jiangnan University, Wuxi, China
- Synergetic Innovation Center of Food Safety and Nutrition, Jiangnan University, Wuxi, Jiangsu, China
| |
Collapse
|
96
|
Weile J, Sun S, Cote AG, Knapp J, Verby M, Mellor JC, Wu Y, Pons C, Wong C, van Lieshout N, Yang F, Tasan M, Tan G, Yang S, Fowler DM, Nussbaum R, Bloom JD, Vidal M, Hill DE, Aloy P, Roth FP. A framework for exhaustively mapping functional missense variants. Mol Syst Biol 2017; 13:957. [PMID: 29269382 PMCID: PMC5740498 DOI: 10.15252/msb.20177908] [Citation(s) in RCA: 118] [Impact Index Per Article: 14.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
Although we now routinely sequence human genomes, we can confidently identify only a fraction of the sequence variants that have a functional impact. Here, we developed a deep mutational scanning framework that produces exhaustive maps for human missense variants by combining random codon mutagenesis and multiplexed functional variation assays with computational imputation and refinement. We applied this framework to four proteins corresponding to six human genes: UBE2I (encoding SUMO E2 conjugase), SUMO1 (small ubiquitin‐like modifier), TPK1 (thiamin pyrophosphokinase), and CALM1/2/3 (three genes encoding the protein calmodulin). The resulting maps recapitulate known protein features and confidently identify pathogenic variation. Assays potentially amenable to deep mutational scanning are already available for 57% of human disease genes, suggesting that DMS could ultimately map functional variation for all human disease genes.
Collapse
Affiliation(s)
- Jochen Weile
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Song Sun
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada.,Department of Medical Biochemistry and Microbiology, Uppsala University, Uppsala, Sweden
| | - Atina G Cote
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Jennifer Knapp
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Marta Verby
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Joseph C Mellor
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,SeqWell Inc, Boston, MA, USA
| | - Yingzhou Wu
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Carles Pons
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute for Science and Technology, Barcelona, Catalonia, Spain
| | - Cassandra Wong
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada
| | | | - Fan Yang
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Murat Tasan
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada.,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada
| | - Guihong Tan
- The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada
| | - Shan Yang
- Invitae Corp., San Francisco, CA, USA
| | - Douglas M Fowler
- Department of Genome Sciences, University of Washington, Seattle, WA, USA
| | | | | | - Marc Vidal
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA.,Department of Genetics, Harvard Medical School, Boston, MA, USA
| | - David E Hill
- Center for Cancer Systems Biology (CCSB), Dana-Farber Cancer Institute, Boston, MA, USA
| | - Patrick Aloy
- Institute for Research in Biomedicine (IRB Barcelona), The Barcelona Institute for Science and Technology, Barcelona, Catalonia, Spain.,Institució Catalana de Recerca I Estudis Avançats (ICREA), Barcelona, Catalonia, Spain
| | - Frederick P Roth
- Lunenfeld-Tanenbaum Research Institute, Mount Sinai Hospital, Toronto, ON, Canada .,The Donnelly Centre, University of Toronto, Toronto, ON, Canada.,Department of Molecular Genetics, University of Toronto, Toronto, ON, Canada.,Department of Computer Science, University of Toronto, Toronto, ON, Canada.,Canadian Institute for Advanced Research, Toronto, ON, Canada
| |
Collapse
|
97
|
|
98
|
Abstract
Influenza A virus (IAV) continues to pose an enormous and unpredictable global public health threat, largely due to the continual evolution of escape from preexisting immunity and the potential for zoonotic emergence. Understanding how the unique genetic makeup and structure of IAV populations influences their transmission and evolution is essential for developing more-effective vaccines, therapeutics, and surveillance capabilities. Owing to their mutation-prone replicase and unique genome organization, IAV populations exhibit enormous amounts of diversity both in terms of sequence and functional gene content. Here, I review what is currently known about the genetic and genomic diversity present within IAV populations and how this diversity may shape the replicative and evolutionary dynamics of these viruses.
Collapse
|
99
|
Phillips AM, Gonzalez LO, Nekongo EE, Ponomarenko AI, McHugh SM, Butty VL, Levine SS, Lin YS, Mirny LA, Shoulders MD. Host proteostasis modulates influenza evolution. eLife 2017; 6. [PMID: 28949290 PMCID: PMC5614556 DOI: 10.7554/elife.28652] [Citation(s) in RCA: 33] [Impact Index Per Article: 4.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/17/2017] [Accepted: 08/18/2017] [Indexed: 01/02/2023] Open
Abstract
Predicting and constraining RNA virus evolution require understanding the molecular factors that define the mutational landscape accessible to these pathogens. RNA viruses typically have high mutation rates, resulting in frequent production of protein variants with compromised biophysical properties. Their evolution is necessarily constrained by the consequent challenge to protein folding and function. We hypothesized that host proteostasis mechanisms may be significant determinants of the fitness of viral protein variants, serving as a critical force shaping viral evolution. Here, we test that hypothesis by propagating influenza in host cells displaying chemically-controlled, divergent proteostasis environments. We find that both the nature of selection on the influenza genome and the accessibility of specific mutational trajectories are significantly impacted by host proteostasis. These findings provide new insights into features of host–pathogen interactions that shape viral evolution, and into the potential design of host proteostasis-targeted antiviral therapeutics that are refractory to resistance. Influenza viruses, commonly called flu, can evade our immune system and develop resistance to treatments by changing frequently. Specifically, mutations in their genome cause influenza proteins to change in ways that can help the virus evade our defences. However, these mutations come at a cost and can prevent the viral proteins from forming functional and stable three-dimensional shapes – a process known as protein folding – thereby hampering the virus’ ability to replicate. In human cells, proteins called chaperones can help our other proteins fold properly. Influenza viruses do not have their own chaperones and, instead, hijack those of their host. Host chaperones are therefore crucial to the virus’ ability to replicate. However, until now, it was not known if host chaperones can influence how these viruses evolve. Here, Phillips et al. used mammalian cells to study how host chaperones affect an evolving influenza population. First, cells were engineered to either have normal chaperone levels, elevated chaperone levels, or inactive chaperones. Next, the H3N2 influenza strain was grown in these different conditions for nearly 200 generations and sequenced to determine how the virus evolved in each distinctive host chaperone environment. Phillips et al. discovered that host chaperones affect the rate at which mutations accumulate in the influenza population, and also the types of mutations in the influenza genome. For instance, when a chaperone called Hsp90 was inactivated, mutations became prevalent in the viral population more slowly than in cells with normal or elevated chaperone levels. Moreover, some specific mutations fared better in cells with high chaperone levels, whilst others worked better in cells with inactivated chaperones. These results suggest that influenza evolution is affected by host chaperone levels in complex and important ways. Moreover, whether chaperones will promote or hinder the effects of any single mutation is difficult to predict ahead of time. This discovery is significant, as the chaperones available to influenza can vary in different tissues, organisms and infectious conditions, and may therefore influence the virus' ability to change and evolve in a context-specific manner. The findings are likely to extend to other viruses such as HIV and Ebola, which also hijack host chaperones for the same purpose. More work is now needed to systematically quantify these effects so that we can better predict how specific chaperones will affect the ability of viruses to adapt, especially in pathologically relevant conditions like fever or viral host-switching. In the future, such insights could help shape the design of treatments to which viruses do not evolve resistance.
Collapse
Affiliation(s)
- Angela M Phillips
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, United States
| | - Luna O Gonzalez
- Department of Mathematics, Massachusetts Institute of Technology, Cambridge, United States
| | - Emmanuel E Nekongo
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, United States
| | - Anna I Ponomarenko
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, United States
| | - Sean M McHugh
- Department of Chemistry, Tufts University, Medford, United States
| | - Vincent L Butty
- BioMicro Center, Massachusetts Institute of Technology, Cambridge, United States
| | - Stuart S Levine
- BioMicro Center, Massachusetts Institute of Technology, Cambridge, United States
| | - Yu-Shan Lin
- Department of Chemistry, Tufts University, Medford, United States
| | - Leonid A Mirny
- Department of Physics, Massachusetts Institute of Technology, Cambridge, United States.,Institute for Medical Engineering and Science, Massachusetts Institute of Technology, Cambridge, United States
| | - Matthew D Shoulders
- Department of Chemistry, Massachusetts Institute of Technology, Cambridge, United States
| |
Collapse
|
100
|
Epistatic Interactions within the Influenza A Virus Polymerase Complex Mediate Mutagen Resistance and Replication Fidelity. mSphere 2017; 2:mSphere00323-17. [PMID: 28815216 PMCID: PMC5557677 DOI: 10.1128/msphere.00323-17] [Citation(s) in RCA: 9] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/19/2017] [Accepted: 07/25/2017] [Indexed: 01/07/2023] Open
Abstract
RNA viruses exist as genetically diverse populations. This standing genetic diversity gives them the potential to adapt rapidly, evolve resistance to antiviral therapeutics, and evade immune responses. Viral mutants with altered mutation rates or mutational tolerance have provided insights into how genetic diversity arises and how it affects the behavior of RNA viruses. To this end, we identified variants within the polymerase complex of influenza virus that are able tolerate drug-mediated increases in viral mutation rates. We find that drug resistance is highly dependent on interactions among mutations in the polymerase complex. In contrast to other viruses, influenza virus counters the effect of higher mutation rates primarily by maintaining high levels of genome replication. These findings suggest the importance of maintaining large population sizes for viruses with high mutation rates and show that multiple proteins can affect both mutation rate and genome synthesis. Lethal mutagenesis is a broad-spectrum antiviral strategy that employs mutagenic nucleoside analogs to exploit the high mutation rate and low mutational tolerance of many RNA viruses. Studies of mutagen-resistant viruses have identified determinants of replicative fidelity and the importance of mutation rate to viral population dynamics. We have previously demonstrated the effective lethal mutagenesis of influenza A virus using three nucleoside analogs as well as the virus’s high genetic barrier to mutagen resistance. Here, we investigate the mutagen-resistant phenotypes of mutations that were enriched in drug-treated populations. We find that PB1 T123A has higher replicative fitness than the wild type, PR8, and maintains its level of genome production during 5-fluorouracil (2,4-dihydroxy-5-fluoropyrimidine) treatment. Surprisingly, this mutagen-resistant variant also has an increased baseline rate of C-to-U and G-to-A mutations. A second drug-selected mutation, PA T97I, interacts epistatically with PB1 T123A to mediate high-level mutagen resistance, predominantly by limiting the inhibitory effect of nucleosides on polymerase activity. Consistent with the importance of epistatic interactions in the influenza virus polymerase, our data suggest that nucleoside analog resistance and replication fidelity are strain dependent. Two previously identified ribavirin {1-[(2R,3R,4S,5R)-3,4-dihydroxy-5-(hydroxymethyl)oxolan-2-yl]-1H-1,2,4-triazole-3-carboxamide} resistance mutations, PB1 V43I and PB1 D27N, do not confer drug resistance in the PR8 background, and the PR8-PB1 V43I polymerase exhibits a normal baseline mutation rate. Our results highlight the genetic complexity of the influenza A virus polymerase and demonstrate that increased replicative capacity is a mechanism by which an RNA virus can counter the negative effects of elevated mutation rates. IMPORTANCE RNA viruses exist as genetically diverse populations. This standing genetic diversity gives them the potential to adapt rapidly, evolve resistance to antiviral therapeutics, and evade immune responses. Viral mutants with altered mutation rates or mutational tolerance have provided insights into how genetic diversity arises and how it affects the behavior of RNA viruses. To this end, we identified variants within the polymerase complex of influenza virus that are able to tolerate drug-mediated increases in viral mutation rates. We find that drug resistance is highly dependent on interactions among mutations in the polymerase complex. In contrast to other viruses, influenza virus counters the effect of higher mutation rates primarily by maintaining high levels of genome replication. These findings suggest the importance of maintaining large population sizes for viruses with high mutation rates and show that multiple proteins can affect both mutation rate and genome synthesis.
Collapse
|