1
|
Rorick MM, Baskerville EB, Rask TS, Day KP, Pascual M. Identifying functional groups among the diverse, recombining antigenic var genes of the malaria parasite Plasmodium falciparum from a local community in Ghana. PLoS Comput Biol 2018; 14:e1006174. [PMID: 29897905 PMCID: PMC6016947 DOI: 10.1371/journal.pcbi.1006174] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/28/2017] [Revised: 06/25/2018] [Accepted: 05/03/2018] [Indexed: 11/18/2022] Open
Abstract
A challenge in studying diverse multi-copy gene families is deciphering distinct functional types within immense sequence variation. Functional changes can in some cases be tracked through the evolutionary history of a gene family; however phylogenetic approaches are not possible in cases where gene families diversify primarily by recombination. We take a network theoretical approach to functionally classify the highly recombining var antigenic gene family of the malaria parasite Plasmodium falciparum. We sample var DBLα sequence types from a local population in Ghana, and classify 9,276 of these variants into just 48 functional types. Our approach is to first decompose each sequence type into its constituent, recombining parts; we then use a stochastic block model to identify functional groups among the parts; finally, we classify the sequence types based on which functional groups they contain. This method for functional classification does not rely on an inferred phylogenetic history, nor does it rely on inferring function based on conserved sequence features. Instead, it infers functional similarity among recombining parts based on the sharing of similar co-occurrence interactions with other parts. This method can therefore group sequences that have undetectable sequence homology or even distinct origination. Describing these 48 var functional types allows us to simplify the antigenic diversity within our dataset by over two orders of magnitude. We consider how the var functional types are distributed in isolates, and find a nonrandom pattern reflecting that common var functional types are non-randomly distinct from one another in terms of their functional composition. The coarse-graining of var gene diversity into biologically meaningful functional groups has important implications for understanding the disease ecology and evolution of this system, as well as for designing effective epidemiological monitoring and intervention.
Collapse
Affiliation(s)
- Mary M. Rorick
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, United States of America
- Department of Biology, University of Utah, Salt Lake City, UT, United States of America
| | - Edward B. Baskerville
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, United States of America
| | - Thomas S. Rask
- School of Biosciences, Bio21 Institute, The University of Melbourne, Melbourne, AU
- Department of Microbiology, New York University, New York, NY, United States of America
| | - Karen P. Day
- School of Biosciences, Bio21 Institute, The University of Melbourne, Melbourne, AU
- Department of Microbiology, New York University, New York, NY, United States of America
| | - Mercedes Pascual
- Department of Ecology and Evolution, University of Chicago, Chicago, IL, United States of America
- The Santa Fe Institute, Santa Fe, NM, United States of America
| |
Collapse
|
2
|
Rorick MM, Artzy-Randrup Y, Ruybal-Pesántez S, Tiedje KE, Rask TS, Oduro A, Ghansah A, Koram K, Day KP, Pascual M. Signatures of competition and strain structure within the major blood-stage antigen of Plasmodium falciparum in a local community in Ghana. Ecol Evol 2018; 8:3574-3588. [PMID: 29686839 PMCID: PMC5901166 DOI: 10.1002/ece3.3803] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/13/2017] [Revised: 10/31/2017] [Accepted: 12/06/2017] [Indexed: 11/12/2022] Open
Abstract
The concept of niche partitioning has received considerable theoretical attention at the interface of ecology and evolution of infectious diseases. Strain theory postulates that pathogen populations can be structured into distinct nonoverlapping strains by frequency-dependent selection in response to intraspecific competition for host immune space. The malaria parasite Plasmodium falciparum presents an opportunity to investigate this phenomenon in nature, under conditions of high recombination rate and extensive antigenic diversity. The parasite's major blood-stage antigen, Pf EMP1, is encoded by the hyperdiverse var genes. With a dataset that includes thousands of var DBLα sequence types sampled from asymptomatic cases within an area of high endemicity in Ghana, we address how var diversity is distributed within isolates and compare this to the distribution of microsatellite allelic diversity within isolates to test whether antigenic and neutral regions of the genome are structured differently. With respect to var DBLα sequence types, we find that on average isolates exhibit significantly lower overlap than expected randomly, but that there also exists frequent pairs of isolates that are highly related. Furthermore, the linkage network of var DBLα sequence types reveals a pattern of nonrandom modularity unique to these antigenic genes, and we find that modules of highly linked DBLα types are not explainable by neutral forces related to var recombination constraints, microsatellite diversity, sampling location, host age, or multiplicity of infection. These findings of reduced overlap and modularity among the var antigenic genes are consistent with a role for immune selection as proposed by strain theory. Identifying the evolutionary and ecological dynamics that are responsible for the nonrandom structure in P. falciparum antigenic diversity is important for designing effective intervention in endemic areas.
Collapse
Affiliation(s)
- Mary M Rorick
- Department of Ecology and Evolution University of Chicago Chicago IL USA.,Department of Biology University of Utah Salt Lake City UT USA
| | - Yael Artzy-Randrup
- Theoretical Ecology Group Institute for Biodiversity and Ecosystem Dynamics University of Amsterdam Amsterdam The Netherlands
| | - Shazia Ruybal-Pesántez
- School of Biosciences Bio21 Institute The University of Melbourne Melbourne Vic. Australia.,Department of Microbiology New York University New York NY USA
| | - Kathryn E Tiedje
- School of Biosciences Bio21 Institute The University of Melbourne Melbourne Vic. Australia.,Department of Microbiology New York University New York NY USA
| | - Thomas S Rask
- School of Biosciences Bio21 Institute The University of Melbourne Melbourne Vic. Australia.,Department of Microbiology New York University New York NY USA
| | | | - Anita Ghansah
- Noguchi Memorial Institute for Medical Research University of Ghana Legon Ghana
| | - Kwadwo Koram
- Noguchi Memorial Institute for Medical Research University of Ghana Legon Ghana
| | - Karen P Day
- School of Biosciences Bio21 Institute The University of Melbourne Melbourne Vic. Australia.,Department of Microbiology New York University New York NY USA
| | - Mercedes Pascual
- Department of Ecology and Evolution University of Chicago Chicago IL USA.,The Santa Fe Institute Santa Fe NM USA
| |
Collapse
|
3
|
Ruybal-Pesántez S, Tiedje KE, Rorick MM, Amenga-Etego L, Ghansah A, R. Oduro A, Koram KA, Day KP. Lack of Geospatial Population Structure Yet Significant Linkage Disequilibrium in the Reservoir of Plasmodium falciparum in Bongo District, Ghana. Am J Trop Med Hyg 2017; 97:1180-1189. [PMID: 28722587 PMCID: PMC5637601 DOI: 10.4269/ajtmh.17-0119] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/15/2017] [Accepted: 05/22/2017] [Indexed: 11/07/2022] Open
Abstract
Malaria control in West Africa is impeded by the large reservoir of chronic asymptomatic Plasmodium falciparum infections in the human population. This study aimed to assess the extent of diversity in the P. falciparum reservoir in Bongo District (BD), Ghana, at the end of the dry season, the lowest point in malaria transmission over the course of the year. Analysis of the variation in 12 microsatellite loci was completed for 200 P. falciparum isolates collected from a cross-sectional survey of residents of all ages from two catchment areas in BD. Analysis of the multilocus haplotypes showed high levels of genetic diversity (He = 0.74), no population differentiation yet significant linkage disequilibrium (LD) (ISA = 0.0127, P = 0.006) in BD. Multilocus LD was significant between and within catchment areas even though every haplotype in the population was unique and the majority of individuals (84.0%) harbored multiple-clone infections. The linkage structure among multilocus haplotypes was not associated with sampling location. These data provide the first study with deep sampling of the P. falciparum reservoir in an area of seasonal malaria transmission in West Africa. The co-occurrence of high multiplicity of infection (multiple-clone infections) with significant multilocus LD is surprising given the likelihood of high recombination rates in BD. The results suggest that the linkage structure among multilocus haplotypes has not been shaped by geographic separation of parasite populations. Furthermore, the observed LD levels provide a baseline population genetic metric with putatively neutral markers to evaluate the effects of seasonality and malaria control efforts in BD.
Collapse
Affiliation(s)
- Shazia Ruybal-Pesántez
- School of BioSciences, Bio21 Institute/The University of Melbourne, Melbourne, Australia
- Department of Microbiology, New York University, New York, New York
| | - Kathryn E. Tiedje
- School of BioSciences, Bio21 Institute/The University of Melbourne, Melbourne, Australia
- Department of Microbiology, New York University, New York, New York
| | - Mary M. Rorick
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, Michigan
- Howard Hughes Medical Institute, Ann Arbor, Michigan
- Department of Ecology and Evolution, University of Chicago, Chicago, Illinois
| | | | - Anita Ghansah
- Noguchi Memorial Institute for Medical Research, University of Ghana, Legon, Ghana
| | | | - Kwadwo A. Koram
- Noguchi Memorial Institute for Medical Research, University of Ghana, Legon, Ghana
| | - Karen P. Day
- School of BioSciences, Bio21 Institute/The University of Melbourne, Melbourne, Australia
- Department of Microbiology, New York University, New York, New York
| |
Collapse
|
4
|
Day KP, Artzy-Randrup Y, Tiedje KE, Rougeron V, Chen DS, Rask TS, Rorick MM, Migot-Nabias F, Deloron P, Luty AJF, Pascual M. Evidence of strain structure in Plasmodium falciparum var gene repertoires in children from Gabon, West Africa. Proc Natl Acad Sci U S A 2017; 114:E4103-E4111. [PMID: 28461509 PMCID: PMC5441825 DOI: 10.1073/pnas.1613018114] [Citation(s) in RCA: 30] [Impact Index Per Article: 4.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Existing theory on competition for hosts between pathogen strains has proposed that immune selection can lead to the maintenance of strain structure consisting of discrete, weakly overlapping antigenic repertoires. This prediction of strain theory has conceptual overlap with fundamental ideas in ecology on niche partitioning and limiting similarity between coexisting species in an ecosystem, which oppose the hypothesis of neutral coexistence. For Plasmodium falciparum, strain theory has been specifically proposed in relation to the major surface antigen of the blood stage, known as PfEMP1 and encoded by the multicopy multigene family known as the var genes. Deep sampling of the DBLα domain of var genes in the local population of Bakoumba, West Africa, was completed to define whether patterns of repertoire overlap support a role of immune selection under the opposing force of high outcrossing, a characteristic of areas of intense malaria transmission. Using a 454 high-throughput sequencing protocol, we report extremely high diversity of the DBLα domain and a large parasite population with DBLα repertoires structured into nonrandom patterns of overlap. Such population structure, significant for the high diversity of var genes that compose it at a local level, supports the existence of "strains" characterized by distinct var gene repertoires. Nonneutral, frequency-dependent competition would be at play and could underlie these patterns. With a computational experiment that simulates an intervention similar to mass drug administration, we argue that the observed repertoire structure matters for the antigenic var diversity of the parasite population remaining after intervention.
Collapse
Affiliation(s)
- Karen P Day
- School of Biosciences, The University of Melbourne, Parkville, VIC 3052, Australia;
- Department of Microbiology, New York University, New York, NY 10016
| | - Yael Artzy-Randrup
- Theoretical Ecology Group, Institute for Biodiversity and Ecosystem Dynamics, University of Amsterdam, 1090 GE Amsterdam, The Netherlands
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109
| | - Kathryn E Tiedje
- School of Biosciences, The University of Melbourne, Parkville, VIC 3052, Australia
- Department of Microbiology, New York University, New York, NY 10016
| | - Virginie Rougeron
- Department of Microbiology, New York University, New York, NY 10016
- Laboratoire Maladies Infectieuses et Vecteurs: Ecologie, Génétique, Evolution et Contrôle, UMR 224-5290 CNRS, Institut de Recherche pour le Développement-Université de Montpellier, Centre Institut de Recherche pour le Développement de Montpellier, 34394 Montpellier, France
| | - Donald S Chen
- Department of Microbiology, New York University, New York, NY 10016
- Department of Medicine, New York Medical College, Valhalla, NY 10595
| | - Thomas S Rask
- School of Biosciences, The University of Melbourne, Parkville, VIC 3052, Australia
- Department of Microbiology, New York University, New York, NY 10016
| | - Mary M Rorick
- Department of Ecology and Evolutionary Biology, University of Michigan, Ann Arbor, MI 48109
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637
| | - Florence Migot-Nabias
- Institut de Recherche pour le Développement, UMR 216 Mère et Enfant Face aux Infections Tropicales, 75006 Paris, France
- Communautés d'Universités et Établissements, Sorbonne Paris Cité, Université Paris Descartes, Faculté des Sciences Pharmaceutiques et Biologiques, 75006 Paris, France
| | - Philippe Deloron
- Institut de Recherche pour le Développement, UMR 216 Mère et Enfant Face aux Infections Tropicales, 75006 Paris, France
- Communautés d'Universités et Établissements, Sorbonne Paris Cité, Université Paris Descartes, Faculté des Sciences Pharmaceutiques et Biologiques, 75006 Paris, France
| | - Adrian J F Luty
- Institut de Recherche pour le Développement, UMR 216 Mère et Enfant Face aux Infections Tropicales, 75006 Paris, France
- Communautés d'Universités et Établissements, Sorbonne Paris Cité, Université Paris Descartes, Faculté des Sciences Pharmaceutiques et Biologiques, 75006 Paris, France
| | - Mercedes Pascual
- Department of Ecology and Evolution, University of Chicago, Chicago, IL 60637
- Santa Fe Institute, Santa Fe, NM 87501
| |
Collapse
|
5
|
Rorick MM, Rask TS, Baskerville EB, Day KP, Pascual M. Homology blocks of Plasmodium falciparum var genes and clinically distinct forms of severe malaria in a local population. BMC Microbiol 2013; 13:244. [PMID: 24192078 PMCID: PMC3827005 DOI: 10.1186/1471-2180-13-244] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/18/2013] [Accepted: 10/26/2013] [Indexed: 11/25/2022] Open
Abstract
Background The primary target of the human immune response to the malaria parasite Plasmodium falciparum, P. falciparum erythrocyte membrane protein 1 (PfEMP1), is encoded by the members of the hyper-diverse var gene family. The parasite exhibits antigenic variation via mutually exclusive expression (switching) of the ~60 var genes within its genome. It is thought that different variants exhibit different host endothelial binding preferences that in turn result in different manifestations of disease. Results Var sequences comprise ancient sequence fragments, termed homology blocks (HBs), that recombine at exceedingly high rates. We use HBs to define distinct var types within a local population. We then reanalyze a dataset that contains clinical and var expression data to investigate whether the HBs allow for a description of sequence diversity corresponding to biological function, such that it improves our ability to predict disease phenotype from parasite genetics. We find that even a generic set of HBs, which are defined for a small number of non-local parasites: capture the majority of local sequence diversity; improve our ability to predict disease severity from parasite genetics; and reveal a previously hypothesized yet previously unobserved parasite genetic basis for two forms of severe disease. We find that the expression rates of some HBs correlate more strongly with severe disease phenotypes than the expression rates of classic var DBLα tag types, and principal components of HB expression rate profiles further improve genotype-phenotype models. More specifically, within the large Kenyan dataset that is the focus of this study, we observe that HB expression differs significantly for severe versus mild disease, and for rosetting versus impaired consciousness associated severe disease. The analysis of a second much smaller dataset from Mali suggests that these HB-phenotype associations are consistent across geographically distant populations, since we find evidence suggesting that the same HB-phenotype associations characterize this population as well. Conclusions The distinction between rosetting versus impaired consciousness associated var genes has not been described previously, and it could have important implications for monitoring, intervention and diagnosis. Moreover, our results have the potential to illuminate the molecular mechanisms underlying the complex spectrum of severe disease phenotypes associated with malaria—an important objective given that only about 1% of P. falciparum infections result in severe disease.
Collapse
Affiliation(s)
- Mary M Rorick
- Department of Ecology and Evolutionary Biology, University of Michigan, 2019 Kraus Nat, Sci, Bldg,, 830 North University Ave, Ann Arbor 48109-1048, Michigan, USA.
| | | | | | | | | |
Collapse
|
6
|
Foster DV, Rorick MM, Gesell T, Feeney LM, Foster JG. Dynamic landscapes: a model of context and contingency in evolution. J Theor Biol 2013; 334:162-72. [PMID: 23796530 DOI: 10.1016/j.jtbi.2013.05.030] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2012] [Revised: 05/12/2013] [Accepted: 05/31/2013] [Indexed: 01/09/2023]
Abstract
Although the basic mechanics of evolution have been understood since Darwin, debate continues over whether macroevolutionary phenomena are driven by the fitness structure of genotype space or by ecological interaction. In this paper we propose a simple model capturing key features of fitness-landscape and ecological models of evolution. Our model describes evolutionary dynamics in a high-dimensional, structured genotype space with interspecies interaction. We find promising qualitative similarity with the empirical facts about macroevolution, including broadly distributed extinction sizes and realistic exploration of the genotype space. The abstraction of our model permits numerous applications beyond macroevolution, including protein and RNA evolution.
Collapse
|
7
|
Abstract
By reconstructing how an influenza protein collected in 1968 might have evolved into one collected in 2007, researchers have obtained new insights into the interactions between genetic mutations.
Collapse
Affiliation(s)
- Mary M Rorick
- is at the Department of Ecology and Evolutionary Biology , University of Michigan , Ann Arbor , United States
| | | |
Collapse
|
8
|
Artzy-Randrup Y, Rorick MM, Day K, Chen D, Dobson AP, Pascual M. Population structuring of multi-copy, antigen-encoding genes in Plasmodium falciparum. eLife 2012; 1:e00093. [PMID: 23251784 PMCID: PMC3524794 DOI: 10.7554/elife.00093] [Citation(s) in RCA: 39] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2012] [Accepted: 10/04/2012] [Indexed: 11/21/2022] Open
Abstract
The coexistence of multiple independently circulating strains in pathogen populations that undergo sexual recombination is a central question of epidemiology with profound implications for control. An agent-based model is developed that extends earlier ‘strain theory’ by addressing the var gene family of Plasmodium falciparum. The model explicitly considers the extensive diversity of multi-copy genes that undergo antigenic variation via sequential, mutually exclusive expression. It tracks the dynamics of all unique var repertoires in a population of hosts, and shows that even under high levels of sexual recombination, strain competition mediated through cross-immunity structures the parasite population into a subset of coexisting dominant repertoires of var genes whose degree of antigenic overlap depends on transmission intensity. Empirical comparison of patterns of genetic variation at antigenic and neutral sites supports this role for immune selection in structuring parasite diversity. DOI:http://dx.doi.org/10.7554/eLife.00093.001 Malaria is an infectious disease that is estimated to kill more than half a million people every year, mostly young children in Africa. It is spread by mosquitos that are infected with Plasmodium parasites that attack red blood cells in the human body. Plasmodium falciparum, the species that is responsible for most of these deaths, causes malaria by entering red blood cells and releasing antigens that travel to the surface of the cells, where they change the adhesion properties. This causes the infected red blood cells to accumulate in small blood vessels, surface capillaries or the brain, which can have severe consequences for the person infected. P. falciparum is particularly dangerous because of its ability to vary the antigens displayed on the cell surface: this process, known as antigenic variation, helps to maintain infections for extended periods of time by allowing the antigens to stay one step ahead of the immune system (a process known as immune escape). The origins of antigenic variation lie in the fact that each P. falciparum genome has a repertoire of between 50 and 60 var genes that code for the variability of a major antigen that is responsible for immune escape in malaria. Molecular sequencing has shown that local parasite populations contain thousands of different types of var genes: hence, meiotic recombination in the mosquito can create a vast number of combinations of var repertoires. Artzy-Randrup et al. have developed a computational model of this highly diverse and complex system to address the following question: is a local pathogen population composed of largely random and ephemeral repertoires of these genes, or is it structured into independently circulating strains? Their model goes beyond previous models by including interactions within the local host population that arise as a result of indirect competition between different strains of the pathogen for available hosts: this competition is influenced by the history of infection and, therefore, by the patterns of immunity within the host population. Previous models included within-host processes but not these higher, local population-level interactions. The model simulates the dynamics of all the unique combinations of var genes in a population of hosts, and shows that even with high rates of reproduction, the parasite population self-organizes into a limited number of coexisting strains: the distinct var repertoires of these strains only weakly overlap, suggesting that the immune response of the host population has been partitioned into distinct niches. By investigating genetic variation at both antigenic sites and regions of the genome that do not code for antigens, Artzy-Randrup et al. suggest that immune selection—the selection imposed on var repertoires by the build up of specific immunity at the population level—plays a central role in structuring parasite diversity. The new model should lead to a better understanding of the epidemiology of Plasmodium and other pathogens that work in similar ways, including Trypanosoma brucei (sleeping sickness), Borellia burgdorferi (Lyme disease) and Giardia lamblia (gastroenteritis), and help with global efforts to eliminate malaria and other diseases. DOI:http://dx.doi.org/10.7554/eLife.00093.002
Collapse
Affiliation(s)
- Yael Artzy-Randrup
- Department of Ecology and Evolutionary Biology , Howard Hughes Medical Institute and the University of Michigan , Ann Arbor , United States
| | | | | | | | | | | |
Collapse
|
9
|
Abstract
Theory suggests that biological modularity and robustness allow for maintenance of fitness under mutational change, and when this change is adaptive, for evolvability. Empirical demonstrations that these traits promote evolvability in nature remain scant however. This is in part because modularity, robustness, and evolvability are difficult to define and measure in real biological systems. Here, we address whether structural modularity and/or robustness confer evolvability at the level of proteins by looking for associations between indices of protein structural modularity, structural robustness, and evolvability. We propose a novel index for protein structural modularity: the number of regular secondary structure elements (helices and strands) divided by the number of residues in the structure. We index protein evolvability as the proportion of sites with evidence of being under positive selection multiplied by the average rate of adaptive evolution at these sites, and we measure this as an average over a phylogeny of 25 mammalian species. We use contact density as an index of protein designability, and thus, structural robustness. We find that protein evolvability is positively associated with structural modularity as well as structural robustness and that the effect of structural modularity on evolvability is independent of the structural robustness index. We interpret these associations to be the result of reduced constraints on amino acid substitutions in highly modular and robust protein structures, which results in faster adaptation through natural selection.
Collapse
Affiliation(s)
- Mary M Rorick
- Department of Genetics, Yale University, New Haven, Connecticut, USA.
| | | |
Collapse
|
10
|
Rorick MM, Wagner GP. The origin of conserved protein domains and amino acid repeats via adaptive competition for control over amino acid residues. J Mol Evol 2010; 70:29-43. [PMID: 20024539 PMCID: PMC3368225 DOI: 10.1007/s00239-009-9305-7] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [What about the content of this article? (0)] [Affiliation(s)] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/15/2009] [Accepted: 11/18/2009] [Indexed: 10/20/2022]
Abstract
Some proteins, such as homeodomain transcription factors, contain highly conserved regions of sequence. It has recently been suggested that multiple functional domains overlap in the homeodomain, together explaining this high conservation. However, the question remains why so many functional domains cluster together in one relatively small and constrained region of the protein. Here we have modeled an evolutionary mechanism that can produce this kind of clustering: conserved functional domains are displaced from the parts of the molecule that are undergoing adaptive evolution because novel functions generally out-compete conserved functions for control over the identity of amino acid residues. We call this model COAA, for Competition Over Amino Acids. We also studied the evolution of amino acid repeats (a.k.a. homopeptides), which are especially prevalent in transcription factors. Repeats that are encoded by non-homogenous mixtures of synonymous codons cannot be explained by replication slippage alone. Our model provides two explanations for their origin, maintenance, and over-representation in highly conserved proteins. We demonstrate that either competition between multiple functional domains for space within a sequence, or reuse of a sequence for many functions over time, can cause the evolution of amino acid repeats. Both of these processes are characteristic of multifunctional proteins such as homeodomain transcription factors. We conclude that the COAA model can explain two widely recognized features of transcription factor proteins: conserved domains and a tendency to accumulate homopeptides.
Collapse
Affiliation(s)
- Mary M Rorick
- Department of Ecology and Evolutionary Biology, Yale University, New Haven, CT 06520-8106, USA.
| | | |
Collapse
|