1
|
Gu X, Li L, Li S, Shi W, Zhong X, Su Y, Wang T. Adaptive evolution and co-evolution of chloroplast genomes in Pteridaceae species occupying different habitats: overlapping residues are always highly mutated. BMC PLANT BIOLOGY 2023; 23:511. [PMID: 37880608 PMCID: PMC10598918 DOI: 10.1186/s12870-023-04523-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/11/2023] [Accepted: 10/13/2023] [Indexed: 10/27/2023]
Abstract
BACKGROUND The evolution of protein residues depends on the mutation rates of their encoding nucleotides, but it may also be affected by co-evolution with other residues. Chloroplasts function as environmental sensors, transforming fluctuating environmental signals into different physiological responses. We reasoned that habitat diversity may affect their rate and mode of evolution, which might be evidenced in the chloroplast genome. The Pteridaceae family of ferns occupy an unusually broad range of ecological niches, which provides an ideal system for analysis. RESULTS We conducted adaptive evolution and intra-molecular co-evolution analyses of Pteridaceae chloroplast DNAs (cpDNAs). The results indicate that the residues undergoing adaptive evolution and co-evolution were mostly independent, with only a few residues being simultaneously involved in both processes, and these overlapping residues tend to exhibit high mutations. Additionally, our data showed that Pteridaceae chloroplast genes are under purifying selection. Regardless of whether we grouped species by lineage (which corresponded with ecological niches), we determined that positively selected residues mainly target photosynthetic genes. CONCLUSIONS Our work provides evidence for the adaptive evolution of Pteridaceae cpDNAs, especially photosynthetic genes, to different habitats and sheds light on the adaptive evolution and co-evolution of proteins.
Collapse
Affiliation(s)
- Xiaolin Gu
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Lingling Li
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Sicong Li
- College of Natural Resources and Environment, South China Agricultural University, Guangzhou, 510642, China
| | - Wanxin Shi
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Xiaona Zhong
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China
| | - Yingjuan Su
- School of Life Sciences, Sun Yat-sen University, Guangzhou, 510275, China.
- Research Institute of Sun Yat-sen University in Shenzhen, Shenzhen, 518057, China.
| | - Ting Wang
- College of Life Sciences, South China Agricultural University, Guangzhou, 510642, China.
| |
Collapse
|
2
|
Kozieł E, Otulak-Kozieł K, Bujarski JJ. Ultrastructural Analysis of Prune DwarfVirus Intercellular Transport and Pathogenesis. Int J Mol Sci 2018; 19:E2570. [PMID: 30158483 PMCID: PMC6163902 DOI: 10.3390/ijms19092570] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/05/2018] [Revised: 08/17/2018] [Accepted: 08/28/2018] [Indexed: 12/25/2022] Open
Abstract
Prune dwarf virus (PDV) is an important viral pathogen of plum, sweet cherry, peach, and many herbaceous test plants. Although PDV has been intensively investigated, mainly in the context of phylogenetic relationship of its genes and proteins, many gaps exist in our knowledge about the mechanism of intercellular transport of this virus. The aim of this work was to investigate alterations in cellular organelles and the cell-to-cell transport of PDV in Cucumis sativus cv. Polan at ultrastructural level. To analyze the role of viral proteins in local transport, double-immunogold assays were applied to localize PDV coat protein (CP) and movement protein (MP). We observe structural changes in chloroplasts, mitochondria, and cellular membranes. We prove that PDV is transported as viral particles via MP-generated tubular structures through plasmodesmata. Moreover, the computer-run 3D modeling reveals structural resemblances between MPs of PDV and of Alfalfa mosaic virus (AMV), implying similarities of transport mechanisms for both viruses.
Collapse
Affiliation(s)
- Edmund Kozieł
- Faculty of Agriculture and Biology, Department of Botany, Warsaw University of Life Sciences-SGGW, Nowoursynowska Street 159, 02-776 Warsaw, Poland.
| | - Katarzyna Otulak-Kozieł
- Faculty of Agriculture and Biology, Department of Botany, Warsaw University of Life Sciences-SGGW, Nowoursynowska Street 159, 02-776 Warsaw, Poland.
| | - Józef J Bujarski
- Department of Biological Sciences, Northern Illinois University, DeKalb, IL 60115, USA.
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznań, Poland.
| |
Collapse
|
3
|
Kozieł E, Bujarski JJ, Otulak K. Molecular Biology of Prune Dwarf Virus-A Lesser Known Member of the Bromoviridae but a Vital Component in the Dynamic Virus-Host Cell Interaction Network. Int J Mol Sci 2017; 18:E2733. [PMID: 29258199 PMCID: PMC5751334 DOI: 10.3390/ijms18122733] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/17/2017] [Revised: 11/21/2017] [Accepted: 12/13/2017] [Indexed: 12/29/2022] Open
Abstract
Prune dwarf virus (PDV) is one of the members of Bromoviridae family, genus Ilarvirus. Host components that participate in the regulation of viral replication or cell-to-cell movement via plasmodesmata are still unknown. In contrast, viral infections caused by some other Bromoviridae members are well characterized. Bromoviridae can be distinguished based on localization of their replication process in infected cells, cell-to-cell movement mechanisms, and plant-specific response reactions. Depending upon the genus, "genome activation" and viral replication are linked to various membranous structures ranging from endoplasmic reticulum, to tonoplast. In the case of PDV, there is still no evidence of natural resistance sources in the host plants susceptible to virus infection. Apparently, PDV has a great ability to overcome the natural defense responses in a wide spectrum of plant hosts. The first manifestations of PDV infection are specific cell membrane alterations, and the formation of replicase complexes that support PDV RNA replication inside the spherules. During each stage of its life cycle, the virus uses cell components to replicate and to spread in whole plants, within the largely suppressed cellular immunity environment. This work presents the above stages of the PDV life cycle in the context of current knowledge about other Bromoviridae members.
Collapse
Affiliation(s)
- Edmund Kozieł
- Department of Botany, Faculty of Agriculture and Biology, Warsaw University of Life Sciences-SGGW, Nowoursynowska Street 159, 02-776 Warsaw, Poland.
| | - Józef J Bujarski
- Department of Biological Sciences, Northern Illinois University, DeKalb, IL 60115, USA.
- Institute of Bioorganic Chemistry, Polish Academy of Sciences, Noskowskiego 12/14, 61-704 Poznań, Poland.
| | - Katarzyna Otulak
- Department of Botany, Faculty of Agriculture and Biology, Warsaw University of Life Sciences-SGGW, Nowoursynowska Street 159, 02-776 Warsaw, Poland.
| |
Collapse
|
4
|
Analysis of human rotaviruses from a single location over an 18-year time span suggests that protein coadaption influences gene constellations. J Virol 2014; 88:9842-63. [PMID: 24942570 DOI: 10.1128/jvi.01562-14] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/26/2023] Open
Abstract
UNLABELLED Rotaviruses (RVs) are 11-segmented, double-stranded RNA viruses that cause severe gastroenteritis in children. In addition to an error-prone genome replication mechanism, RVs can increase their genetic diversity by reassorting genes during host coinfection. Such exchanges allow RVs to acquire advantageous genes and adapt in the face of selective pressures. However, reassortment may also impose fitness costs if it unlinks genes/proteins that have accumulated compensatory, coadaptive mutations and that operate best when kept together. To better understand human RV evolutionary dynamics, we analyzed the genome sequences of 135 strains (genotype G1/G3/G4-P[8]-I1-C1-R1-A1-N1-T1-E1-H1) that were collected at a single location in Washington, DC, during the years 1974 to 1991. Intragenotypic phylogenetic trees were constructed for each viral gene using the nucleotide sequences, thereby defining novel allele level gene constellations (GCs) and illuminating putative reassortment events. The results showed that RVs with distinct GCs cocirculated during the vast majority of the collection years and that some of these GCs persisted in the community unchanged by reassortment. To investigate the influence of protein coadaptation on GC maintenance, we performed a mutual information-based analysis of the concatenated amino acid sequences and identified an extensive covariance network. Unexpectedly, amino acid covariation was highest between VP4 and VP2, which are structural components of the RV virion that are not thought to directly interact. These results suggest that GCs may be influenced by the selective constraints placed on functionally coadapted, albeit noninteracting, viral proteins. This work raises important questions about mutation-reassortment interplay and its impact on human RV evolution. IMPORTANCE Rotaviruses are devastating human pathogens that cause severe diarrhea and kill >450,000 children each year. The virus can evolve by accumulating mutations and by acquiring new genes from other strains via a process called reassortment. However, little is known about the relationship between mutation accumulation and gene reassortment for rotaviruses and how it impacts viral evolution. In this study, we analyzed the genome sequences of human strains found in clinical fecal specimens that were collected at a single hospital over an 18-year time span. We found that many rotaviruses did not reassort their genes but instead maintained them as specific sets (i.e., constellations). By analyzing the encoded proteins, we discovered concurrent amino acid changes among them, which suggests that they are functionally coadapted to operate best when kept together. This study increases our understanding of how rotaviruses evolve over time in the human population.
Collapse
|
5
|
Liu J, Duan X, Sun J, Yin Y, Li G, Wang L, Liu B. Bi-factor analysis based on noise-reduction (BIFANR): a new algorithm for detecting coevolving amino acid sites in proteins. PLoS One 2013; 8:e79764. [PMID: 24278175 PMCID: PMC3835919 DOI: 10.1371/journal.pone.0079764] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2013] [Accepted: 09/29/2013] [Indexed: 11/23/2022] Open
Abstract
Previous statistical analyses have shown that amino acid sites in a protein evolve in a correlated way instead of independently. Even though located distantly in the linear sequence, the coevolved amino acids could be spatially adjacent in the tertiary structure, and constitute specific protein sectors. Moreover, these protein sectors are independent of one another in structure, function, and even evolution. Thus, systematic studies on protein sectors inside a protein will contribute to the clarification of protein function. In this paper, we propose a new algorithm BIFANR (Bi-factor Analysis Based on Noise-reduction) for detecting protein sectors in amino acid sequences. After applying BIFANR on S1A family and PDZ family, we carried out internal correlation test, statistical independence test, evolutionary rate analysis, evolutionary independence analysis, and function analysis to assess the prediction. The results showed that the amino acids in certain predicted protein sector are closely correlated in structure, function, and evolution, while protein sectors are nearly statistically independent. The results also indicated that the protein sectors have distinct evolutionary directions. In addition, compared with other algorithms, BIFANR has higher accuracy and robustness under the influence of noise sites.
Collapse
Affiliation(s)
- Juntao Liu
- School of Mathematics, Shandong University, Jinan, China
| | - Xiaoyun Duan
- School of Life Science, Shandong University, Jinan, China
| | - Jianyang Sun
- School of Mathematics, Shandong University, Jinan, China
| | - Yanbin Yin
- Department of Biological Sciences, Northern Illinois University, DeKalb, Illinois, United States of America
| | - Guojun Li
- School of Mathematics, Shandong University, Jinan, China
| | - Lushan Wang
- School of Life Science, Shandong University, Jinan, China
| | - Bingqiang Liu
- School of Mathematics, Shandong University, Jinan, China
- * E-mail: Bingqiang Liu:
| |
Collapse
|
6
|
Abstract
Ilarviruses were among the first 16 groups of plant viruses approved by ICTV. Like Alfalfa mosaic virus (AMV), bromoviruses, and cucumoviruses they are isometric viruses and possess a single-stranded, tripartite RNA genome. However, unlike these other three groups, ilarviruses were recognized as being recalcitrant subjects for research (their ready lability is reflected in the sigla used to create the group name) and were renowned as unpromising subjects for the production of antisera. However, it was recognized that they shared properties with AMV when the phenomenon of genome activation, in which the coat protein (CP) of the virus is required to be present to initiate infection, was demonstrated to cross group boundaries. The CP of AMV could activate the genome of an ilarvirus and vice versa. Development of the molecular information for ilarviruses lagged behind the knowledge available for the more extensively studied AMV, bromoviruses, and cucumoviruses. In the past 20 years, genomic data for most known ilarviruses have been developed facilitating their detection and allowing the factors involved in the molecular biology of the genus to be investigated. Much information has been obtained using Prunus necrotic ringspot virus and the more extensively studied AMV. A relationship between some ilarviruses and the cucumoviruses has been defined with the recognition that members of both genera encode a 2b protein involved in RNA silencing and long distance viral movement. Here, we present a review of the current knowledge of both the taxonomy and the molecular biology of this genus of agronomically and horticulturally important viruses.
Collapse
|
7
|
Pallas V, Aparicio F, Herranz MC, Amari K, Sanchez-Pina MA, Myrta A, Sanchez-Navarro JA. Ilarviruses of Prunus spp.: a continued concern for fruit trees. PHYTOPATHOLOGY 2012; 102:1108-1120. [PMID: 23148725 DOI: 10.1094/phyto-02-12-0023-rvw] [Citation(s) in RCA: 32] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/01/2023]
Abstract
Prunus spp. are affected by a large number of viruses, causing significant economic losses through either direct or indirect damage, which results in reduced yield and fruit quality. Among these viruses, members of the genus Ilarvirus (isometric labile ringspot viruses) occupy a significant position due to their distribution worldwide. Although symptoms caused by these types of viruses were reported early in the last century, their molecular characterization was not achieved until the 1990s, much later than for other agronomically relevant viruses. This was mainly due to the characteristic liability of virus particles in tissue extracts. In addition, ilarviruses, together with Alfalfa mosaic virus, are unique among plant viruses in that they require a few molecules of the coat protein in the inoculum in order to be infectious, a phenomenon known as genome activation. Another factor that has made the study of this group of viruses difficult is that infectious clones have been obtained only for the type member of the genus, Tobacco streak virus. Four ilarviruses, Prunus necrotic ringspot virus, Prune dwarf virus, Apple mosaic virus, and American plum line pattern virus, are pathogens of the main cultivated fruit trees. As stated in the 9th Report of the International Committee on Taxonomy of Viruses, virions of this genus are "unpromising subjects for the raising of good antisera." With the advent of molecular approaches for their detection and characterization, it has been possible to get a more precise view of their prevalence and genome organization. This review updates our knowledge on the incidence, genome organization and expression, genetic diversity, modes of transmission, and diagnosis, as well as control of this peculiar group of viruses affecting fruit trees.
Collapse
Affiliation(s)
- V Pallas
- Instituto de Biologia Celular y Molecular de Plantas, Universidad Politécnica de Valencia-Consejo, Spain.
| | | | | | | | | | | | | |
Collapse
|
8
|
Sen L, Fares MA, Liang B, Gao L, Wang B, Wang T, Su YJ. Molecular evolution of rbcL in three gymnosperm families: identifying adaptive and coevolutionary patterns. Biol Direct 2011; 6:29. [PMID: 21639885 PMCID: PMC3129321 DOI: 10.1186/1745-6150-6-29] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/13/2011] [Accepted: 06/03/2011] [Indexed: 11/10/2022] Open
Abstract
Background The chloroplast-localized ribulose-1, 5-biphosphate carboxylase/oxygenase (Rubisco), the primary enzyme responsible for autotrophy, is instrumental in the continual adaptation of plants to variations in the concentrations of CO2. The large subunit (LSU) of Rubisco is encoded by the chloroplast rbcL gene. Although adaptive processes have been previously identified at this gene, characterizing the relationships between the mutational dynamics at the protein level may yield clues on the biological meaning of such adaptive processes. The role of such coevolutionary dynamics in the continual fine-tuning of RbcL remains obscure. Results We used the timescale and phylogenetic analyses to investigate and search for processes of adaptive evolution in rbcL gene in three gymnosperm families, namely Podocarpaceae, Taxaceae and Cephalotaxaceae. To understand the relationships between regions identified as having evolved under adaptive evolution, we performed coevolutionary analyses using the software CAPS. Importantly, adaptive processes were identified at amino acid sites located on the contact regions among the Rubisco subunits and on the interface between Rubisco and its activase. Adaptive amino acid replacements at these regions may have optimized the holoenzyme activity. This hypothesis was pinpointed by evidence originated from our analysis of coevolution that supported the correlated evolution between Rubisco and its activase. Interestingly, the correlated adaptive processes between both these proteins have paralleled the geological variation history of the concentration of atmospheric CO2. Conclusions The gene rbcL has experienced bursts of adaptations in response to the changing concentration of CO2 in the atmosphere. These adaptations have emerged as a result of a continuous dynamic of mutations, many of which may have involved innovation of functional Rubisco features. Analysis of the protein structure and the functional implications of such mutations put forward the conclusion that this evolutionary scenario has been possible through a complex interplay between adaptive mutations, often structurally destabilizing, and compensatory mutations. Our results unearth patterns of evolution that have likely optimized the Rubisco activity and uncover mutational dynamics useful in the molecular engineering of enzymatic activities. Reviewers This article was reviewed by Prof. Christian Blouin (nominated by Dr W Ford Doolittle), Dr Endre Barta (nominated by Dr Sandor Pongor), and Dr Nicolas Galtier.
Collapse
Affiliation(s)
- Lin Sen
- CAS Key Laboratory of Plant Germplasm Enhancement and Specialty Agriculture, Wuhan Botanical Garden, Chinese Academy of Sciences, Wuhan, China
| | | | | | | | | | | | | |
Collapse
|
9
|
Castaño A, Ruiz L, Elena SF, Hernández C. Population differentiation and selective constraints in Pelargonium line pattern virus. Virus Res 2011; 155:274-82. [DOI: 10.1016/j.virusres.2010.10.022] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/01/2010] [Revised: 09/23/2010] [Accepted: 10/16/2010] [Indexed: 12/23/2022]
|
10
|
Aparicio F, Pallás V, Sánchez-Navarro J. Implication of the C terminus of the Prunus necrotic ringspot virus movement protein in cell-to-cell transport and in its interaction with the coat protein. J Gen Virol 2010; 91:1865-70. [PMID: 20219894 DOI: 10.1099/vir.0.019950-0] [Citation(s) in RCA: 21] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/01/2023] Open
Abstract
The movement protein (MP) of Prunus necrotic ringspot virus (PNRSV) is required for viral transport. Previous analysis with MPs of other members of the family Bromoviridae has shown that the C-terminal part of these MPs plays a critical role in the interaction with the cognate coat protein (CP) and in cell-to-cell transport. Bimolecular fluorescence complementation and overlay analysis confirm an interaction between the C-terminal 38 aa of PNRSV MP and its cognate CP. Mutational analysis of the C-terminal region of the PNRSV MP revealed that its C-terminal 38 aa are dispensable for virus transport, however, the 4 aa preceding the dispensable C terminus are necessary to target the MP to the plasmodesmata and for the functionality of the protein. The capacity of the PNRSV MP to use either a CP-dependent or a CP-independent cell-to-cell transport is discussed.
Collapse
Affiliation(s)
- Frederic Aparicio
- Instituto de Biología Molecular y Celular de Plantas, IBMCP (UPV-CSIC), Avenida Ingeniero Fausto Elio s/n, 46022 Valencia, Spain
| | | | | |
Collapse
|
11
|
Molecular Coevolution and the Three-Dimensionality of Natural Selection. Evol Biol 2009. [DOI: 10.1007/978-3-642-00952-5_14] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/20/2022]
|
12
|
Codoñer FM, Elena SF. The promiscuous evolutionary history of the family Bromoviridae. J Gen Virol 2008; 89:1739-1747. [DOI: 10.1099/vir.0.2008/000166-0] [Citation(s) in RCA: 43] [Impact Index Per Article: 2.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
Recombination and segment reassortment are important contributors to the standing genetic variation of RNA viruses and are often involved in the genesis of new, emerging viruses. This study explored the role played by these two processes in the evolutionary radiation of the plant virus family Bromoviridae. The evolutionary history of this family has been explored previously using standard molecular phylogenetic methods, but incongruences have been found among the trees inferred from different gene sequences. This would not be surprising if RNA exchange was a common event, as it is well known that recombination and reassortment of genomes are poorly described by standard phylogenetic methods. In an attempt to reconcile these discrepancies, this study first explored the extent of segment reassortment and found that it was common at the origin of the bromoviruses and cucumoviruses and at least at the origin of alfalfa mosaic virus, American plum line pattern virus and citrus leaf rugose virus. Secondly, recombination analyses were performed on each of the three genomic RNAs and it was found that recombination was very common in members of the genera Bromovirus, Cucumovirus and Ilarvirus. Several cases of recombination involving species from different genera were also identified. Finally, a phylogenetic network was constructed reflecting these genetic exchanges. The network confirmed the taxonomic status of the different genera within the family, despite the phylogenetic noise introduced by genetic exchange.
Collapse
Affiliation(s)
| | - Santiago F. Elena
- Instituto de Biología Molecular y Celular de Plantas, CSIC-UPV, Valencia, Spain
| |
Collapse
|
13
|
Codoñer FM, O'Dea S, Fares MA. Reducing the false positive rate in the non-parametric analysis of molecular coevolution. BMC Evol Biol 2008; 8:106. [PMID: 18402697 PMCID: PMC2362121 DOI: 10.1186/1471-2148-8-106] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/05/2007] [Accepted: 04/10/2008] [Indexed: 11/14/2022] Open
Abstract
Background The strength of selective constraints operating on amino acid sites of proteins has a multifactorial nature. In fact, amino acid sites within proteins coevolve due to their functional and/or structural relationships. Different methods have been developed that attempt to account for the evolutionary dependencies between amino acid sites. Researchers have invested a significant effort to increase the sensitivity of such methods. However, the difficulty in disentangling functional co-dependencies from historical covariation has fuelled the scepticism over their power to detect biologically meaningful results. In addition, the biological parameters connecting linear sequence evolution to structure evolution remain elusive. For these reasons, most of the evolutionary studies aimed at identifying functional dependencies among protein domains have focused on the structural properties of proteins rather than on the information extracted from linear multiple sequence alignments (MSA). Non-parametric methods to detect coevolution have been reported to be especially susceptible to produce false positive results based on the properties of MSAs. However, no formal statistical analysis has been performed to definitively test the differential effects of these properties on the sensitivity of such methods. Results Here we test the effect that variations on the MSA properties have over the sensitivity of non-parametric methods to detect coevolution. We test the effect that the size of the MSA (number of sequences), mean pairwise amino acid distance per site and the strength of the coevolution signal have on the ability of non-parametric methods to detect coevolution. Our results indicate that all three factors have significant effects on the accuracy of non-parametric methods. Further, introducing statistical filters improves the sensitivity and increases the statistical power of the methods to detect functional coevolution. Statistical analysis of the physico-chemical properties of amino acid sites in the context of the protein structure reveals striking dependencies among amino acid sites. Results indicate a covariation trend in the hydrophobicities and molecular weight characteristics of amino acid sites when analysing a non-redundant set of 8000 protein structures. Using this biological information as filter in coevolutionary analyses minimises the false positive rate of these methods. Application of these filters to three different proteins with known functional domains supports the importance of using biological filters to detect coevolution. Conclusion Coevolutionary analyses using non-parametric methods have proved difficult and highly prone to provide spurious results depending on the properties of MSAs and on the strength of coevolution between amino acid sites. The application of statistical filters to the number of pairs detected as coevolving reduces significantly the number of artifactual results. Analysis of the physico-chemical properties of amino acid sites in the protein structure context reveals their structure-dependent covariation. The application of this known biological information to the analysis of covariation greatly enhances the functional coevolutionary signal and removes historical covariation. Simultaneous use of statistical and biological data is instrumental in the detection of functional amino acid sites dependencies and compensatory changes at the protein level.
Collapse
Affiliation(s)
- Francisco M Codoñer
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College, Dublin, Ireland.
| | | | | |
Collapse
|
14
|
Genetic diversity of the movement and coat protein genes of South American isolates of Prunus necrotic ringspot virus. Arch Virol 2008; 153:909-19. [PMID: 18365129 DOI: 10.1007/s00705-008-0066-1] [Citation(s) in RCA: 14] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/29/2007] [Accepted: 02/14/2008] [Indexed: 10/22/2022]
Abstract
Prunus necrotic ringspot virus (PNRSV) is distributed worldwide, but no molecular data have been previously reported from South American isolates. The nucleotide sequences corresponding to the movement (MP) and coat (CP) proteins of 23 isolates of PNRSV from Chile, Brazil, and Uruguay, and from different Prunus species, have been obtained. Phylogenetic analysis performed with full-length MP and CP sequences from all the PNRSV isolates confirmed the clustering of the isolates into the previously reported PV32-I, PV96-II and PE5-III phylogroups. No association was found between specific sequences and host, geographic origin or symptomatology. Comparative analysis showed that both MP and CP have phylogroup-specific amino acids and all of the motifs previously characterized for both proteins. The study of the distribution of synonymous and nonsynonymous changes along both open reading frames revealed that most amino acid sites are under the effect of negative purifying selection.
Collapse
|
15
|
Codoñer FM, Fares MA. Why should we care about molecular coevolution? Evol Bioinform Online 2008; 4:29-38. [PMID: 19204805 PMCID: PMC2614197] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/27/2022] Open
Abstract
Non-independent evolution of amino acid sites has become a noticeable limitation of most methods aimed at identifying selective constraints at functionally important amino acid sites or protein regions. The need for a generalised framework to account for non-independence of amino acid sites has fuelled the design and development of new mathematical models and computational tools centred on resolving this problem. Molecular coevolution is one of the most active areas of research, with an increasing rate of new models and methods being developed everyday. Both parametric and non-parametric methods have been developed to account for correlated variability of amino acid sites. These methods have been utilised for detecting phylogenetic, functional and structural coevolution as well as to identify surfaces of amino acid sites involved in protein-protein interactions. Here we discuss and briefly describe these methods, and identify their advantages and limitations.
Collapse
Affiliation(s)
- Francisco M. Codoñer
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College, Institute of Immunology, Biology Department, National University of Ireland Maynooth
| | - Mario A. Fares
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College,Correspondence:
| |
Collapse
|
16
|
Abstract
Non-independent evolution of amino acid sites has become a noticeable limitation of most methods aimed at identifying selective constraints at functionally important amino acid sites or protein regions. The need for a generalised framework to account for non-independence of amino acid sites has fuelled the design and development of new mathematical models and computational tools centred on resolving this problem. Molecular coevolution is one of the most active areas of research, with an increasing rate of new models and methods being developed everyday. Both parametric and non-parametric methods have been developed to account for correlated variability of amino acid sites. These methods have been utilised for detecting phylogenetic, functional and structural coevolution as well as to identify surfaces of amino acid sites involved in protein-protein interactions. Here we discuss and briefly describe these methods, and identify their advantages and limitations.
Collapse
Affiliation(s)
- Francisco M. Codoñer
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College
- Institute of Immunology, Biology Department, National University of Ireland Maynooth
| | - Mario A. Fares
- Evolutionary Genetics and Bioinformatics Laboratory, Department of Genetics, Smurfit Institute of Genetics, University of Dublin, Trinity College
| |
Collapse
|
17
|
Herranz MC, Al Rwahnih M, Sánchez-Navarro JA, Elena SF, Choueiri E, Myrta A, Pallás V. Low genetic variability in the coat and movement proteins of American plum line pattern virus isolates from different geographic origins. Arch Virol 2007; 153:367-73. [DOI: 10.1007/s00705-007-1100-4] [Citation(s) in RCA: 12] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/02/2007] [Accepted: 10/29/2007] [Indexed: 11/30/2022]
|
18
|
In silico identification of functional divergence between the multiple groEL gene paralogs in Chlamydiae. BMC Evol Biol 2007; 7:81. [PMID: 17519003 PMCID: PMC1892554 DOI: 10.1186/1471-2148-7-81] [Citation(s) in RCA: 28] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/25/2007] [Accepted: 05/22/2007] [Indexed: 12/26/2022] Open
Abstract
Background Heat-shock proteins are specialized molecules performing different and essential roles in the cell including protein degradation, folding and trafficking. GroEL is a 60 Kda heat-shock protein ubiquitous in bacteria and has been regarded as an important molecule implicated in chronic inflammatory processes caused by Chlamydiae infections. GroEL in Chlamydiae became duplicated at the origin of the Chlamydiae lineage presenting three distinct molecular chaperones, namely the original protein GroEL1 (Ct110), and its paralogous proteins GroEL2 (Ct604) and GroEL3 (Ct755). These chaperones present differential and independent expressions during the different stages of Chlamydiae infections and have been suggested to present differential physiological and regulatory roles. Results In this comprehensive in silico study we show that GroEL protein paralogs have diverged functionally after the different gene duplication events and that this divergence has occurred mainly between GroEL3 and GroEL1. GroEL2 presents an intermediate functional divergence pattern from GroEL1. Our results point to the different protein-protein interaction patterns between GroEL paralogs and known GroEL protein clients supporting their functional divergence after groEL gene duplication. Analysis of selective constraints identifies periods of adaptive evolution after gene duplication that led to the fixation of amino acid replacements in GroEL protein domains involved in the interaction with GroEL protein clients. Conclusion We demonstrate that GroEL protein copies in Chlamydiae species have diverged functionally after the gene duplication events. We also show that functional divergence has occurred in important functional regions of these GroEL proteins and that very probably have affected the ancestral GroEL regulatory role and protein-protein interaction patterns with GroEL client proteins. Most of the amino acid replacements that have affected interaction with protein clients and that were responsible for the functional divergence between GroEL paralogs were fixed by adaptive evolution after the groEL gene duplication events.
Collapse
|
19
|
Zhou D, Fan X, Tan D, Xu Y, Tavis JE, Di Bisceglie AM. Separation of near full-length hepatitis C virus quasispecies variants from a complex population. J Virol Methods 2007; 141:220-4. [PMID: 17208310 DOI: 10.1016/j.jviromet.2006.12.002] [Citation(s) in RCA: 10] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2006] [Revised: 11/28/2006] [Accepted: 12/06/2006] [Indexed: 11/24/2022]
Abstract
A long RT-PCR (LRP) protocol was developed recently for robust amplification of a near full-length HCV genomic sequence from clinical samples, followed by efficient cloning [Fan, X., Xu, Y., Di Biceglie, A.M., 2006. Efficient amplification and cloning of near full-length hepatitis C virus genome from clinical samples. Biochem. Biophys. Res. Commun. 346, 1163-1172]. In the present study, the LRP protocol has been estimated for its error rate and the validation by sequencing fully the near full-length HCV inserts from six recombinant clones derived from a patient sample with complex viral diversity. These sequences were compared with the near full-length HCV sequence that was generated by direct sequencing of multiple overlapped PCR products from the same sample, referred to as the population sequence. Comparative analysis confirmed the artificial nature of the PCR-assembled population sequence and identified potential domains for linked viral mutations. The data also suggested that the hypervariable region 1 (HVR1) may be a biological marker for the phenotype at the quasispecies level. These observations emphasize the significance of the use of near full-length genomic sequences for HCV genetic studies and for reverse genetic analysis using authentic quasispecies variants.
Collapse
Affiliation(s)
- Donghui Zhou
- Division of Gastroenterology and Hepatology, Department of Internal Medicine, Saint Louis University School of Medicine, St. Louis, MO 63104, USA
| | | | | | | | | | | |
Collapse
|