1
|
Cyclophilin A supports translation of intrinsically disordered proteins and affects haematopoietic stem cell ageing. Nat Cell Biol 2024; 26:593-603. [PMID: 38553595 PMCID: PMC11021199 DOI: 10.1038/s41556-024-01387-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/25/2022] [Accepted: 02/23/2024] [Indexed: 04/11/2024]
Abstract
Loss of protein function is a driving force of ageing. We have identified peptidyl-prolyl isomerase A (PPIA or cyclophilin A) as a dominant chaperone in haematopoietic stem and progenitor cells. Depletion of PPIA accelerates stem cell ageing. We found that proteins with intrinsically disordered regions (IDRs) are frequent PPIA substrates. IDRs facilitate interactions with other proteins or nucleic acids and can trigger liquid-liquid phase separation. Over 20% of PPIA substrates are involved in the formation of supramolecular membrane-less organelles. PPIA affects regulators of stress granules (PABPC1), P-bodies (DDX6) and nucleoli (NPM1) to promote phase separation and increase cellular stress resistance. Haematopoietic stem cell ageing is associated with a post-transcriptional decrease in PPIA expression and reduced translation of IDR-rich proteins. Here we link the chaperone PPIA to the synthesis of intrinsically disordered proteins, which indicates that impaired protein interaction networks and macromolecular condensation may be potential determinants of haematopoietic stem cell ageing.
Collapse
|
2
|
FREEDA: An automated computational pipeline guides experimental testing of protein innovation. J Cell Biol 2023; 222:e202212084. [PMID: 37358475 PMCID: PMC10292211 DOI: 10.1083/jcb.202212084] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2022] [Revised: 04/22/2023] [Accepted: 06/07/2023] [Indexed: 06/27/2023] Open
Abstract
Cell biologists typically focus on conserved regions of a protein, overlooking innovations that can shape its function over evolutionary time. Computational analyses can reveal potential innovations by detecting statistical signatures of positive selection that lead to rapid accumulation of beneficial mutations. However, these approaches are not easily accessible to non-specialists, limiting their use in cell biology. Here, we present an automated computational pipeline FREEDA that provides a simple graphical user interface requiring only a gene name; integrates widely used molecular evolution tools to detect positive selection in rodents, primates, carnivores, birds, and flies; and maps results onto protein structures predicted by AlphaFold. Applying FREEDA to >100 centromere proteins, we find statistical evidence of positive selection within loops and turns of ancient domains, suggesting innovation of essential functions. As a proof-of-principle experiment, we show innovation in centromere binding of mouse CENP-O. Overall, we provide an accessible computational tool to guide cell biology research and apply it to experimentally demonstrate functional innovation.
Collapse
|
3
|
Enrichment of intrinsically disordered residues in ohnologs facilitates abiotic stress resilience in Brassica rapa. JOURNAL OF PLANT RESEARCH 2023; 136:239-251. [PMID: 36607467 DOI: 10.1007/s10265-022-01432-6] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/18/2021] [Accepted: 12/18/2022] [Indexed: 06/17/2023]
Abstract
Arabidopsis thaliana and Brassica rapa are in the same evolutionary lineage, although the latter experienced an additional whole genome triplication event. Therefore, it would be intriguing to investigate the traits that gene duplication imposes to mediate plant stress tolerance. Here, we noticed that B. rapa abiotic stress resistance (ASR) genes which code at least one stress responsive domain have a significantly higher number of paralogs than A. thaliana. Analysing the disordered content of the ASR genes in both species, we found that intrinsically disordered residues (IDR) are specifically enriched in whole genome duplication (WGD) derived paralogs. Subsequently, domain similarity analysis between WGD pairs of both species has revealed that majority of WGD pairs in B. rapa did not share domains with each other. Furthermore, domain enrichment analysis has shown that B. rapa paralogs contain 36 distinct stress responsive enriched domains, significantly higher than A. thaliana paralogs. Next, we performed MSA to investigate the domain conservation between orthologs and ohnologs pairs, we found that 80.13% of B. rapa ohnologs acquire new domains, depicting the fact that ohnologs play a significant role in stress-related behaviours. The average IDR content of the ohnologs enriching new domains after gene duplication in B. rapa (0.19), is also significantly higher than A. thaliana (0.04). Interestingly, we also found that all of these attributes i.e., exhibiting higher number of WGD paralogs and enhancement of IDR in ASR genes of B. rapa compared to A. thaliana is exclusive for ASR genes only. No such significant differences were observed in randomly selected non-ASR genes between the two species. Together these results provide strong support for the hypothesis that augmentation of IDR content followed by a whole genome duplication event imposes the stress resistance potentiality in B. rapa. This research will shed light on the mechanism of how B. rapa is able to successfully adapt to stress over the evolutionary timescale.
Collapse
|
4
|
FREEDA: an automated computational pipeline guides experimental testing of protein innovation by detecting positive selection. BIORXIV : THE PREPRINT SERVER FOR BIOLOGY 2023:2023.02.27.530329. [PMID: 36909479 PMCID: PMC10002610 DOI: 10.1101/2023.02.27.530329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 03/06/2023]
Abstract
Cell biologists typically focus on conserved regions of a protein, overlooking innovations that can shape its function over evolutionary time. Computational analyses can reveal potential innovations by detecting statistical signatures of positive selection that leads to rapid accumulation of beneficial mutations. However, these approaches are not easily accessible to non-specialists, limiting their use in cell biology. Here, we present an automated computational pipeline FREEDA (Finder of Rapidly Evolving Exons in De novo Assemblies) that provides a simple graphical user interface requiring only a gene name, integrates widely used molecular evolution tools to detect positive selection, and maps results onto protein structures predicted by AlphaFold. Applying FREEDA to >100 mouse centromere proteins, we find evidence of positive selection in intrinsically disordered regions of ancient domains, suggesting innovation of essential functions. As a proof-of-principle experiment, we show innovation in centromere binding of CENP-O. Overall, we provide an accessible computational tool to guide cell biology research and apply it to experimentally demonstrate functional innovation.
Collapse
|
5
|
Evolution of the orthopoxvirus core genome. Virus Res 2023; 323:198975. [PMID: 36280003 PMCID: PMC9586335 DOI: 10.1016/j.virusres.2022.198975] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Revised: 10/17/2022] [Accepted: 10/18/2022] [Indexed: 11/07/2022]
Abstract
Orthopoxviruses comprise several relevant pathogens, including the causative agent of smallpox and monkeypox virus. Analysis of orthopoxvirus genome evolution mainly focused on gene gains/losses. We instead analyzed core genes, which are conserved in all orthopoxviruses. We show that, despite their strong constraint, some genes involved in viral morphogenesis and transcription/replication were targets of pervasive positive selection, which was relatively uncommon in immunomodulatory genes. However at least three of the positively selected genes, E3L, A24R, and H3L, might have evolved in response to immune selection. Episodic positive selection was particularly common on the internal branches of the orthopox phylogeny and on the monkeypox virus lineage. The latter showed evidence of episodic positive selection at the D14L gene, which encodes a modulator of complement activation (MOPICE). Notably, two genes (B1R and A33R) targeted by episodic selection on more than one branch are involved in forms of intra-genomic conflict. Finally, we found that, in orthopoxvirus proteomes, intrinsically disordered regions (IDRs) tend to be less constrained and are common targets of positive selection. Extension of our analysis to all poxviruses showed no evidence that the IDR fraction differs with host range. Conversely, we found a strong effect of base composition, which was however not sufficient to explain IDR fraction. We thus suggest that, in poxviruses, the IDR fraction is maintained by modulating GC content to accommodate disorder-promoting codons. Overall, our data provide novel insight in orthopoxvirus evolution and provide a list of genes and sites that are expected to modulate viral phenotypes.
Collapse
|
6
|
Biomolecular Condensation: A New Phase in Cancer Research. Cancer Discov 2022; 12:2031-2043. [PMID: 35852417 PMCID: PMC9437557 DOI: 10.1158/2159-8290.cd-21-1605] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/07/2021] [Revised: 04/06/2022] [Accepted: 06/08/2022] [Indexed: 01/09/2023]
Abstract
Multicellularity was a watershed development in evolution. However, it also meant that individual cells could escape regulatory mechanisms that restrict proliferation at a severe cost to the organism: cancer. From the standpoint of cellular organization, evolutionary complexity scales to organize different molecules within the intracellular milieu. The recent realization that many biomolecules can "phase-separate" into membraneless organelles, reorganizing cellular biochemistry in space and time, has led to an explosion of research activity in this area. In this review, we explore mechanistic connections between phase separation and cancer-associated processes and emerging examples of how these become deranged in malignancy. SIGNIFICANCE One of the fundamental functions of phase separation is to rapidly and dynamically respond to environmental perturbations. Importantly, these changes often lead to alterations in cancer-relevant pathways and processes. This review covers recent advances in the field, including emerging principles and mechanisms of phase separation in cancer.
Collapse
|
7
|
Substrate spectrum of PPM1D in the cellular response to DNA double-strand breaks. iScience 2022; 25:104892. [PMID: 36060052 PMCID: PMC9436757 DOI: 10.1016/j.isci.2022.104892] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/28/2022] [Revised: 07/03/2022] [Accepted: 08/02/2022] [Indexed: 12/03/2022] Open
Abstract
PPM1D is a p53-regulated protein phosphatase that modulates the DNA damage response (DDR) and is frequently altered in cancer. Here, we employed chemical inhibition of PPM1D and quantitative mass spectrometry-based phosphoproteomics to identify the substrates of PPM1D upon induction of DNA double-strand breaks (DSBs) by etoposide. We identified 73 putative PPM1D substrates that are involved in DNA repair, regulation of transcription, and RNA processing. One-third of DSB-induced S/TQ phosphorylation sites are dephosphorylated by PPM1D, demonstrating that PPM1D only partially counteracts ATM/ATR/DNA-PK signaling. PPM1D-targeted phosphorylation sites are found in a specific amino acid sequence motif that is characterized by glutamic acid residues, high intrinsic disorder, and poor evolutionary conservation. We identified a functionally uncharacterized protein Kanadaptin as ATM and PPM1D substrate upon DSB induction. We propose that PPM1D plays a role during the response to DSBs by regulating the phosphorylation of DNA- and RNA-binding proteins in intrinsically disordered regions. MS-based phosphoproteomic profiling of PPM1D substrates in U2OS and HCT116 cells PPM1D counteracts ATM in the cellular response to DNA double-strand breaks PPM1D target sites localize to glutamic acid-rich regions with high intrinsic disorder Kanadaptin is a putative DNA damage response factor regulated by ATM and PPM1D
Collapse
|
8
|
Genome-Wide Characterization of the MLO Gene Family in Cannabis sativa Reveals Two Genes as Strong Candidates for Powdery Mildew Susceptibility. FRONTIERS IN PLANT SCIENCE 2021; 12:729261. [PMID: 34589104 PMCID: PMC8475652 DOI: 10.3389/fpls.2021.729261] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 06/22/2021] [Accepted: 08/19/2021] [Indexed: 06/13/2023]
Abstract
Cannabis sativa is increasingly being grown around the world for medicinal, industrial, and recreational purposes. As in all cultivated plants, cannabis is exposed to a wide range of pathogens, including powdery mildew (PM). This fungal disease stresses cannabis plants and reduces flower bud quality, resulting in significant economic losses for licensed producers. The Mildew Locus O (MLO) gene family encodes plant-specific proteins distributed among conserved clades, of which clades IV and V are known to be involved in susceptibility to PM in monocots and dicots, respectively. In several studies, the inactivation of those genes resulted in durable resistance to the disease. In this study, we identified and characterized the MLO gene family members in five different cannabis genomes. Fifteen Cannabis sativa MLO (CsMLO) genes were manually curated in cannabis, with numbers varying between 14, 17, 19, 18, and 18 for CBDRx, Jamaican Lion female, Jamaican Lion male, Purple Kush, and Finola, respectively (when considering paralogs and incomplete genes). Further analysis of the CsMLO genes and their deduced protein sequences revealed that many characteristics of the gene family, such as the presence of seven transmembrane domains, the MLO functional domain, and particular amino acid positions, were present and well conserved. Phylogenetic analysis of the MLO protein sequences from all five cannabis genomes and other plant species indicated seven distinct clades (I through VII), as reported in other crops. Expression analysis revealed that the CsMLOs from clade V, CsMLO1 and CsMLO4, were significantly upregulated following Golovinomyces ambrosiae infection, providing preliminary evidence that they could be involved in PM susceptibility. Finally, the examination of variation within CsMLO1 and CsMLO4 in 32 cannabis cultivars revealed several amino acid changes, which could affect their function. Altogether, cannabis MLO genes were identified and characterized, among which candidates potentially involved in PM susceptibility were noted. The results of this study will lay the foundation for further investigations, such as the functional characterization of clade V MLOs as well as the potential impact of the amino acid changes reported. Those will be useful for breeding purposes in order to develop resistant cultivars.
Collapse
|
9
|
Evidence for a synergistic effect of post-translational modifications and genomic composition of eEF-1α on the adaptation of Phytophthora infestans. Ecol Evol 2021; 11:5484-5496. [PMID: 34026022 PMCID: PMC8131795 DOI: 10.1002/ece3.7442] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2021] [Revised: 02/19/2021] [Accepted: 02/21/2021] [Indexed: 12/18/2022] Open
Abstract
Genetic variation plays a fundamental role in pathogen's adaptation to environmental stresses. Pathogens with low genetic variation tend to survive and proliferate more poorly due to their lack of genotypic/phenotypic polymorphisms in responding to fluctuating environments. Evolutionary theory hypothesizes that the adaptive disadvantage of genes with low genomic variation can be compensated for structural diversity of proteins through post-translation modification (PTM) but this theory is rarely tested experimentally and its implication to sustainable disease management is hardly discussed. In this study, we analyzed nucleotide characteristics of eukaryotic translation elongation factor-1α (eEF-lα) gene from 165 Phytophthora infestans isolates and the physical and chemical properties of its derived proteins. We found a low sequence variation of eEF-lα protein, possibly attributable to purifying selection and a lack of intra-genic recombination rather than reduced mutation. In the only two isoforms detected by the study, the major one accounted for >95% of the pathogen collection and displayed a significantly higher fitness than the minor one. High lysine representation enhances the opportunity of the eEF-1α protein to be methylated and the absence of disulfide bonds is consistent with the structural prediction showing that many disordered regions are existed in the protein. Methylation, structural disordering, and possibly other PTMs ensure the ability of the protein to modify its functions during biological, cellular and biochemical processes, and compensate for its adaptive disadvantage caused by sequence conservation. Our results indicate that PTMs may function synergistically with nucleotide codes to regulate the adaptive landscape of eEF-1α, possibly as well as other housekeeping genes, in P. infestans. Compensatory evolution between pre- and post-translational phase in eEF-1α could enable pathogens quickly adapting to disease management strategies while efficiently maintaining critical roles of the protein playing in biological, cellular, and biochemical activities. Implications of these results to sustainable plant disease management are discussed.
Collapse
|
10
|
Insights into the evolutionary forces that shape the codon usage in the viral genome segments encoding intrinsically disordered protein regions. Brief Bioinform 2021; 22:6231751. [PMID: 33866372 DOI: 10.1093/bib/bbab145] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/08/2021] [Revised: 03/17/2021] [Accepted: 03/26/2021] [Indexed: 12/22/2022] Open
Abstract
Intrinsically disordered regions/proteins (IDRs) are abundant across all the domains of life, where they perform important regulatory roles and supplement the biological functions of structured proteins/regions (SRs). Despite the multifunctionality features of IDRs, several interrogations on the evolution of viral genomic regions encoding IDRs in diverse viral proteins remain unreciprocated. To fill this gap, we benchmarked the findings of two most widely used and reliable intrinsic disorder prediction algorithms (IUPred2A and ESpritz) to a dataset of 6108 reference viral proteomes to unravel the multifaceted evolutionary forces that shape the codon usage in the viral genomic regions encoding for IDRs and SRs. We found persuasive evidence that the natural selection predominantly governs the evolution of codon usage in regions encoding IDRs by most of the viruses. In addition, we confirm not only that codon usage in regions encoding IDRs is less optimized for the protein synthesis machinery (transfer RNAs pool) of their host than for those encoding SRs, but also that the selective constraints imposed by codon bias sustain this reduced optimization in IDRs. Our analysis also establishes that IDRs in viruses are likely to tolerate more translational errors than SRs. All these findings hold true, irrespective of the disorder prediction algorithms used to classify IDRs. In conclusion, our study offers a novel perspective on the evolution of viral IDRs and the evolutionary adaptability to multiple taxonomically divergent hosts.
Collapse
|
11
|
The Phytophthora infestans AVR2 Effector Escapes R2 Recognition Through Effector Disordering. MOLECULAR PLANT-MICROBE INTERACTIONS : MPMI 2020; 33:921-931. [PMID: 32212906 DOI: 10.1094/mpmi-07-19-0179-r] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Intrinsic disorder is a common structural characteristic of proteins and a central player in the biochemical processes of species. However, the role of intrinsic disorder in the evolution of plant-pathogen interactions is rarely investigated. Here, we explored the role of intrinsic disorder in the development of the pathogenicity in the RXLR AVR2 effector of Phytophthora infestans. We found AVR2 exhibited high nucleotide diversity generated by point mutation, early-termination, altered start codon, deletion/insertion, and intragenic recombination and is predicted to be an intrinsically disordered protein. AVR2 amino acid sequences conferring a virulent phenotype had a higher disorder tendency in both the N- and C-terminal regions compared with sequences conferring an avirulent phenotype. In addition, we also found virulent AVR2 mutants gained one or two short linear interaction motifs, the critical components of disordered proteins required for protein-protein interactions. Furthermore, virulent AVR2 mutants were predicted to be unstable and have a short protein half-life. Taken together, these results support the notion that intrinsic disorder is important for the effector function of pathogens and demonstrate that SLiM-mediated protein-protein interaction in the C-terminal effector domain might contribute greatly to the evasion of resistance-protein detection in P. infestans.
Collapse
|
12
|
Intrinsically disordered regions are abundant in simplexvirus proteomes and display signatures of positive selection. Virus Evol 2020; 6:veaa028. [PMID: 32411391 PMCID: PMC7211401 DOI: 10.1093/ve/veaa028] [Citation(s) in RCA: 7] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/08/2023] Open
Abstract
Whereas the majority of herpesviruses co-speciated with their mammalian hosts, human herpes simplex virus 2 (HSV-2, genus Simplexvirus) most likely originated from the cross-species transmission of chimpanzee herpesvirus 1 to an ancestor of modern humans. We exploited the peculiar evolutionary history of HSV-2 to investigate the selective events that drove herpesvirus adaptation to a new host. We show that HSV-2 intrinsically disordered regions (IDRs)-that is, protein domains that do not adopt compact three-dimensional structures-are strongly enriched in positive selection signals. Analysis of viral proteomes indicated that a significantly higher portion of simplexvirus proteins is disordered compared with the proteins of other human herpesviruses. IDR abundance in simplexvirus proteomes was not a consequence of the base composition of their genomes (high G + C content). Conversely, protein function determines the IDR fraction, which is significantly higher in viral proteins that interact with human factors. We also found that the average extent of disorder in herpesvirus proteins tends to parallel that of their human interactors. These data suggest that viruses that interact with fast-evolving, disordered human proteins, in turn, evolve disordered viral interactors poised for innovation. We propose that the high IDR fraction present in simplexvirus proteomes contributes to their wider host range compared with other herpesviruses.
Collapse
|
13
|
Differential B-Cell Receptor Signaling Requirement for Adhesion of Mantle Cell Lymphoma Cells to Stromal Cells. Cancers (Basel) 2020; 12:cancers12051143. [PMID: 32370190 PMCID: PMC7281289 DOI: 10.3390/cancers12051143] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/27/2020] [Revised: 04/28/2020] [Accepted: 04/29/2020] [Indexed: 01/01/2023] Open
Abstract
Interactions between lymphoma cells and stromal cells play a key role in promoting tumor survival and development of drug resistance. We identified differences in key signaling pathways between the JeKo-1 and REC-1 mantle cell lymphoma (MCL) cell lines, displaying different patterns of stromal cell adhesion and chemotaxis towards stroma-conditioned medium. The identified adhesion-regulated genes reciprocated important aspects of microenvironment-mediated gene modulation in MCL patients. Five-hundred and ninety genes were differently regulated between the cell lines upon adhesion to stromal cells, while 32 genes were similarly regulated in both cell lines. Regulation of B-cell Receptor (BCR) signature genes in adherent cells was specific for JeKo-1. Inhibition of BCR using siRNA or clinically approved inhibitors, Ibrutinib and Acalabrutinib, decreased adhesion of JeKo-1, but not REC-1 cells. Cell surface levels of chemokine receptor CXCR4 were higher in JeKo-1, facilitating migration and adhesion of JeKo-1 but not REC-1 cells. Surface levels of ICAM1 adhesion protein differ for REC-1 and JeKo-1. While ICAM1 played a positive role in adherence of both cell lines to stromal cells, S1PR1 had an inhibitory effect. Our results provide a model framework for further investigation of mechanistic differences in patient-response to new pathway-specific drugs.
Collapse
|
14
|
Patronus is the elusive plant securin, preventing chromosome separation by antagonizing separase. Proc Natl Acad Sci U S A 2019; 116:16018-16027. [PMID: 31324745 PMCID: PMC6690013 DOI: 10.1073/pnas.1906237116] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/12/2022] Open
Abstract
Accurate chromosome segregation at mitosis and meiosis is crucial to prevent genome instability, birth defect, and cancer. Accordingly, separase, the protease that triggers chromosome distribution, is tightly regulated by a direct inhibitor, the securin. However, securin has not been identified, neither functionnally nor by sequence similarity, in other clades that fungi and animals. This raised doubts about the conservation of this mechanism in other branches of eukaryotes. Here, we identify and characterize the securin in plants. Despite extreme sequence divergence, the securin kept the same core function and is likely a universal regulator of cell division in eukaryotes. Chromosome distribution at anaphase of mitosis and meiosis is triggered by separase, an evolutionarily conserved protease. Separase must be tightly regulated to prevent the untimely release of chromatid cohesion and disastrous chromosome distribution defects. Securin is the key inhibitor of separase in animals and fungi, but has not been identified in other eukaryotic lineages. Here, we identified PATRONUS1 and PATRONUS2 (PANS1 and PANS2) as the Arabidopsis homologs of securin. Disruption of PANS1 is known to lead to the premature separation of chromosomes at meiosis, and the simultaneous disruption of PANS1 and PANS2 is lethal. Here, we show that PANS1 targeting by the anaphase-promoting complex is required to trigger chromosome separation, mirroring the regulation of securin. We showed that PANS1 acts independently from Shugosins. In a genetic screen for pans1 suppressors, we identified SEPARASE mutants, showing that PANS1 and SEPARASE have antagonistic functions in vivo. Finally, we showed that the PANS1 and PANS2 proteins interact directly with SEPARASE. Altogether, our results show that PANS1 and PANS2 act as a plant securin. Remote sequence similarity was identified between the plant patronus family and animal securins, suggesting that they indeed derive from a common ancestor. Identification of patronus as the elusive plant securin illustrates the extreme sequence divergence of this central regulator of mitosis and meiosis.
Collapse
|
15
|
New Zealand Tree and Giant Wētā (Orthoptera) Transcriptomics Reveal Divergent Selection Patterns in Metabolic Loci. Genome Biol Evol 2019; 11:1293-1306. [PMID: 30957857 PMCID: PMC6486805 DOI: 10.1093/gbe/evz070] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 04/01/2019] [Indexed: 01/01/2023] Open
Abstract
Exposure to low temperatures requires an organism to overcome physiological challenges. New Zealand wētā belonging to the genera Hemideina and Deinacrida are found across a wide range of thermal environments and therefore subject to varying selective pressures. Here we assess the selection pressures across the wētā phylogeny, with a particular emphasis on identifying genes under positive or diversifying selection. We used RNA-seq to generate transcriptomes for all 18 Deinacrida and Hemideina species. A total of 755 orthologous genes were identified using a bidirectional best-hit approach, with the resulting gene set encompassing a diverse range of functional classes. Analysis of ortholog ratios of synonymous to nonsynonymous amino acid changes found 83 genes that are under positive selection for at least one codon. A wide variety of Gene Ontology terms, enzymes, and KEGG (Kyoto Encyclopedia of Genes and Genomes) pathways are represented among these genes. In particular, enzymes involved in oxidative phosphorylation, melanin synthesis, and free-radical scavenging are represented, consistent with physiological and metabolic changes that are associated with adaptation to alpine environments. Structural alignment of the transcripts with the most codons under positive selection revealed that the majority of sites are surface residues, and therefore have the potential to influence the thermostability of the enzyme, with the exception of prophenoloxidase where two residues near the active site are under selection. These proteins provide interesting candidates for further analysis of protein evolution.
Collapse
|
16
|
First Experimental Assessment of Protein Intrinsic Disorder Involvement in an RNA Virus Natural Adaptive Process. Mol Biol Evol 2019; 35:38-49. [PMID: 29029259 PMCID: PMC5850501 DOI: 10.1093/molbev/msx249] [Citation(s) in RCA: 31] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023] Open
Abstract
Intrinsic disorder (ID) in proteins is defined as a lack of stable structure in physiological conditions. Intrinsically disordered regions (IDRs) are highly abundant in some RNA virus proteomes. Low topological constraints exerted on IDRs are expected to buffer the effect of numerous deleterious mutations and could be related to the remarkable adaptive potential of RNA viruses to overcome resistance of their host. To experimentally test this hypothesis in a natural pathosystem, a set of four variants of Potato virus Y (PVY; Potyvirus genus) containing various ID degrees in the Viral genome-linked (VPg) protein, a key determinant of potyvirus adaptation, was designed. To estimate the ID contribution to the VPg-based PVY adaptation, the adaptive ability of the four PVY variants was monitored in the pepper host (Capsicum annuum) carrying a recessive resistance gene. Intriguingly, the two mutants with the highest ID content displayed a significantly higher ability to restore infection in the resistant host, whereas the less intrinsically disordered mutant was unable to restore infection. The role of ID on virus adaptation may be due either to a larger exploration of evolutionary pathways or the minimization of fitness penalty caused by resistance-breaking mutations. This pioneering study strongly suggests the positive impact of ID in an RNA virus adaptive capacity.
Collapse
|
17
|
Comparative analysis of mutational robustness of the intrinsically disordered viral protein VPg and of its interactor eIF4E. PLoS One 2019; 14:e0211725. [PMID: 30763345 PMCID: PMC6375565 DOI: 10.1371/journal.pone.0211725] [Citation(s) in RCA: 16] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2018] [Accepted: 01/20/2019] [Indexed: 01/02/2023] Open
Abstract
Conformational intrinsic disorder is a feature present in many virus proteins. Intrinsically disordered regions (IDRs) have weaker structural requirement than ordered regions and mutations in IDRs could have a lower impact on the virus fitness. This could favor its exploration of adaptive solutions. The potyviral protein VPg contains IDRs with determinants for adaptation to its host plant. To experimentally assess whether IDRs are more resistant to mutations than ordered regions, the biologically relevant interaction between mutant libraries of both VPg and the eukaryotic translation initiation factor 4E (eIF4E) and their respective wild type partner was examined using yeast two hybrid assay. Our data shows that VPg is significantly more robust to mutations than eIF4E and as such belongs to a particular class of intrinsically disordered proteins. This result is discussed from the standpoint of IDRs involvement in the virus adaptive processes.
Collapse
|
18
|
Human long intrinsically disordered protein regions are frequent targets of positive selection. Genome Res 2018; 28:975-982. [PMID: 29858274 PMCID: PMC6028134 DOI: 10.1101/gr.232645.117] [Citation(s) in RCA: 39] [Impact Index Per Article: 6.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/21/2017] [Accepted: 06/01/2018] [Indexed: 12/20/2022]
Abstract
Intrinsically disordered regions occur frequently in proteins and are characterized by a lack of a well-defined three-dimensional structure. Although these regions do not show a higher order of structural organization, they are known to be functionally important. Disordered regions are rapidly evolving, largely attributed to relaxed purifying selection and an increased role of genetic drift. It has also been suggested that positive selection might contribute to their rapid diversification. However, for our own species, it is currently unknown whether positive selection has played a role during the evolution of these protein regions. Here, we address this question by investigating the evolutionary pattern of more than 6600 human proteins with intrinsically disordered regions and their ordered counterparts. Our comparative approach with data from more than 90 mammalian genomes uses a priori knowledge of disordered protein regions, and we show that this increases the power to detect positive selection by an order of magnitude. We can confirm that human intrinsically disordered regions evolve more rapidly, not only within humans but also across the entire mammalian phylogeny. They have, however, experienced substantial evolutionary constraint, hinting at their fundamental functional importance. We find compelling evidence that disordered protein regions are frequent targets of positive selection and estimate that the relative rate of adaptive substitutions differs fourfold between disordered and ordered protein regions in humans. Our results suggest that disordered protein regions are important targets of genetic innovation and that the contribution of positive selection in these regions is more pronounced than in other protein parts.
Collapse
|
19
|
A subset of functional adaptation mutations alter propensity for α-helical conformation in the intrinsically disordered glucocorticoid receptor tau1core activation domain. Biochim Biophys Acta Gen Subj 2018; 1862:1452-1461. [PMID: 29550429 DOI: 10.1016/j.bbagen.2018.03.015] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2017] [Revised: 03/07/2018] [Accepted: 03/10/2018] [Indexed: 01/22/2023]
Abstract
BACKGROUND Adaptive mutations that alter protein functionality are enriched within intrinsically disordered protein regions (IDRs), thus conformational flexibility correlates with evolvability. Pre-structured motifs (PreSMos) with transient propensity for secondary structure conformation are believed to be important for IDR function. The glucocorticoid receptor tau1core transcriptional activation domain (GR tau1core) domain contains three α-helical PreSMos in physiological buffer conditions. METHODS Sixty change-of-function mutants affecting the intrinsically disordered 58-residue GR tau1core were studied using disorder prediction and molecular dynamics simulations. RESULTS Change-of-function mutations were partitioned into seven clusters based on their effect on IDR predictions and gene activation activity. Some mutations selected from clusters characterized by mutations altering the IDR prediction score, altered the apparent stability of the α-helical form of one of the PreSMos in molecular dynamics simulations, suggesting PreSMo stabilization or destabilization as strategies for functional adaptation. Indeed all tested gain-of-function mutations affecting this PreSMo were associated with increased stability of the α-helical PreSMo conformation, suggesting that PreSMo stabilization may be the main mechanism by which adaptive mutations can increase the activity of this IDR type. Some mutations did not appear to affect PreSMo stability. CONCLUSIONS Changes in PreSMo stability account for the effects of a subset of change-of-function mutants affecting the GR tau1core IDR. GENERAL SIGNIFICANCE Long IDRs occur in about 50% of human proteins. They are poorly characterized despite much recent attention. Our results suggest the importance of a subtle balance between PreSMo stability and IDR activity, which may provide a novel target for future pharmaceutical intervention.
Collapse
|
20
|
Pathogens and Disease Play Havoc on the Host Epiproteome-The "First Line of Response" Role for Proteomic Changes Influenced by Disorder. Int J Mol Sci 2018. [PMID: 29518008 PMCID: PMC5877633 DOI: 10.3390/ijms19030772] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022] Open
Abstract
Organisms face stress from multiple sources simultaneously and require mechanisms to respond to these scenarios if they are to survive in the long term. This overview focuses on a series of key points that illustrate how disorder and post-translational changes can combine to play a critical role in orchestrating the response of organisms to the stress of a changing environment. Increasingly, protein complexes are thought of as dynamic multi-component molecular machines able to adapt through compositional, conformational and/or post-translational modifications to control their largely metabolic outputs. These metabolites then feed into cellular physiological homeostasis or the production of secondary metabolites with novel anti-microbial properties. The control of adaptations to stress operates at multiple levels including the proteome and the dynamic nature of proteomic changes suggests a parallel with the equally dynamic epigenetic changes at the level of nucleic acids. Given their properties, I propose that some disordered protein platforms specifically enable organisms to sense and react rapidly as the first line of response to change. Using examples from the highly dynamic host-pathogen and host-stress response, I illustrate by example how disordered proteins are key to fulfilling the need for multiple levels of integration of response at different time scales to create robust control points.
Collapse
|
21
|
Mechanism of intersubunit ketosynthase-dehydratase interaction in polyketide synthases. Nat Chem Biol 2018; 14:270-275. [PMID: 29309054 PMCID: PMC5846730 DOI: 10.1038/nchembio.2549] [Citation(s) in RCA: 27] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/24/2017] [Accepted: 11/15/2017] [Indexed: 12/16/2022]
Abstract
Modular polyketide synthases (PKSs) produce numerous structurally complex natural products that have diverse applications in medicine and agriculture. PKSs typically consist of several multienzyme subunits that utilize structurally defined docking domains (DDs) at their N and C termini to ensure correct assembly into functional multiprotein complexes. Here we report a fundamentally different mechanism for subunit assembly in trans-acyltransferase (trans-AT) modular PKSs at the junction between ketosynthase (KS) and dehydratase (DH) domains. This mechanism involves direct interaction of a largely unstructured docking domain (DD) at the C terminus of the KS with the surface of the downstream DH. Acyl transfer assays and mechanism-based crosslinking established that the DD is required for the KS to communicate with the acyl carrier protein appended to the DH. Two distinct regions for binding of the DD to the DH were identified using NMR spectroscopy, carbene footprinting, and mutagenesis, providing a foundation for future elucidation of the molecular basis for interaction specificity.
Collapse
|
22
|
Therapeutic Interventions of Cancers Using Intrinsically Disordered Proteins as Drug Targets: c-Myc as Model System. Cancer Inform 2017; 16:1176935117699408. [PMID: 28469390 PMCID: PMC5392011 DOI: 10.1177/1176935117699408] [Citation(s) in RCA: 37] [Impact Index Per Article: 5.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/03/2016] [Accepted: 02/20/2017] [Indexed: 12/29/2022] Open
Abstract
The concept of protein intrinsic disorder has taken the driving seat to understand regulatory proteins in general. Reports suggest that in mammals nearly 75% of signalling proteins contain long disordered regions with greater than 30 amino acid residues. Therefore, intrinsically disordered proteins (IDPs) have been implicated in several human diseases and should be considered as potential novel drug targets. Moreover, intrinsic disorder provides a huge multifunctional capability to hub proteins such as c-Myc and p53. c-Myc is the hot spot for understanding and developing therapeutics against cancers and cancer stem cells. Our past understanding is mainly based on in vitro and in vivo experiments conducted using c-Myc as whole protein. Using the reductionist approach, c-Myc oncoprotein has been divided into structured and disordered domains. A wealth of data is available dealing with the structured perspectives of c-Myc, but understanding c-Myc in terms of disordered domains has just begun. Disorderness provides enormous flexibility to proteins in general for binding to numerous partners. Here, we have reviewed the current progress on understanding c-Myc using the emerging concept of IDPs.
Collapse
|
23
|
The Argonaute-binding platform of NRPE1 evolves through modulation of intrinsically disordered repeats. THE NEW PHYTOLOGIST 2016; 212:1094-1105. [PMID: 27431917 PMCID: PMC5125548 DOI: 10.1111/nph.14089] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/19/2016] [Accepted: 06/04/2016] [Indexed: 05/26/2023]
Abstract
Argonaute (Ago) proteins are important effectors in RNA silencing pathways, but they must interact with other machinery to trigger silencing. Ago hooks have emerged as a conserved motif responsible for interaction with Ago proteins, but little is known about the sequence surrounding Ago hooks that must restrict or enable interaction with specific Argonautes. Here we investigated the evolutionary dynamics of an Ago-binding platform in NRPE1, the largest subunit of RNA polymerase V. We compared NRPE1 sequences from > 50 species, including dense sampling of two plant lineages. This study demonstrates that the Ago-binding platform of NRPE1 retains Ago hooks, intrinsic disorder, and repetitive character while being highly labile at the sequence level. We reveal that loss of sequence conservation is the result of relaxed selection and frequent expansions and contractions of tandem repeat arrays. These factors allow a complete restructuring of the Ago-binding platform over 50-60 million yr. This evolutionary pattern is also detected in a second Ago-binding platform, suggesting it is a general mechanism. The presence of labile repeat arrays in all analyzed NRPE1 Ago-binding platforms indicates that selection maintains repetitive character, potentially to retain the ability to rapidly restructure the Ago-binding platform.
Collapse
|
24
|
Protein intrinsic disorder within the Potyvirus genus: from proteome-wide analysis to functional annotation. MOLECULAR BIOSYSTEMS 2016; 12:634-52. [PMID: 26699268 DOI: 10.1039/c5mb00677e] [Citation(s) in RCA: 32] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Within proteins, intrinsically disordered regions (IDRs) are devoid of stable secondary and tertiary structures under physiological conditions and rather exist as dynamic ensembles of inter-converting conformers. Although ubiquitous in all domains of life, the intrinsic disorder content is highly variable in viral genomes. Over the years, functional annotations of disordered regions at the scale of the whole proteome have been conducted for several animal viruses. But to date, similar studies applied to plant viruses are still missing. Based on disorder prediction tools combined with annotation programs and evolutionary studies, we analyzed the intrinsic disorder content in Potyvirus, using a 10-species dataset representative of this genus diversity. In this paper, we revealed that: (i) the Potyvirus proteome displays high disorder content, (ii) disorder is conserved during Potyvirus evolution, suggesting a functional advantage of IDRs, (iii) IDRs evolve faster than ordered regions, and (iv) IDRs may be associated with major biological functions required for the Potyvirus cycle. Notably, the proteins P1, Coat protein (CP) and Viral genome-linked protein (VPg) display a high content of conserved disorder, enriched in specific motifs mimicking eukaryotic functional modules and suggesting strategies of host machinery hijacking. In these three proteins, IDRs are particularly conserved despite their high amino acid polymorphism, indicating a link to adaptive processes. Through this comprehensive study, we further investigate the biological relevance of intrinsic disorder in Potyvirus biology and we propose a functional annotation of potyviral proteome IDRs.
Collapse
|
25
|
Abstract
A diverse group of genes are involved in the tooth development of mammals. Several studies, focused mainly on mice and rats, have provided a detailed depiction of the processes coordinating tooth formation and shape. Here we surveyed 236 tooth-associated genes in 39 mammalian genomes and tested for signatures of selection to assess patterns of molecular adaptation in genes regulating mammalian dentition. Of the 236 genes, 31 (∼13.1%) showed strong signatures of positive selection that may be responsible for the phenotypic diversity observed in mammalian dentition. Mammalian-specific tooth-associated genes had accelerated mutation rates compared with older genes found across all vertebrates. More recently evolved genes had fewer interactions (either genetic or physical), were associated with fewer Gene Ontology terms and had faster evolutionary rates compared with older genes. The introns of these positively selected genes also exhibited accelerated evolutionary rates, which may reflect additional adaptive pressure in the intronic regions that are associated with regulatory processes that influence tooth-gene networks. The positively selected genes were mainly involved in processes like mineralization and structural organization of tooth specific tissues such as enamel and dentin. Of the 236 analyzed genes, 12 mammalian-specific genes (younger genes) provided insights on diversification of mammalian teeth as they have higher evolutionary rates and exhibit different expression profiles compared with older genes. Our results suggest that the evolution and development of mammalian dentition occurred in part through positive selection acting on genes that previously had other functions.
Collapse
|
26
|
Buffering deleterious polymorphisms in highly constrained parts of HIV-1 envelope by flexible regions. Retrovirology 2016; 13:50. [PMID: 27473399 PMCID: PMC4967302 DOI: 10.1186/s12977-016-0285-6] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/24/2016] [Accepted: 07/18/2016] [Indexed: 02/07/2023] Open
Abstract
Background Covariation is an essential process that leads to coevolution of parts of proteins and genomes. In organisms subject to strong selective pressure, coevolution is central to keep the balance between the opposite requirements of antigenic variation and retention of functionality. Being the viral component most exposed to the external environment, the HIV-1 glycoprotein gp120 constitutes the main target of the immune response. Accordingly its more external portions are characterised by extensive sequence heterogeneity fostering constant antigenic variation. Results We report that a single polymorphism, present at the level of the viral population in the conserved internal region C2, was sufficient to totally abolish Env functionality when introduced in an exogenous genetic context. The prominent defect of the non-functional protein is a block occurring after recognition of the co-receptor CCR5, likely due to an interference with the subsequent conformational changes that lead to membrane fusion. We also report that the presence of compensatory polymorphisms at the level of the external and hypervariable region V3 fully restored the functionality of the protein. The functional revertant presents different antigenic profiles and sensitivity to the entry inhibitor TAK 779. Conclusions Our data suggest that variable regions, besides harbouring intrinsic extensive antigenic diversity, can also contribute to sequence diversification in more structurally constrained parts of the gp120 by buffering the deleterious effect of polymorphisms, further increasing the genetic flexibility of the protein and the antigenic repertoire of the viral population.
Collapse
|
27
|
The 3-D Structural Basis for the Pgi Genotypic Differences in the Performance of the Butterfly Melitaea cinxia at Different Temperatures. PLoS One 2016; 11:e0160191. [PMID: 27462709 PMCID: PMC4962976 DOI: 10.1371/journal.pone.0160191] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/15/2016] [Accepted: 07/14/2016] [Indexed: 11/18/2022] Open
Abstract
Although genotype-by-environment interaction has long been used to unveil the genetic variation that affects Darwinian fitness, the mechanisms underlying the interaction usually remain unknown. Genetic variation at the dimeric glycolytic enzyme phosphoglucoisomerase (Pgi) has been observed to interact with temperature to explain the variation in the individual performance of the butterfly Melitaea cinxia. At relatively high temperature, individuals with Pgi-non-f genotypes generally surpass those with Pgi-f genotypes, while the opposite applies at relatively low temperature. In this study, we did protein structure predictions and BlastP homology searches with the aim to understand the structural basis for this temperature-dependent difference in the performance of M. cinxia. Our results show that, at amino acid (AA) site 372, one of the two sites that distinguish Pgi-f (the translated polypeptide of the Pgi-f allele) from Pgi-non-f (the translated polypeptide of the Pgi-non-f allele), the Pgi-non-f-related residue strengthens an electrostatic attraction between a pair of residues (Glu373-Lys472) that are from different monomers, compared to the Pgi-f-related residue. Further, BlastP searches of animal protein sequences reveal a dramatic excess of electrostatically attractive combinations of the residues at the Pgi AA sites equivalent to sites 373 and 472 in M. cinxia. This suggests that factors enhancing the inter-monomer interaction between these two sites, and therefore helping the tight association of two Pgi monomers, are favourable. Our homology-modelling results also show that, at the second AA site that distinguishes Pgi-f from Pgi-non-f in M. cinxia, the Pgi-non-f-related residue is more entropy-favourable (leading to higher structural stability) than the Pgi-f-related residue. To sum up, this study suggests a higher structural stability of the protein products of the Pgi-non-f genotypes than those of the Pgi-f genotypes, which may explain why individuals carrying Pgi-non-f genotypes outperform those carrying Pgi-f genotypes at stressful high temerature.
Collapse
|
28
|
Comprehensive Phylogenetic Analysis Sheds Light on the Diversity and Origin of the MLO Family of Integral Membrane Proteins. Genome Biol Evol 2016; 8:878-95. [PMID: 26893454 PMCID: PMC4824068 DOI: 10.1093/gbe/evw036] [Citation(s) in RCA: 60] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 02/13/2016] [Indexed: 12/11/2022] Open
Abstract
Mildew resistanceLocusO(MLO) proteins are polytopic integral membrane proteins that have long been considered as plant-specific and being primarily involved in plant-powdery mildew interactions. However, research in the past decade has revealed that MLO proteins diverged into a family with several clades whose members are associated with different physiological processes. We provide a largely increased dataset of MLO amino acid sequences, comprising nearly all major land plant lineages. Based on this comprehensive dataset, we defined seven phylogenetic clades and reconstructed the likely evolution of the MLO family in embryophytes. We further identified several MLO peptide motifs that are either conserved in all MLO proteins or confined to one or several clades, supporting the notion that clade-specific diversification of MLO functions is associated with particular sequence motifs. In baker's yeast, some of these motifs are functionally linked to transmembrane (TM) transport of organic molecules and ions. In addition, we attempted to define the evolutionary origin of the MLO family and found that MLO-like proteins with highly diverse membrane topologies are present in green algae, but also in the distinctly related red algae (Rhodophyta), Amoebozoa, and Chromalveolata. Finally, we discovered several instances of putative fusion events between MLO proteins and different kinds of proteins. Such Rosetta stone-type hybrid proteins might be instructive for future analysis of potential MLO functions. Our findings suggest that MLO is an ancient protein that possibly evolved in unicellular photosynthetic eukaryotes, and consolidated in land plants with a conserved topology, comprising seven TM domains and an intrinsically unstructured C-terminus.
Collapse
|
29
|
Insights into the Immunological Properties of Intrinsically Disordered Malaria Proteins Using Proteome Scale Predictions. PLoS One 2015; 10:e0141729. [PMID: 26513658 PMCID: PMC4626106 DOI: 10.1371/journal.pone.0141729] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/27/2015] [Accepted: 10/12/2015] [Indexed: 12/31/2022] Open
Abstract
Malaria remains a significant global health burden. The development of an effective malaria vaccine remains as a major challenge with the potential to significantly reduce morbidity and mortality. While Plasmodium spp. have been shown to contain a large number of intrinsically disordered proteins (IDPs) or disordered protein regions, the relationship of protein structure to subcellular localisation and adaptive immune responses remains unclear. In this study, we employed several computational prediction algorithms to identify IDPs at the proteome level of six Plasmodium spp. and to investigate the potential impact of protein disorder on adaptive immunity against P. falciparum parasites. IDPs were shown to be particularly enriched within nuclear proteins, apical proteins, exported proteins and proteins localised to the parasitophorous vacuole. Furthermore, several leading vaccine candidates, and proteins with known roles in host-cell invasion, have extensive regions of disorder. Presentation of peptides by MHC molecules plays an important role in adaptive immune responses, and we show that IDP regions are predicted to contain relatively few MHC class I and II binding peptides owing to inherent differences in amino acid composition compared to structured domains. In contrast, linear B-cell epitopes were predicted to be enriched in IDPs. Tandem repeat regions and non-synonymous single nucleotide polymorphisms were found to be strongly associated with regions of disorder. In summary, immune responses against IDPs appear to have characteristics distinct from those against structured protein domains, with increased antibody recognition of linear epitopes but some constraints for MHC presentation and issues of polymorphisms. These findings have major implications for vaccine design, and understanding immunity to malaria.
Collapse
|
30
|
How Common Is Disorder? Occurrence of Disordered Residues in Four Domains of Life. Int J Mol Sci 2015; 16:19490-507. [PMID: 26295225 PMCID: PMC4581309 DOI: 10.3390/ijms160819490] [Citation(s) in RCA: 21] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/02/2015] [Revised: 08/02/2015] [Accepted: 08/03/2015] [Indexed: 11/18/2022] Open
Abstract
Disordered regions play important roles in protein adaptation to challenging environmental conditions. Flexible and disordered residues have the highest propensities to alter the protein packing. Therefore, identification of disordered/flexible regions is important for structural and functional analysis of proteins. We used the IsUnstruct program to predict the ordered or disordered status of residues in 122 proteomes, including 97 eukaryotic and 25 large bacterial proteomes larger than 2,500,000 residues. We found that bacterial and eukaryotic proteomes contain comparable fraction of disordered residues, which was 0.31 in the bacterial and 0.38 in the eukaryotic proteomes. Additional analysis of the total of 1540 bacterial proteomes of various sizes yielded a smaller fraction of disordered residues, which was only 0.26. Together, the results showed that the larger is the size of the proteome, the larger is the fraction of the disordered residues. A continuous dependence of the fraction of disordered residues on the size of the proteome is observed for four domains of life: Eukaryota, Bacteria, Archaea, and Viruses. Furthermore, our analysis of 122 proteomes showed that the fraction of disordered residues increased with increasing the length of homo-repeats for polar, charged, and small residues, and decreased for hydrophobic residues. The maximal fraction of disordered residues was obtained for proteins containing lysine and arginine homo-repeats. The minimal fraction was found in valine and leucine homo-repeats. For 15-residue long homo-repeats these values were 0.2 (for Val and Leu) and 0.7 (for Lys and Arg).
Collapse
|
31
|
Novel circular single-stranded DNA viruses identified in marine invertebrates reveal high sequence diversity and consistent predicted intrinsic disorder patterns within putative structural proteins. Front Microbiol 2015. [PMID: 26217327 PMCID: PMC4498126 DOI: 10.3389/fmicb.2015.00696] [Citation(s) in RCA: 60] [Impact Index Per Article: 6.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
Viral metagenomics has recently revealed the ubiquitous and diverse nature of single-stranded DNA (ssDNA) viruses that encode a conserved replication initiator protein (Rep) in the marine environment. Although eukaryotic circular Rep-encoding ssDNA (CRESS-DNA) viruses were originally thought to only infect plants and vertebrates, recent studies have identified these viruses in a number of invertebrates. To further explore CRESS-DNA viruses in the marine environment, this study surveyed CRESS-DNA viruses in various marine invertebrate species. A total of 27 novel CRESS-DNA genomes, with Reps that share less than 60.1% identity with previously reported viruses, were recovered from 21 invertebrate species, mainly crustaceans. Phylogenetic analysis based on the Rep revealed a novel clade of CRESS-DNA viruses that included approximately one third of the marine invertebrate associated viruses identified here and whose members may represent a novel family. Investigation of putative capsid proteins (Cap) encoded within the eukaryotic CRESS-DNA viral genomes from this study and those in GenBank demonstrated conserved patterns of predicted intrinsically disordered regions (IDRs), which can be used to complement similarity-based searches to identify divergent structural proteins within novel genomes. Overall, this study expands our knowledge of CRESS-DNA viruses associated with invertebrates and explores a new tool to evaluate divergent structural proteins encoded by these viruses.
Collapse
|
32
|
Abstract
In biology proteins from different structural classes interact across and within classes in ways that are optimized to achieve balanced functional outputs. The interactions between intrinsically disordered proteins (IDPs) and other proteins rely on changes in flexibility and this is seen as a strong determinant for their function. This has fostered the notion that IDP's bind with low affinity but high specificity. Here we have analyzed available detailed thermodynamic data for protein-protein interactions to put to the test if the thermodynamic profiles of IDP interactions differ from those of other protein-protein interactions. We find that ordered proteins and the disordered ones act as non-identical twins operating by similar principles but where the disordered proteins complexes are on average less stable by 2.5 kcal mol(-1).
Collapse
|
33
|
Estimating the Fitness Effects of New Mutations in the Wild Yeast Saccharomyces paradoxus. Genome Biol Evol 2015; 7:1887-95. [PMID: 26085542 PMCID: PMC4524479 DOI: 10.1093/gbe/evv112] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/27/2022] Open
Abstract
The nature of selection acting on a population is in large measure determined by the distribution of fitness effects of new mutations. In this study, we use DNA sequences from four closely related clades of Saccharomyces paradoxus and Saccharomyces cerevisiae to identify and polarize new mutations and estimate their fitness effects. By progressively restricting the analyses to narrower categories of sites, we further seek to characterize sites with predictable mutational effects, that is, unconditionally deleterious, neutral or beneficial. Consistent with previous studies on S. paradoxus, we have failed to find evidence for mutations with beneficial effects, even in regions that were divergent in two outgroup clades, perhaps a consequence of the relatively unchallenged, predominantly asexual and highly inbred lifestyle of this species. On the other hand, there is abundant evidence of deleterious mutations, varying in severity of effect from strongly deleterious to very mild, particularly in regions conserved in the outgroup taxa, indicating a history of persistent purifying selection. Narrowing the analysis down to individual amino acids reduces further the range of effects: for example, mutations changing cysteine are predicted to be nearly always strongly deleterious, whereas those changing arginine, serine, and tyrosine are expected to be nearly neutral. The proportion of mutations with deleterious effects for a particular amino acid is correlated with long-term stasis of that amino acid among highly divergent sequences from a variety of organisms, showing that functionality of sites tends to persist through the diversification of clades and that our findings are also relevant to longer evolutionary times and other taxa.
Collapse
|
34
|
Polymorphism Analysis Reveals Reduced Negative Selection and Elevated Rate of Insertions and Deletions in Intrinsically Disordered Protein Regions. Genome Biol Evol 2015; 7:1815-26. [PMID: 26047845 PMCID: PMC4494057 DOI: 10.1093/gbe/evv105] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Intrinsically disordered protein regions are abundant in eukaryotic proteins and lack stable tertiary structures and enzymatic functions. Previous studies of disordered region evolution based on interspecific alignments have revealed an increased propensity for indels and rapid rates of amino acid substitution. How disordered regions are maintained at high abundance in the proteome and across taxa, despite apparently weak evolutionary constraints, remains unclear. Here, we use single nucleotide and indel polymorphism data in yeast and human populations to survey the population variation within disordered regions. First, we show that single nucleotide polymorphisms in disordered regions are under weaker negative selection compared with more structured protein regions and have a higher proportion of neutral non-synonymous sites. We also confirm previous findings that nonframeshifting indels are much more abundant in disordered regions relative to structured regions. We find that the rate of nonframeshifting indel polymorphism in intrinsically disordered regions resembles that of noncoding DNA and pseudogenes, and that large indels segregate in disordered regions in the human population. Our survey of polymorphism confirms patterns of evolution in disordered regions inferred based on longer evolutionary comparisons.
Collapse
|
35
|
Abstract
The study of molecular evolution at the level of protein-coding genes often entails comparing large datasets of sequences to infer their evolutionary relationships. Despite the importance of a protein's structure and conformational dynamics to its function and thus its fitness, common phylogenetic methods embody minimal biophysical knowledge of proteins. To underscore the biophysical constraints on natural selection, we survey effects of protein mutations, highlighting the physical basis for marginal stability of natural globular proteins and how requirement for kinetic stability and avoidance of misfolding and misinteractions might have affected protein evolution. The biophysical underpinnings of these effects have been addressed by models with an explicit coarse-grained spatial representation of the polypeptide chain. Sequence-structure mappings based on such models are powerful conceptual tools that rationalize mutational robustness, evolvability, epistasis, promiscuous function performed by 'hidden' conformational states, resolution of adaptive conflicts and conformational switches in the evolution from one protein fold to another. Recently, protein biophysics has been applied to derive more accurate evolutionary accounts of sequence data. Methods have also been developed to exploit sequence-based evolutionary information to predict biophysical behaviours of proteins. The success of these approaches demonstrates a deep synergy between the fields of protein biophysics and protein evolution.
Collapse
|
36
|
Comparative laboratory evolution of ordered and disordered enzymes. J Biol Chem 2015; 290:9310-20. [PMID: 25697360 DOI: 10.1074/jbc.m115.638080] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2015] [Indexed: 11/06/2022] Open
Abstract
Intrinsically disordered proteins are ubiquitous in nature. To assess potential evolutionary advantages and disadvantages of structural disorder under controlled laboratory conditions, we directly compared the evolvability of weakly active ordered and disordered variants of dihydrofolate reductase by genetic selection. The circularly permuted Escherichia coli enzyme, which exists as a molten globule in the absence of ligands, and a well folded deletion mutant of the Bacillus stearothermophilus enzyme served as starting points. Both scaffolds evolved at similar rates and to similar extents, reaching near-native activity after three rounds of mutagenesis and selection. Surprisingly, however, the starting structural properties of the two scaffolds changed only marginally during optimization. Although the ordered and disordered proteins accumulated distinct sets of mutations, the changes introduced likely improved catalytic efficiency indirectly in both cases by bolstering the network of dynamic conformational fluctuations that productively couple into the reaction coordinate.
Collapse
|
37
|
Repeat-containing protein effectors of plant-associated organisms. FRONTIERS IN PLANT SCIENCE 2015; 6:872. [PMID: 26557126 PMCID: PMC4617103 DOI: 10.3389/fpls.2015.00872] [Citation(s) in RCA: 18] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/03/2015] [Accepted: 10/01/2015] [Indexed: 05/10/2023]
Abstract
Many plant-associated organisms, including microbes, nematodes, and insects, deliver effector proteins into the apoplast, vascular tissue, or cell cytoplasm of their prospective hosts. These effectors function to promote colonization, typically by altering host physiology or by modulating host immune responses. The same effectors however, can also trigger host immunity in the presence of cognate host immune receptor proteins, and thus prevent colonization. To circumvent effector-triggered immunity, or to further enhance host colonization, plant-associated organisms often rely on adaptive effector evolution. In recent years, it has become increasingly apparent that several effectors of plant-associated organisms are repeat-containing proteins (RCPs) that carry tandem or non-tandem arrays of an amino acid sequence or structural motif. In this review, we highlight the diverse roles that these repeat domains play in RCP effector function. We also draw attention to the potential role of these repeat domains in adaptive evolution with regards to RCP effector function and the evasion of effector-triggered immunity. The aim of this review is to increase the profile of RCP effectors from plant-associated organisms.
Collapse
|
38
|
Rapid evolution of virus sequences in intrinsically disordered protein regions. PLoS Pathog 2014; 10:e1004529. [PMID: 25502394 PMCID: PMC4263755 DOI: 10.1371/journal.ppat.1004529] [Citation(s) in RCA: 45] [Impact Index Per Article: 4.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/16/2014] [Accepted: 10/20/2014] [Indexed: 11/18/2022] Open
Abstract
Nodamura Virus (NoV) is a nodavirus originally isolated from insects that can replicate in a wide variety of hosts, including mammals. Because of their simplicity and ability to replicate in many diverse hosts, NoV, and the Nodaviridae in general, provide a unique window into the evolution of viruses and host-virus interactions. Here we show that the C-terminus of the viral polymerase exhibits extreme structural and evolutionary flexibility. Indeed, fewer than 10 positively charged residues from the 110 amino acid-long C-terminal region of protein A are required to support RNA1 replication. Strikingly, this region can be replaced by completely unrelated protein sequences, yet still produce a functional replicase. Structure predictions, as well as evolutionary and mutational analyses, indicate that the C-terminal region is structurally disordered and evolves faster than the rest of the viral proteome. Thus, the function of an intrinsically unstructured protein region can be independent of most of its primary sequence, conferring both functional robustness and sequence plasticity on the protein. Our results provide an experimental explanation for rapid evolution of unstructured regions, which enables an effective exploration of the sequence space, and likely function space, available to the virus. Proteins often contain regions with defined structures that enable their function. While important for maintaining the overall architecture of the protein, structural conservation adds constraints on the ability of the protein to mutate, and thus evolve. Viruses of eukaryotes, however, often encode for proteins with unstructured regions. As these regions are less constrained, they are more likely to accumulate mutations, which in turn can facilitate the appearance of novel functions during the evolution of the virus. Even though it has been known that such “disordered protein regions” have been particularly malleable in evolution, their functions and their ability to withstand extensive mutations have not been explored in detail. Here, we discovered that a disordered part of the Nodamura Virus polymerase is both required for replication of the viral genome, and extremely variable among different nodaviruses. We examined the tolerance of this protein region to mutations and found an unexpected ability to accommodate very diverse protein sequences. We propose that disordered protein regions can be a reservoir for evolutionary innovation that can play important roles in virus adaptation to new environments.
Collapse
|
39
|
Identification of intrinsically disordered regions in PTEN and delineation of its function via a network approach. Methods 2014; 77-78:69-74. [PMID: 25449897 DOI: 10.1016/j.ymeth.2014.10.005] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2014] [Revised: 10/01/2014] [Accepted: 10/06/2014] [Indexed: 12/14/2022] Open
Abstract
Intrinsically disordered proteins (IDPs) are proteins that lack stable higher order structures for the entire protein molecule or a significant portion of it. The discovery of IDPs evolved as an antithesis to the conventional structure-function paradigm wherein a higher order structure dictates protein function. Over the last decade, a number of proteins with functionally relevant unstructured regions have been discovered, which includes tumor suppressor PTEN. The protein domains that lack structure provide "hot-spots" for post-translational modifications (PTMs) and protein-protein interactions (PPIs), which facilitate their regulation and participation in multiple cellular processes. Consequently, dysregulation in IDPs contribute to aberrant cellular pathophysiology. Herein, we present PTEN and its translational isoform PTEN-L as a hybrid protein possessing ordered domain and intrinsically disordered C-terminal and an N-terminal tails. We review the role of intrinsic disorder in PTEN function and propose a methodology for the use of intrinsic disorder to study PTEN-regulated higher order protein-networks by associating basic principles of network biology to functional pathway analysis at the systems level.
Collapse
|
40
|
|
41
|
Positive selection and intragenic recombination contribute to high allelic diversity in effector genes of Mycosphaerella fijiensis, causal agent of the black leaf streak disease of banana. MOLECULAR PLANT PATHOLOGY 2014; 15:447-60. [PMID: 24245940 PMCID: PMC6638713 DOI: 10.1111/mpp.12104] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/18/2023]
Abstract
Previously, we have determined the nonhost-mediated recognition of the MfAvr4 and MfEcp2 effector proteins from the banana pathogen Mycosphaerella fijiensis in tomato, by the cognate Cf-4 and Cf-Ecp2 resistance proteins, respectively. These two resistance proteins could thus mediate resistance against M. fijiensis if genetically transformed into banana (Musa spp.). However, disease resistance controlled by single dominant genes can be overcome by mutated effector alleles, whose products are not recognized by the cognate resistance proteins. Here, we surveyed the allelic variation within the MfAvr4, MfEcp2, MfEcp2-2 and MfEcp2-3 effector genes of M. fijiensis in a global population of the pathogen, and assayed its impact on recognition by the tomato Cf-4 and Cf-Ecp2 resistance proteins, respectively. We identified a large number of polymorphisms that could reflect a co-evolutionary arms race between host and pathogen. The analysis of nucleotide substitution patterns suggests that both positive selection and intragenic recombination have shaped the evolution of M. fijiensis effectors. Clear differences in allelic diversity were observed between strains originating from South-East Asia relative to strains from other banana-producing continents, consistent with the hypothesis that M. fijiensis originated in the Asian-Pacific region. Furthermore, transient co-expression of the MfAvr4 effector alleles and the tomato Cf-4 resistance gene, as well as of MfEcp2, MfEcp2-2 and MfEcp2-3 and the putative Cf-Ecp2 resistance gene, indicated that effector alleles able to overcome these resistance genes are already present in natural populations of the pathogen, thus questioning the durability of resistance that can be provided by these genes in the field.
Collapse
|
42
|
Synonymous constraint elements show a tendency to encode intrinsically disordered protein segments. PLoS Comput Biol 2014; 10:e1003607. [PMID: 24809503 PMCID: PMC4014394 DOI: 10.1371/journal.pcbi.1003607] [Citation(s) in RCA: 20] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/14/2014] [Accepted: 03/17/2014] [Indexed: 01/22/2023] Open
Abstract
Synonymous constraint elements (SCEs) are protein-coding genomic regions with very low synonymous mutation rates believed to carry additional, overlapping functions. Thousands of such potentially multi-functional elements were recently discovered by analyzing the levels and patterns of evolutionary conservation in human coding exons. These elements provide a good opportunity to improve our understanding of how the redundant nature of the genetic code is exploited in the cell. Our premise is that the protein segments encoded by such elements might better comply with the increased functional demands if they are structurally less constrained (i.e. intrinsically disordered). To test this idea, we investigated the protein segments encoded by SCEs with computational tools to describe the underlying structural properties. In addition to SCEs, we examined the level of disorder, secondary structure, and sequence complexity of protein regions overlapping with experimentally validated splice regulatory sites. We show that multi-functional gene regions translate into protein segments that are significantly enriched in structural disorder and compositional bias, while they are depleted in secondary structure and domain annotations compared to reference segments of similar lengths. This tendency suggests that relaxed protein structural constraints provide an advantage when accommodating multiple overlapping functions in coding regions.
Collapse
|
43
|
Intrinsic disorder in plant proteins and phytopathogenic bacterial effectors. Chem Rev 2014; 114:6912-32. [PMID: 24697726 DOI: 10.1021/cr400488d] [Citation(s) in RCA: 33] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/11/2022]
|
44
|
Dynamic New World: Refining Our View of Protein Structure, Function and Evolution. Proteomes 2014; 2:128-153. [PMID: 28250374 PMCID: PMC5302727 DOI: 10.3390/proteomes2010128] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/17/2013] [Revised: 02/12/2014] [Accepted: 02/20/2014] [Indexed: 01/06/2023] Open
Abstract
Proteins are crucial to the functioning of all lifeforms. Traditional understanding posits that a single protein occupies a single structure ("fold"), which performs a single function. This view is radically challenged with the recognition that high structural dynamism-the capacity to be extra "floppy"-is more prevalent in functional proteins than previously assumed. As reviewed here, this dynamic take on proteins affects our understanding of protein "structure", function, and evolution, and even gives us a glimpse into protein origination. Specifically, this review will discuss historical developments concerning protein structure, and important new relationships between dynamism and aspects of protein sequence, structure, binding modes, binding promiscuity, evolvability, and origination. Along the way, suggestions will be provided for how key parts of textbook definitions-that so far have excluded membership to intrinsically disordered proteins (IDPs)-could be modified to accommodate our more dynamic understanding of proteins.
Collapse
|
45
|
Evolutionary dynamics of Polynucelotide phosphorylases. Mol Phylogenet Evol 2014; 73:77-86. [PMID: 24503483 DOI: 10.1016/j.ympev.2014.01.020] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/24/2013] [Revised: 01/09/2014] [Accepted: 01/21/2014] [Indexed: 11/17/2022]
Abstract
Polynucleotide phosphorylase (PNPase) is an evolutionarily conserved 3'→5' phosphate-dependent exoribonucease belonging to the PDX family of proteins. It consists of two catalytic RNase PH domains (PNP1 and PNP2), an α-helical domain and two RNA-binding domains. The PNP1 and PNP2 domains share substantial sequence and structural homology with RNase PH (RPH), which is another PDX family member found in all the three major kingdoms of life, suggesting that these three domains originated from a common ancestor. Phylogenetic analysis (based on the PNPase/RNase PH sequence information for 43 vertebrate taxa) shows that PNP2 and RPH are sister taxa which arose through duplication of the ancestral PNP1 domain. Also, all three domains (PNP1, PNP2 and RPH), along with the KH and S1 domains have undergone significant and directional sequence change, as determined by branch and site-specific dN/dS analyses. In general, codons that show dN/dS ratios that are significantly greater than 1.0 are outside the ordered regions (α-helices and β-sheets) of these protein domains. In addition, sites that have been selected for mutagenesis in these proteins lie embedded in regions where there is a preponderance of codons with dN/dS values that are not significantly different from 0.0. Overall, this report is an attempt to further our understanding of the evolutionary history of these three protein domains, and define the evolutionary events that led to their refinement in the vertebrate lineage leading to mammals.
Collapse
|
46
|
A comparative study of Whi5 and retinoblastoma proteins: from sequence and structure analysis to intracellular networks. Front Physiol 2014; 4:315. [PMID: 24478706 PMCID: PMC3897220 DOI: 10.3389/fphys.2013.00315] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/31/2013] [Accepted: 10/13/2013] [Indexed: 11/18/2022] Open
Abstract
Cell growth and proliferation require a complex series of tight-regulated and well-orchestrated events. Accordingly, proteins governing such events are evolutionary conserved, even among distant organisms. By contrast, it is more singular the case of “core functions” exerted by functional analogous proteins that are not homologous and do not share any kind of structural similarity. This is the case of proteins regulating the G1/S transition in higher eukaryotes–i.e., the retinoblastoma (Rb) tumor suppressor Rb—and budding yeast, i.e., Whi5. The interaction landscape of Rb and Whi5 is quite large, with more than one hundred proteins interacting either genetically or physically with each protein. The Whi5 interactome has been used to construct a concept map of Whi5 function and regulation. Comparison of physical and genetic interactors of Rb and Whi5 allows highlighting a significant core of conserved, common functionalities associated with the interactors indicating that structure and function of the network—rather than individual proteins—are conserved during evolution. A combined bioinformatics and biochemical approach has shown that the whole Whi5 protein is highly disordered, except for a small region containing the protein family signature. The comparison with Whi5 homologs from Saccharomycetales has prompted the hypothesis of a modular organization of structural disorder, with most evolutionary conserved regions alternating with highly variable ones. The finding of a consensus sequence points to the conservation of a specific phosphorylation rhythm along with two disordered sequence motifs, probably acting as phosphorylation-dependent seeds in Whi5 folding/unfolding. Thus, the widely disordered Whi5 appears to act as a hierarchical, “date hub” that has evolutionary assayed an original way of modular organization before being supplanted by the globular, multi-domain structured Rb, more suitable to cover the role of a “party hub”.
Collapse
|
47
|
Abstract
Protein intrinsic disorder, a widespread phenomenon characterized by a lack of stable three-dimensional structure, is thought to play an important role in protein function. In the last decade, dozens of computational methods for predicting intrinsic disorder from amino acid sequences have been developed. They are widely used by structural biologists not only for analyzing the biological function of intrinsic disorder but also for finding flexible regions that possibly hinder successful crystallization of the full-length protein. In this chapter, I introduce Prediction Of Order and Disorder by machine LEarning (POODLE), which is a series of programs accurately predicting intrinsic disorder. After giving the theoretical background for predicting intrinsic disorder, I give a detailed guide to using POODLE. I then also briefly introduce a case study where using POODLE for functional analyses of protein disorder led to a novel biological findings.
Collapse
|
48
|
Haloarcula marismortui archaellin genes as ecoparalogs. Extremophiles 2013; 18:341-9. [PMID: 24368632 DOI: 10.1007/s00792-013-0619-4] [Citation(s) in RCA: 15] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/06/2013] [Accepted: 12/05/2013] [Indexed: 01/11/2023]
Abstract
The genome of haloarchaeon Haloarcula marismortui contains two archaellin genes-flaA2 and flaB. Earlier we isolated and characterized two H. marismortui strains in that archaella consisting of FlaA2 archaellin (with a minor FlaB fraction) or of FlaB only. Both the FlaA2 and FlaB strains were motile and produced functional helical archaella. Thus, it may seem that the FlaA2 archaellin is redundant. In this study we investigated the biological roles of archaellin redundancy and demonstrated that FlaA2 archaellin is better adapted to more severe conditions of high temperature/low salinity, while FlaB has an advantage with increasing salinity. We used the thermodynamic data and bioinformatics sequence analysis to demonstrate that archaella formed by FlaA2 are more stable than those formed by FlaB. Our combined data indicate that the monomer FlaA2 archaellin is more flexible and leads to more compact and stable formation of filamentous structures. The difference in response to environmental stress indicates that FlaA2 and FlaB replace each other under different environmental conditions and can be considered as ecoparalogs.
Collapse
|
49
|
Abstract
BACKGROUND The glycolytic pathway is central to cellular energy production. Selection on individual enzymes within glycolysis, particularly phosphoglucose isomerase (Pgi), has been associated with metabolic performance in numerous organisms. Nonetheless, how whole energy-producing pathways evolve to allow organisms to thrive in different environments and adopt new lifestyles remains little explored. The Lanceocercata radiation of Australasian stick insects includes transitions from tropical to temperate climates, lowland to alpine habitats, and winged to wingless forms. This permits a broad investigation to determine which steps within glycolysis and what sites within enzymes are the targets of positive selection. To address these questions we obtained transcript sequences from seven core glycolysis enzymes, including two Pgi paralogues, from 29 Lanceocercata species. RESULTS Using maximum likelihood methods a signature of positive selection was inferred in two core glycolysis enzymes. Pgi and Glyceraldehyde 3-phosphate dehydrogenase (Gaphd) genes both encode enzymes linking glycolysis to the pentose phosphate pathway. Positive selection among Pgi paralogues and orthologues predominately targets amino acids with residues exposed to the protein's surface, where changes in physical properties may alter enzyme performance. CONCLUSION Our results suggest that, for Lancerocercata stick insects, adaptation to new stressful lifestyles requires a balance between maintaining cellular energy production, efficiently exploiting different energy storage pools and compensating for stress-induced oxidative damage.
Collapse
|
50
|
Origins of Myc proteins--using intrinsic protein disorder to trace distant relatives. PLoS One 2013; 8:e75057. [PMID: 24086436 PMCID: PMC3782479 DOI: 10.1371/journal.pone.0075057] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2013] [Accepted: 08/09/2013] [Indexed: 01/10/2023] Open
Abstract
Mammalian Myc proteins are important determinants of cell proliferation as well as the undifferentiated state of stem cells and their activity is frequently deregulated in cancer. Based mainly on conservation in the C-terminal DNA-binding and dimerization domain, Myc-like proteins have been reported in many simpler organisms within and outside the Metazoa but they have not been found in fungi or plants. Several important signature motifs defining mammalian Myc proteins are found in the N-terminal domain but the extent to which these are found in the Myc-like proteins from simpler organisms is not well established. The extent of N-terminal signature sequence conservation would give important insights about the evolution of Myc proteins and their current function in mammalian physiology and disease. In a systematic study of Myc-like proteins we show that N-terminal signature motifs are not readily detectable in individual Myc-like proteins from invertebrates but that weak similarities to Myc boxes 1 and 2 can be found in the N-termini of the simplest Metazoa as well as the unicellular choanoflagellate, Monosiga brevicollis, using multiple protein alignments. Phylogenetic support for the connections of these proteins to established Myc proteins is however poor. We show that the pattern of predicted protein disorder along the length of Myc proteins can be used as a complementary approach to making dendrograms of Myc proteins that aids the classification of Myc proteins. This suggests that the pattern of disorder within Myc proteins is more conserved through evolution than their amino acid sequence. In the disorder-based dendrograms the Myc-like proteins from simpler organisms, including M. brevicollis, are connected to established Myc proteins with a higher degree of certainty. Our results suggest that protein disorder based dendrograms may be of general significance for studying distant relationships between proteins, such as transcription factors, that have high levels of intrinsic disorder.
Collapse
|