1
|
Compound Dynamics and Combinatorial Patterns of Amino Acid Repeats Encode a System of Evolutionary and Developmental Markers. Genome Biol Evol 2020; 11:3159-3178. [PMID: 31589292 PMCID: PMC6839033 DOI: 10.1093/gbe/evz216] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Accepted: 09/27/2019] [Indexed: 01/05/2023] Open
Abstract
Homopolymeric amino acid repeats (AARs) like polyalanine (polyA) and polyglutamine (polyQ) in some developmental proteins (DPs) regulate certain aspects of organismal morphology and behavior, suggesting an evolutionary role for AARs as developmental "tuning knobs." It is still unclear, however, whether these are occasional protein-specific phenomena or hints at the existence of a whole AAR-based regulatory system in DPs. Using novel approaches to trace their functional and evolutionary history, we find quantitative evidence supporting a generalized, combinatorial role of AARs in developmental processes with evolutionary implications. We observe nonrandom AAR distributions and combinations in HOX and other DPs, as well as in their interactomes, defining elements of a proteome-wide combinatorial functional code whereby different AARs and their combinations appear preferentially in proteins involved in the development of specific organs/systems. Such functional associations can be either static or display detectable evolutionary dynamics. These findings suggest that progressive changes in AAR occurrence/combination, by altering embryonic development, may have contributed to taxonomic divergence, leaving detectable traces in the evolutionary history of proteomes. Consistent with this hypothesis, we find that the evolutionary trajectories of the 20 AARs in eukaryotic proteomes are highly interrelated and their individual or compound dynamics can sharply mark taxonomic boundaries, or display clock-like trends, carrying overall a strong phylogenetic signal. These findings provide quantitative evidence and an interpretive framework outlining a combinatorial system of AARs whose compound dynamics mark at the same time DP functions and evolutionary transitions.
Collapse
|
2
|
Didehydro-Cortistatin A Inhibits HIV-1 by Specifically Binding to the Unstructured Basic Region of Tat. mBio 2019; 10:mBio.02662-18. [PMID: 30723126 PMCID: PMC6368365 DOI: 10.1128/mbio.02662-18] [Citation(s) in RCA: 50] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/20/2022] Open
Abstract
The intrinsically disordered HIV-1 Tat protein binds the viral RNA transactivation response structure (TAR), which recruits transcriptional cofactors, amplifying viral mRNA expression. Limited Tat transactivation correlates with HIV-1 latency. Unfortunately, Tat inhibitors are not clinically available. The small molecule didehydro-cortistatin A (dCA) inhibits Tat, locking HIV-1 in persistent latency, blocking viral rebound. We generated chemical derivatives of dCA that rationalized molecular docking of dCA to an active and specific Tat conformer. These revealed the importance of the cycloheptene ring and the isoquinoline nitrogen's positioning in the interaction with specific residues of Tat's basic domain. These features are distinct from the ones required for inhibition of cyclin-dependent kinase 8 (CDK8), the only other known ligand of dCA. Besides, we demonstrated that dCA activity on HIV-1 transcription is independent of CDK8. The binding of dCA to Tat with nanomolar affinity alters the local protein environment, rendering Tat more resistant to proteolytic digestion. dCA thus locks a transient conformer of Tat, specifically blocking functions dependent of its basic domain, namely the Tat-TAR interaction; while proteins with similar basic patches are unaffected by dCA. Our results improve our knowledge of the mode of action of dCA and support structure-based design strategies targeting Tat, to help advance development of dCA, as well as novel Tat inhibitors.IMPORTANCE Tat activates virus production, and limited Tat transactivation correlates with HIV-1 latency. The Tat inhibitor dCA locks HIV in persistent latency. This drug class enables block-and-lock functional cure approaches, aimed at reducing residual viremia during therapy and limiting viral rebound. dCA may also have additional therapeutic benefits since Tat is also neurotoxic. Unfortunately, Tat inhibitors are not clinically available. We generated chemical derivatives and rationalized binding to an active and specific Tat conformer. dCA features required for Tat inhibition are distinct from features needed for inhibition of cyclin-dependent kinase 8 (CDK8), the only other known target of dCA. Furthermore, knockdown of CDK8 did not impact dCA's activity on HIV-1 transcription. Binding of dCA to Tat's basic domain altered the local protein environment and rendered Tat more resistant to proteolytic digestion. dCA locks a transient conformer of Tat, blocking functions dependent on its basic domain, namely its ability to amplify viral transcription. Our results define dCA's mode of action, support structure-based-design strategies targeting Tat, and provide valuable information for drug development around the dCA pharmacophore.
Collapse
|
3
|
Differential Occurrence of Interactions and Interaction Domains in Proteins Containing Homopolymeric Amino Acid Repeats. Front Genet 2015; 6:345. [PMID: 26734058 PMCID: PMC4683181 DOI: 10.3389/fgene.2015.00345] [Citation(s) in RCA: 17] [Impact Index Per Article: 1.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2015] [Accepted: 11/20/2015] [Indexed: 12/13/2022] Open
Abstract
Homopolymeric amino acids repeats (AARs), which are widespread in proteomes, have often been viewed simply as spacers between protein domains, or even as "junk" sequences with no obvious function but with a potential to cause harm upon expansion as in genetic diseases associated with polyglutamine or polyalanine expansions, including Huntington disease and cleidocranial dysplasia. A growing body of evidence indicates however that at least some AARs can form organized, functional protein structures, and can regulate protein function. In particular, certain AARs can mediate protein-protein interactions, either through homotypic AAR-AAR contacts or through heterotypic contacts with other protein domains. It is still unclear however, whether AARs may have a generalized, proteome-wide role in shaping protein-protein interaction networks. Therefore, we have undertaken here a bioinformatics screening of the human proteome and interactome in search of quantitative evidence of such a role. We first identified the sets of proteins that contain repeats of any one of the 20 amino acids, as well as control sets of proteins chosen at random in the proteome. We then analyzed the connectivity between the proteins of the AAR-containing protein sets and we compared it with that observed in the corresponding control networks. We find evidence for different degrees of connectivity in the different AAR-containing protein networks. Indeed, networks of proteins containing polyglutamine, polyglutamate, polyproline, and other AARs show significantly increased levels of connectivity, whereas networks containing polyleucine and other hydrophobic repeats show lower degrees of connectivity. Furthermore, we observed that numerous protein-protein, -nucleic acid, and -lipid interaction domains are significantly enriched in specific AAR protein groups. These findings support the notion of a generalized, combinatorial role of AARs, together with conventional protein interaction domains, in shaping the interaction networks of the human proteome, and define proteome-wide knowledge that may guide the informed biological exploration of the role of AARs in protein interactions.
Collapse
|
4
|
Homepeptide repeats: implications for protein structure, function and evolution. GENOMICS PROTEOMICS & BIOINFORMATICS 2012; 10:217-25. [PMID: 23084777 PMCID: PMC5054710 DOI: 10.1016/j.gpb.2012.04.001] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 11/18/2011] [Revised: 04/03/2012] [Accepted: 04/19/2012] [Indexed: 11/20/2022]
Abstract
Analysis of protein sequences from Mycobacterium tuberculosis H37Rv (Mtb H37Rv) was performed to identify homopeptide repeat-containing proteins (HRCPs). Functional annotation of the HRCPs showed that they are preferentially involved in cellular metabolism. Furthermore, these homopeptide repeats might play some specific roles in protein–protein interaction. Repeat length differences among Bacteria, Archaea and Eukaryotes were calculated in order to identify the conservation of the repeats in these divergent kingdoms. From the results, it was evident that these repeats have a higher degree of conservation in Bacteria and Archaea than in Eukaryotes. In addition, there seems to be a direct correlation between the repeat length difference and the degree of divergence between the species. Our study supports the hypothesis that the presence of homopeptide repeats influences the rate of evolution of the protein sequences in which they are embedded. Thus, homopeptide repeat may have structural, functional and evolutionary implications on proteins.
Collapse
|
5
|
The HIV-1 Rev response element: an RNA scaffold that directs the cooperative assembly of a homo-oligomeric ribonucleoprotein complex. RNA Biol 2012; 9:6-11. [PMID: 22258145 PMCID: PMC3342944 DOI: 10.4161/rna.9.1.18178] [Citation(s) in RCA: 56] [Impact Index Per Article: 4.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/10/2023] Open
Abstract
The HIV-1 Rev response element (RRE) is a ~350 nucleotide, highly structured, cis-acting RNA element essential for viral replication. It is located in the env coding region of the viral genome and is extremely well conserved across different HIV-1 isolates. It is present on all partially spliced and unspliced viral mRNA transcripts, and serves as an RNA framework onto which multiple molecules of the viral protein Rev assemble. The Rev-RRE oligomeric complex mediates the export of these messages from the nucleus to the cytoplasm, where they are translated to produce essential viral proteins and/or packaged as genomes for new virions.
Collapse
|
6
|
Single amino acid and trinucleotide repeats: function and evolution. ADVANCES IN EXPERIMENTAL MEDICINE AND BIOLOGY 2012; 769:26-40. [PMID: 23560303 DOI: 10.1007/978-1-4614-5434-2_3] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/12/2022]
Abstract
The most well known effect of single amino acid repeat expansion, beyond a certain threshold, is the development of a specific disease, depending on the protein in which the expansion has occurred. For example, the expansion of the glutamine repeat in huntingtin leads to the debilitating neurodegenerative disease, Huntington's disease. Similarly, there are a range of other disorders caused by trinucleotide repeat expansions encoding polyglutamine or polyalanine tracts. The age of onset of the polyglutamine-induced neurodegenerative diseases is usually negatively correlated with the length of expanded CAG/glutamine repeat. However, recent studies have given evidence that single amino acid repeats may also play critical roles in normal protein function and that changes in the length of single amino acid repeats is likely to play a beneficial role in evolution. This chapter will look at the prevalence, function and possible role single amino acid repeats have in evolution and other biological processes.
Collapse
|
7
|
Protein homorepeats sequences, structures, evolution, and functions. ADVANCES IN PROTEIN CHEMISTRY AND STRUCTURAL BIOLOGY 2011; 79:59-88. [PMID: 20621281 DOI: 10.1016/s1876-1623(10)79002-7] [Citation(s) in RCA: 49] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/25/2022]
Abstract
The vast majority of protein sequences are aperiodic; they do not have any strong bias in the amino acid composition, and they use a subtle mixture of all or most of the 20 amino acid residues to code a great number of various structures and functions. In this context, homorepeats, runs of a single amino acid residue, represent unusual, eye-catching motifs in proteins. Despite the sequence simplicity and relatively small size, the homorepeat runs have a strong potential for molecular interactions due to the excessively high local concentration of a certain physico-chemical property. Appearance of such runs within proteins may give them new structural and functional features. An increasing number of studies demonstrate the abundance of these motifs in proteins, their important roles in biological processes, and their link to a number of hereditary and age-related diseases. In this chapter, we summarize data on the distribution of homorepeats in proteomes and on their structural properties, evolution, and functions.
Collapse
|
8
|
A solution to limited genomic capacity: using adaptable binding surfaces to assemble the functional HIV Rev oligomer on RNA. Mol Cell 2008; 31:824-34. [PMID: 18922466 DOI: 10.1016/j.molcel.2008.07.016] [Citation(s) in RCA: 99] [Impact Index Per Article: 6.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/13/2008] [Revised: 06/10/2008] [Accepted: 07/01/2008] [Indexed: 11/30/2022]
Abstract
Many ribonucleoprotein (RNP) complexes assemble into large, organized structures in which protein subunits are positioned by interactions with RNA and other proteins. Here we demonstrate that HIV Rev, constrained in size by a limited viral genome, also forms an organized RNP by assembling a homo-oligomer on the Rev response element (RRE) RNA. Rev subunits bind cooperatively to discrete RNA sites using an oligomerization domain and an adaptable protein-RNA interface, forming a complex with 500-fold higher affinity than the tightest single interaction. High-affinity binding correlates strongly with RNA export activity. Rev utilizes different surfaces of its alpha-helical RNA-binding domain to recognize several low-affinity binding sites, including the well-characterized stem IIB site and an additional site in stem IA. We propose that adaptable RNA-binding surfaces allow the Rev oligomer to assemble economically into a discrete, stable RNP and provide a mechanistic role for Rev oligomerization during the HIV life cycle.
Collapse
|
9
|
Derivation of primary sequences and secondary structures of rev responsive element from HIV-1 infected mothers and infants following vertical transmission. Virology 2006; 359:201-11. [PMID: 17045321 DOI: 10.1016/j.virol.2006.09.003] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/29/2006] [Revised: 09/01/2006] [Accepted: 09/05/2006] [Indexed: 11/15/2022]
Abstract
We have characterized the primary RRE sequences of HIV-1, including in vivo genetic variation and functional motifs required for Rev-RRE interactions as well as evaluated the RNA secondary structures of RRE derived from five mother-infant pairs following vertical transmission. Multiple (157) RRE sequences derived from mother-infant pairs showed that primary nucleotide sequences of RRE were highly conserved with a low degree of viral heterogeneity following vertical transmission. We found that the RRE sequences from mothers and infants folded and retained all the essential stem-loop formation required for Rev-RRE interactions. More importantly, a primary 9-nucleotide (5'-CACTATGGG-3') RRE sequence in the stem-loop B that is required for optimal Rev recognition and must be presented as a stem-bulge-stem structure was highly conserved in most of the sequences. The domains required for RRE-host protein interactions were also conserved in most of the RRE sequences. Taken together, the primary RRE sequences in the context of secondary structures were maintained and the Rev-RRE interaction domains were conserved following vertical transmission, which is consistent with a crucial role of RRE in HIV-1 pathogenesis.
Collapse
|
10
|
The molecular biology of bovine immunodeficiency virus: a comparison with other lentiviruses. Anim Health Res Rev 2005; 5:125-43. [PMID: 15984320 DOI: 10.1079/ahr200496] [Citation(s) in RCA: 20] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/11/2022]
Abstract
Bovine immunodeficiency virus (BIV) was first isolated in 1969 from a cow, R-29, with a wasting syndrome. The virus isolated induced the formation of syncytia in cell cultures and was structurally similar to maedi-visna virus. Twenty years later, it was demonstrated that the bovine R-29 isolate was indeed a lentivirus with striking similarity to the human immunodeficiency virus. Like other lentiviruses, BIV has a complex genomic structure characterized by the presence of several regulatory/accessory genes that encode proteins, some of which are involved in the regulation of virus gene expression. This manuscript aims to review biological and, more particularly, molecular aspects of BIV, with emphasis on regulatory/accessory viral genes/proteins, in comparison with those of other lentiviruses.
Collapse
|
11
|
Abstract
Expansion of "low complex" repeats of amino acids such as glutamine (Poly-Q) is associated with protein misfolding and the development of degenerative diseases such as Huntington's disease. The mechanism by which such regions promote misfolding remains controversial, the function of many repeat-containing proteins (RCPs) remains obscure, and the role (if any) of repeat regions remains to be determined. Here, a Web-accessible database of RCPs is presented. The distribution and evolution of RCPs that contain homopeptide repeats tracts are considered, and the existence of functional patterns investigated. Generally, it is found that while polyamino acid repeats are extremely rare in prokaryotes, several eukaryote putative homologs of prokaryote RCP-involved in important housekeeping processes-retain the repetitive region, suggesting an ancient origin for certain repeats. Within eukarya, the most common uninterrupted amino acid repeats are glutamine, asparagines, and alanine. Interestingly, while poly-Q repeats are found in vertebrates and nonvertebrates, poly-N repeats are only common in more primitive nonvertebrate organisms, such as insects and nematodes. We have assigned function to eukaryote RCPs using Online Mendelian Inheritance in Man (OMIM), the Human Reference Protein Database (HRPD), FlyBase, and Wormpep. Prokaryote RCPs were annotated using BLASTp searches and Gene Ontology. These data reveal that the majority of RCPs are involved in processes that require the assembly of large, multiprotein complexes, such as transcription and signaling.
Collapse
|
12
|
Advanced modular self-inactivating lentiviral expression vectors for multigene interventions in mammalian cells and in vivo transduction. Nucleic Acids Res 2002; 30:e113. [PMID: 12409472 PMCID: PMC135834 DOI: 10.1093/nar/gnf112] [Citation(s) in RCA: 82] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/13/2022] Open
Abstract
In recent years, lentiviral expression systems have gained an unmatched reputation among the gene therapy community for their ability to deliver therapeutic transgenes into a wide variety of difficult-to-transfect/transduce target tissues (brain, hematopoietic system, liver, lung, retina) without eliciting significant humoral immune responses. We have cloned a construction kit-like self-inactivating lentiviral expression vector family which is compatible to state-of-the-art packaging and pseudotyping technologies and contains, besides essential cis-acting lentiviral sequences, (i) unparalleled polylinkers with up to 29 unique sites for restriction endonucleases, many of which recognize 8 bp motifs, (ii) strong promoters derived from the human cytomegalovirus immediate-early promoter (P(hCMV)) or the human elongation factor 1alpha (P(hEF1)(alpha)), (iii) P(hCMV-) or P(PGK-) (phosphoglycerate kinase promoter) driven G418 resistance markers or fluorescent protein-based expression tracers and (iv) tricistronic expression cassettes for coordinated expression of up to three transgenes. In addition, we have designed a size-optimized series of highly modular lentiviral expression vectors (pLenti Module) which contain, besides the extensive central polylinker, unique restriction sites flanking any of the 5'U3, R-U5-psi+-SD, cPPT-RRE-SA and 3'LTR(DeltaU3) modules or placed within the 5'U3 (-78 bp) and 3'LTR(DeltaU3) (8666 bp). pLentiModule enables straightforward cassette-type module swapping between lentiviral expression vector family members and facilitates the design of Tat-independent (replacement of 5'LTR by heterologous promoter elements), regulated and self-excisable proviruses (insertion of responsive operators or LoxP in the 3'LTR(DeltaU3) element). We have validated our lentiviral expression vectors by transduction of a variety of insect, chicken, murine and human cell lines as well as adult rat cardiomyocytes, rat hippocampal slices and chicken embryos. The novel multi-purpose construction kit-like vector series described here is compatible with itself as well as many other (non-viral) mammalian expression vectors for straightforward exchange of key components (e.g. promoters, LTRs, resistance genes) and will assist the gene therapy and tissue engineering communities in developing lentiviral expression vectors tailored for optimal treatment of prominent human diseases.
Collapse
|
13
|
An N-terminal arginine-rich cluster and a proline-alanine-threonine repeat region determine the cellular localization of the herpes simplex virus type 1 ICP34.5 protein and its ligand, protein phosphatase 1. J Biol Chem 2002; 277:11423-31. [PMID: 11788604 DOI: 10.1074/jbc.m111553200] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022] Open
Abstract
The ICP34.5 protein facilitates herpes simplex virus replication by binding and activating protein phosphatase 1 (PP1) by means of a very conserved C-terminal GADD34-like region. Natural variants of the ICP34.5 differing in the number of arginines in an Arg-rich cluster at the N terminus and the number of Pro-Ala-Thr repeats in the central bridge region of the protein were cloned as fusion proteins with a reporter peptide (c-Myc or hrGFP) at the C terminus. The natural variants were obtained from strains differing in passage history, tissue culture behavior, and neuroinvasive disease potential. In transfected cells, these variants localized to different subcellular compartments. The N-terminal Arg-rich cluster acted as a cellular localization signal for discrete regions of the nucleus and cytoplasm, but the ultimate location of ICP34.5 was determined by the number of Pro-Ala-Thr repeats in the central bridge region. PP1 colocalized with the ICP34.5 variant in cells expressing the ICP34.5. The ICP34.5-mediated, herpes simplex virus strain-dependent differences in the modulation of PP1 location and function may be responsible for the strain-associated differences in tissue culture behavior and virulence of the virus.
Collapse
|