1
|
Hirai H, Sen Y, Tamura M, Ohta K. TOR inactivation triggers heterochromatin formation in rDNA during glucose starvation. Cell Rep 2023; 42:113320. [PMID: 37913773 DOI: 10.1016/j.celrep.2023.113320] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2023] [Revised: 08/29/2023] [Accepted: 10/05/2023] [Indexed: 11/03/2023] Open
Abstract
In response to environmental cues, such as nutrient starvation, living organisms modulate gene expression through mechanisms involving histone modifications. Specifically, nutrient depletion inactivates the TOR (target of rapamycin) pathway, leading to reduced expression of ribosomal genes. While these regulatory mechanisms are well elucidated in budding yeast Saccharomyces cerevisiae, their conservation across diverse organisms remains unclear. In this study, we demonstrate that fission yeast Schizosaccharomyces pombe cells repress ribosomal gene transcription through a different mechanism. TORC1, which accumulates in the rDNA region, dissociates upon starvation, resulting in enhanced methylation of H3K9 and heterochromatin formation, facilitated by dissociation of the stress-responsive transcription factor Atf1 and accumulation of the histone chaperone FACT. We propose that this mechanism might be adapted in mammals that possess Suv39H1 and HP1, which are absent in budding yeast.
Collapse
Affiliation(s)
- Hayato Hirai
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Komaba 3-8-1, Meguro-ku, Tokyo 153-8902, Japan.
| | - Yuki Sen
- Department of Integrated Sciences, College of Arts and Sciences, The University of Tokyo, Komaba 3-8-1, Meguro-ku, Tokyo 153-8902, Japan
| | - Miki Tamura
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Komaba 3-8-1, Meguro-ku, Tokyo 153-8902, Japan
| | - Kunihiro Ohta
- Department of Life Sciences, Graduate School of Arts and Sciences, The University of Tokyo, Komaba 3-8-1, Meguro-ku, Tokyo 153-8902, Japan; Universal Biology Institute, The University of Tokyo, Hongo 7-3-1, Bunkyo-ku, Tokyo 113-0033, Japan.
| |
Collapse
|
2
|
Schneider B, Černý J, Svozil D, Čech P, Gelly JC, de Brevern AG. Bioinformatic analysis of the protein/DNA interface. Nucleic Acids Res 2014; 42:3381-94. [PMID: 24335080 PMCID: PMC3950675 DOI: 10.1093/nar/gkt1273] [Citation(s) in RCA: 41] [Impact Index Per Article: 3.7] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2013] [Revised: 11/14/2013] [Accepted: 11/14/2013] [Indexed: 01/04/2023] Open
Abstract
To investigate the principles driving recognition between proteins and DNA, we analyzed more than thousand crystal structures of protein/DNA complexes. We classified protein and DNA conformations by structural alphabets, protein blocks [de Brevern, Etchebest and Hazout (2000) (Bayesian probabilistic approach for predicting backbone structures in terms of protein blocks. Prots. Struct. Funct. Genet., 41:271-287)] and dinucleotide conformers [Svozil, Kalina, Omelka and Schneider (2008) (DNA conformations and their sequence preferences. Nucleic Acids Res., 36:3690-3706)], respectively. Assembling the mutually interacting protein blocks and dinucleotide conformers into 'interaction matrices' revealed their correlations and conformer preferences at the interface relative to their occurrence outside the interface. The analyzed data demonstrated important differences between complexes of various types of proteins such as transcription factors and nucleases, distinct interaction patterns for the DNA minor groove relative to the major groove and phosphate and importance of water-mediated contacts. Water molecules mediate proportionally the largest number of contacts in the minor groove and form the largest proportion of contacts in complexes of transcription factors. The generally known induction of A-DNA forms by complexation was more accurately attributed to A-like and intermediate A/B conformers rare in naked DNA molecules.
Collapse
Affiliation(s)
- Bohdan Schneider
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Jiří Černý
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Daniel Svozil
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Petr Čech
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Jean-Christophe Gelly
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| | - Alexandre G. de Brevern
- Institute of Biotechnology AS CR, Videnska 1083, CZ-142 20 Prague, Czech Republic, Laboratory of Informatics and Chemistry, Faculty of Chemical Technology, Institute of Chemical Technology Prague, Technická 5, CZ-166 28 Prague, Czech Republic, INSERM, U665, DSIMB, F-75739 Paris, France, University of Paris Diderot, Sorbonne Paris Cité, UMR_S 665, F-75739 Paris, France, Institut National de la Transfusion Sanguine (INTS), F-75739 Paris, France and Laboratoire d’Excellence GR-Ex, F-75739 Paris, France
| |
Collapse
|
3
|
Coimbatore Narayanan B, Westbrook J, Ghosh S, Petrov AI, Sweeney B, Zirbel CL, Leontis NB, Berman HM. The Nucleic Acid Database: new features and capabilities. Nucleic Acids Res 2013; 42:D114-22. [PMID: 24185695 PMCID: PMC3964972 DOI: 10.1093/nar/gkt980] [Citation(s) in RCA: 150] [Impact Index Per Article: 12.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/28/2023] Open
Abstract
The Nucleic Acid Database (NDB) (http://ndbserver.rutgers.edu) is a web portal providing access to information about 3D nucleic acid structures and their complexes. In addition to primary data, the NDB contains derived geometric data, classifications of structures and motifs, standards for describing nucleic acid features, as well as tools and software for the analysis of nucleic acids. A variety of search capabilities are available, as are many different types of reports. This article describes the recent redesign of the NDB Web site with special emphasis on new RNA-derived data and annotations and their implementation and integration into the search capabilities.
Collapse
Affiliation(s)
- Buvaneswari Coimbatore Narayanan
- Department of Chemistry and Chemical Biology, Center for Integrative Proteomics Research, Rutgers, the State University of New Jersey, 174 Frelinghuysen Road, Piscataway, NJ 08854-8076, USA, Department of Chemistry and Center for Biomolecular Sciences, Bowling Green State University, Bowling Green, OH 43403, USA and Department of Mathematics and Statistics, Bowling Green State University, Bowling Green, OH 43403, USA
| | | | | | | | | | | | | | | |
Collapse
|
4
|
Abdin MZ, Kiran U, Alam A. Analysis of osmotin, a PR protein as metabolic modulator in plants. Bioinformation 2011; 5:336-40. [PMID: 21383921 PMCID: PMC3046038 DOI: 10.6026/97320630005336] [Citation(s) in RCA: 25] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/01/2010] [Accepted: 12/04/2010] [Indexed: 11/23/2022] Open
Abstract
Osmotin is an abundant cationic multifunctional protein discovered in cells of tobacco (Nicotiana tabacum L. var Wisconsin 38) adapted to an environment of low osmotic potential. Beside its role as osmoregulator, it provides plants protection from pathogens, hence also placed in the PRP family of proteins. The osmotin induced proline accumulation has been reported to confer tolerance against both biotic and abiotic stresses in plants including transgenic tomato and strawberry overexpressing osmotin gene. The exact mechanism of induction of proline by osmotin is however, not known till date. These observations have led us to hypothesize that osmotin could be regulating these plant responses through its involvement either as transcription factor, cell signal pathway modulator or both in plants. We have therefore, undertaken the present investigation to analyze the osmotin protein as transcription factor using bioinformatics tools. The results of available online DNA binding motif search programs revealed that osmotin does not contain DNAbinding motifs. The alignment results of osmotin protein with the protein sequence from DATF showed the homology in the range of 0-20%, suggesting that it might not contain a DNA binding motif. Further to find unique DNA-binding domain, the superimposition of osmotin 3D structure on modeled Arabidopsis transcription factors using Chimera also suggested absence of the same. However, evidence implicating osmotin in cell signaling were found during the study. With these results, we therefore, concluded that osmotin is not a transcription factor, but regulating plant responses to biotic and abiotic stresses through cell signaling.
Collapse
Affiliation(s)
- Malik Zainul Abdin
- Department of Biotechnology, Faculty of Science, Jamia Hamdard, New Delhi-110062, India
- Malik Zainul Abdin: Tel: +91-11-26059688, Extn: 5583
| | - Usha Kiran
- Faculty of Engineering and Interdisciplinary Sciences, Jamia Hamdard, New Delhi-110062, Indi
| | - Afshar Alam
- Department of Computer Science, Jamia Hamdard, New Delhi-110062, India
| |
Collapse
|
5
|
Chen X, Wang M, Zhang H. The use of classification trees for bioinformatics. WILEY INTERDISCIPLINARY REVIEWS. DATA MINING AND KNOWLEDGE DISCOVERY 2011; 1:55-63. [PMID: 22523608 PMCID: PMC3329156 DOI: 10.1002/widm.14] [Citation(s) in RCA: 43] [Impact Index Per Article: 3.1] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 05/27/2023]
Abstract
Classification trees are non-parametric statistical learning methods that incorporate feature selection and interactions, possess intuitive interpretability, are efficient, and have high prediction accuracy when used in ensembles. This paper provides a brief introduction to the classification tree-based methods, a review of the recent developments, and a survey of the applications in bioinformatics and statistical genetics.
Collapse
|
6
|
Geurts P, Irrthum A, Wehenkel L. Supervised learning with decision tree-based methods in computational and systems biology. MOLECULAR BIOSYSTEMS 2009; 5:1593-605. [PMID: 20023720 DOI: 10.1039/b907946g] [Citation(s) in RCA: 96] [Impact Index Per Article: 6.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
At the intersection between artificial intelligence and statistics, supervised learning allows algorithms to automatically build predictive models from just observations of a system. During the last twenty years, supervised learning has been a tool of choice to analyze the always increasing and complexifying data generated in the context of molecular biology, with successful applications in genome annotation, function prediction, or biomarker discovery. Among supervised learning methods, decision tree-based methods stand out as non parametric methods that have the unique feature of combining interpretability, efficiency, and, when used in ensembles of trees, excellent accuracy. The goal of this paper is to provide an accessible and comprehensive introduction to this class of methods. The first part of the review is devoted to an intuitive but complete description of decision tree-based methods and a discussion of their strengths and limitations with respect to other supervised learning methods. The second part of the review provides a survey of their applications in the context of computational and systems biology.
Collapse
Affiliation(s)
- Pierre Geurts
- Department of EE and CS & GIGA-Research, University of Liège, Belgium.
| | | | | |
Collapse
|
7
|
Xiong W, Li T, Chen K, Tang K. Local combinational variables: an approach used in DNA-binding helix-turn-helix motif prediction with sequence information. Nucleic Acids Res 2009; 37:5632-40. [PMID: 19651875 PMCID: PMC2761287 DOI: 10.1093/nar/gkp628] [Citation(s) in RCA: 22] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/28/2009] [Revised: 07/14/2009] [Accepted: 07/14/2009] [Indexed: 11/24/2022] Open
Abstract
Sequence-based approach for motif prediction is of great interest and remains a challenge. In this work, we develop a local combinational variable approach for sequence-based helix-turn-helix (HTH) motif prediction. First we choose a sequence data set for 88 proteins of 22 amino acids in length to launch an optimized traversal for extracting local combinational segments (LCS) from the data set. Then after LCS refinement, local combinational variables (LCV) are generated to construct prediction models for HTH motifs. Prediction ability of LCV sets at different thresholds is calculated to settle a moderate threshold. The large data set we used comprises 13 HTH families, with 17 455 sequences in total. Our approach predicts HTH motifs more precisely using only primary protein sequence information, with 93.29% accuracy, 93.93% sensitivity and 92.66% specificity. Prediction results of newly reported HTH-containing proteins compared with other prediction web service presents a good prediction model derived from the LCV approach. Comparisons with profile-HMM models from the Pfam protein families database show that the LCV approach maintains a good balance while dealing with HTH-containing proteins and non-HTH proteins at the same time. The LCV approach is to some extent a complementary to the profile-HMM models for its better identification of false-positive data. Furthermore, genome-wide predictions detect new HTH proteins in both Homo sapiens and Escherichia coli organisms, which enlarge applications of the LCV approach. Software for mining LCVs from sequence data set can be obtained from anonymous ftp site ftp://cheminfo.tongji.edu.cn/LCV/freely.
Collapse
Affiliation(s)
| | - Tonghua Li
- Department of Chemistry, Tongji University, Shanghai, 200092, China
| | | | | |
Collapse
|
8
|
Marathe A, Karandur D, Bansal M. Small local variations in B-form DNA lead to a large variety of global geometries which can accommodate most DNA-binding protein motifs. BMC STRUCTURAL BIOLOGY 2009; 9:24. [PMID: 19393049 PMCID: PMC2687451 DOI: 10.1186/1472-6807-9-24] [Citation(s) in RCA: 26] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 12/05/2008] [Accepted: 04/24/2009] [Indexed: 01/01/2023]
Abstract
BACKGROUND An important question of biological relevance is the polymorphism of the double-helical DNA structure in its free form, and the changes that it undergoes upon protein-binding. We have analysed a database of free DNA crystal structures to assess the inherent variability of the free DNA structure and have compared it with a database of protein-bound DNA crystal structures to ascertain the protein-induced variations. RESULTS Most of the dinucleotide steps in free DNA display high flexibility, assuming different conformations in a sequence-dependent fashion. With the exception of the AA/TT and GA/TC steps, which are 'A-phobic', and the GG/CC step, which is 'A-philic', the dinucleotide steps show no preference for A or B forms of DNA. Protein-bound DNA adopts the B-conformation most often. However, in certain cases, protein-binding causes the DNA backbone to take up energetically unfavourable conformations. At the gross structural level, several protein-bound DNA duplexes are observed to assume a curved conformation in the absence of any large distortions, indicating that a series of normal structural parameters at the dinucleotide and trinucleotide level, similar to the ones in free B-DNA, can give rise to curvature at the overall level. CONCLUSION The results illustrate that the free DNA molecule, even in the crystalline state, samples a large amount of conformational space, encompassing both the A and the B-forms, in the absence of any large ligands. A-form as well as some non-A, non-B, distorted geometries are observed for a small number of dinucleotide steps in DNA structures bound to the proteins belonging to a few specific families. However, for most of the bound DNA structures, across a wide variety of protein families, the average step parameters for various dinucleotide sequences as well as backbone torsion angles are observed to be quite close to the free 'B-like' DNA oligomer values, highlighting the flexibility and biological significance of this structural form.
Collapse
Affiliation(s)
- Arvind Marathe
- Molecular Biophysics Unit, Indian Institute of Science, Bangalore, India.
| | | | | |
Collapse
|
9
|
Liu ZP, Wu LY, Wang Y, Zhang XS, Chen L. Bridging protein local structures and protein functions. Amino Acids 2008; 35:627-50. [PMID: 18421562 PMCID: PMC7088341 DOI: 10.1007/s00726-008-0088-8] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2008] [Accepted: 03/10/2008] [Indexed: 12/11/2022]
Abstract
One of the major goals of molecular and evolutionary biology is to understand the functions of proteins by extracting functional information from protein sequences, structures and interactions. In this review, we summarize the repertoire of methods currently being applied and report recent progress in the field of in silico annotation of protein function based on the accumulation of vast amounts of sequence and structure data. In particular, we emphasize the newly developed structure-based methods, which are able to identify locally structural motifs and reveal their relationship with protein functions. These methods include computational tools to identify the structural motifs and reveal the strong relationship between these pre-computed local structures and protein functions. We also discuss remaining problems and possible directions for this exciting and challenging area.
Collapse
Affiliation(s)
- Zhi-Ping Liu
- Academy of Mathematics and Systems Science, Chinese Academy of Sciences, 100080, Beijing, China
| | | | | | | | | |
Collapse
|
10
|
König I, Malley J, Pajevic S, Weimar C, Diener HC, Ziegler A. Patient-centered yes/no prognosis using learning machines. INT J DATA MIN BIOIN 2008; 2:289-341. [PMID: 19216340 PMCID: PMC2754835 DOI: 10.1504/ijdmb.2008.022149] [Citation(s) in RCA: 24] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
In the last 15 years several machine learning approaches have been developed for classification and regression. In an intuitive manner we introduce the main ideas of classification and regression trees, support vector machines, bagging, boosting and random forests. We discuss differences in the use of machine learning in the biomedical community and the computer sciences. We propose methods for comparing machines on a sound statistical basis. Data from the German Stroke Study Collaboration is used for illustration. We compare the results from learning machines to those obtained by a published logistic regression and discuss similarities and differences.
Collapse
Affiliation(s)
- I.R. König
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Ratzeburger Allee 160, 23538 Lübeck, Germany
| | - J.D. Malley
- Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
| | - S. Pajevic
- Center for Information Technology, National Institutes of Health, Bethesda, MD, USA
| | - C. Weimar
- Klinik und Poliklinik für Neurologie, Universität Duisburg-Essen, Germany
| | - H-C. Diener
- Klinik und Poliklinik für Neurologie, Universität Duisburg-Essen, Germany
| | - A. Ziegler
- Institut für Medizinische Biometrie und Statistik, Universität zu Lübeck, Ratzeburger Allee 160, 23538 Lübeck, Germany, E-mail:
| |
Collapse
|
11
|
Chen YC, Wu CY, Lim C. Predicting DNA-binding amino acid residues from electrostatic stabilization upon mutation to Asp/Glu and evolutionary conservation. Proteins 2007; 67:671-80. [PMID: 17340633 DOI: 10.1002/prot.21366] [Citation(s) in RCA: 18] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Abstract
Binding of polyanionic DNA depends on the cluster of electropositive atoms in the binding site of a DNA-binding protein. Such a cluster of electropositive protein atoms would be electrostatically unfavorable without stabilizing interactions from the respective electronegative DNA atoms and would likely be evolutionary conserved due to its critical biological role. Consequently, our strategy for predicting DNA-binding residues is based on detecting a cluster of evolutionary conserved surface residues that are electrostatically stabilized upon mutation to negatively charged Asp/Glu residues. The method requires as input the protein structure and sufficient sequence homologs to define each residue's relative conservation, and it yields as output experimentally testable residues that are predicted to bind DNA. By incorporating characteristic DNA-binding site features (i.e., electrostatic strain and amino acid conservation), the new method yields a prediction accuracy of 83%, which is much higher than methods based on only electrostatic strain (57%) or conservation alone (50%). It is also less sensitive to protein conformational changes upon DNA binding than methods that mainly depend on the 3D protein structure.
Collapse
Affiliation(s)
- Yao Chi Chen
- Department of Chemistry, National Tsing Hua University, Hsinchu 300, Taiwan
| | | | | |
Collapse
|
12
|
Cukier RI, Morillo M. A targeted reweighting method for accelerating the exploration of high-dimensional configuration space. J Chem Phys 2005; 123:234908. [PMID: 16392950 DOI: 10.1063/1.2137704] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/14/2022] Open
Abstract
Time scales available to biomolecular simulations are limited by barriers among states in a high-dimensional configuration space. If equilibrium averages are to be computed, methods that accelerate barrier passage can be carried out by non-Boltzmann sampling. Barriers can be reduced by modifying the potential-energy function and running dynamics on the modified surface. The Boltzmann average can be restored by reweighting each point along the trajectory. We introduce a targeted reweighting scheme where some barriers are reduced, while others are not modified. If only equilibrium properties are desired, trajectories in configuration space can be generated by Langevin dynamics. Once past a transient time, these trajectories guarantee equilibrium sampling when reweighted. A relatively high-order stochastic integration method can be used to generate trajectories. The targeted reweighting scheme is illustrated by a series of double-well models with varying degrees of freedom and shown to be a very efficient method to provide the correct equilibrium distributions, in comparison with analytic results. The scheme is applied to a protein model consisting of a chain of connected beads characterized by dihedral angles and the van der Waals interactions among the beads. We investigate the sampling of configuration space for a model of a helix-turn-helix motif. The targeted reweighting is found to be essential to permit the original all-helical conformation to bend and generate turn structures while still maintaining the alpha-helical segments.
Collapse
Affiliation(s)
- R I Cukier
- Department of Chemistry, Michigan State University, East Lansing, Michigan 48824, USA.
| | | |
Collapse
|
13
|
McLaughlin WA, Kulp DW, de la Cruz J, Lu XJ, Lawson CL, Berman HM. A structure-based method for identifying DNA-binding proteins and their sites of DNA-interaction. ACTA ACUST UNITED AC 2005; 5:255-65. [PMID: 15704013 DOI: 10.1007/s10969-005-4902-1] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2004] [Accepted: 11/17/2004] [Indexed: 01/11/2023]
Abstract
A classification model of a DNA-binding protein chain was created based on identification of alpha helices within the chain likely to bind to DNA. Using the model, all chains in the Protein Data Bank were classified. For many of the chains classified with high confidence, previous documentation for DNA-binding was found, yet no sequence homology to the structures used to train the model was detected. The result indicates that the chain model can be used to supplement sequence based methods for annotating the function of DNA-binding. Four new candidates for DNA-binding were found, including two structures solved through structural genomics efforts. For each of the candidate structures, possible sites of DNA-binding are indicated by listing the residue ranges of alpha helices likely to interact with DNA.
Collapse
Affiliation(s)
- William A McLaughlin
- Department of Chemistry and Chemical Biology, Rutgers-The State University of New Jersey, 610 Taylor Road, Piscataway, NJ 08854-8087, USA
| | | | | | | | | | | |
Collapse
|
14
|
Pellegrini-Calace M, Thornton JM. Detecting DNA-binding helix-turn-helix structural motifs using sequence and structure information. Nucleic Acids Res 2005; 33:2129-40. [PMID: 15831786 PMCID: PMC1079965 DOI: 10.1093/nar/gki349] [Citation(s) in RCA: 19] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/18/2022] Open
Abstract
In this work, we analyse the potential for using structural knowledge to improve the detection of the DNA-binding helix–turn–helix (HTH) motif from sequence. Starting from a set of DNA-binding protein structures that include a functional HTH motif and have no apparent sequence similarity to each other, two different libraries of hidden Markov models (HMMs) were built. One library included sequence models of whole DNA-binding domains, which incorporate the HTH motif, the second library included shorter models of ‘partial’ domains, representing only the fraction of the domain that corresponds to the functionally relevant HTH motif itself. The libraries were scanned against a dataset of protein sequences, some containing the HTH motifs, others not. HMM predictions were compared with the results obtained from a previously published structure-based method and subsequently combined with it. The combined method proved more effective than either of the single-featured approaches, showing that information carried by motif sequences and motif structures are to some extent complementary and can successfully be used together for the detection of DNA-binding HTHs in proteins of unknown function.
Collapse
|
15
|
Abstract
An ability to assign protein function from protein structure is important for structural genomics consortia. The complex relationship between protein fold and function highlights the necessity of looking beyond the global fold of a protein to specific functional sites. Many computational methods have been developed that address this issue. These include evolutionary trace methods, methods that involve the calculation and assessment of maximal superpositions, methods based on graph theory, and methods that apply machine learning techniques. Such function prediction techniques have been applied to the identification of enzyme catalytic triads and DNA-binding motifs.
Collapse
Affiliation(s)
- Susan Jones
- European Bioinformatics Institute, Wellcome Trust Genome Campus, Hinxton, Cambridge, CB10 1SD, UK.
| | | |
Collapse
|