1
|
Förster D, Idier J, Liberti L, Mucherino A, Lin JH, Malliavin TE. Low-resolution description of the conformational space for intrinsically disordered proteins. Sci Rep 2022; 12:19057. [PMID: 36352011 PMCID: PMC9646904 DOI: 10.1038/s41598-022-21648-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/01/2022] [Accepted: 09/29/2022] [Indexed: 11/11/2022] Open
Abstract
Intrinsically disordered proteins (IDP) are at the center of numerous biological processes, and attract consequently extreme interest in structural biology. Numerous approaches have been developed for generating sets of IDP conformations verifying a given set of experimental measurements. We propose here to perform a systematic enumeration of protein conformations, carried out using the TAiBP approach based on distance geometry. This enumeration was performed on two proteins, Sic1 and pSic1, corresponding to unphosphorylated and phosphorylated states of an IDP. The relative populations of the obtained conformations were then obtained by fitting SAXS curves as well as Ramachandran probability maps, the original finite mixture approach RamaMix being developed for this second task. The similarity between profiles of local gyration radii provides to a certain extent a converged view of the Sic1 and pSic1 conformational space. Profiles and populations are thus proposed for describing IDP conformations. Different variations of the resulting gyration radius between phosphorylated and unphosphorylated states are observed, depending on the set of enumerated conformations as well as on the methods used for obtaining the populations.
Collapse
Affiliation(s)
- Daniel Förster
- grid.112485.b0000 0001 0217 6921UMR7374 Interfaces, Confinement, Matériaux et Nanostructures, Université d’Orléans, Orléans, France
| | - Jérôme Idier
- grid.503212.70000 0000 9563 6044UMR6004 Laboratoire des Sciences du Numérique de Nantes, Nantes, France
| | - Leo Liberti
- grid.508893.fLIX UMR 7161 CNRS École Polytechnique, Institut Polytechnique de Paris, 91128 Palaiseau, France
| | - Antonio Mucherino
- grid.420225.30000 0001 2298 7270IRISA, University of Rennes 1, Rennes, France
| | - Jung-Hsin Lin
- grid.509455.8Biomedical Translation Research Center, Academia Sinica, Taipei, Taiwan
| | - Thérèse E. Malliavin
- grid.428999.70000 0001 2353 6535Institut Pasteur, Université Paris Cité, CNRS UMR3528, Unité de Bioinformatique Structurale, F-75015 Paris, France ,grid.29172.3f0000 0001 2194 6418Université de Lorraine, CNRS UMR7019, LPCT, F-54000 Nancy, France
| |
Collapse
|
2
|
Ha CHX, Lee NK, Rahman T, Hwang SS, Yam WK, Chee XW. Repurposing FDA-approved drugs as HIV-1 integrase inhibitors: an in silico investigation. J Biomol Struct Dyn 2022; 41:2146-2159. [PMID: 35067186 DOI: 10.1080/07391102.2022.2028677] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
The Human Immunodeficiency Virus (HIV) infection is a global pandemic that has claimed 33 million lives to-date. One of the most efficacious treatments for naïve or pretreated HIV patients is the HIV integrase strand transfer inhibitors (INSTIs). However, given that HIV treatment is life-long, the emergence of HIV strains resistant to INSTIs is an imminent challenge. In this work, we showed two best regression QSAR models that were constructed using a boosted Random Forest algorithm (r2 = 0.998, q210CV = 0.721, q2external_test = 0.754) and a boosted K* algorithm (r2 = 0.987, q210CV = 0.721, q2external_test = 0.758) to predict the pIC50 values of INSTIs. Subsequently, the regression QSAR models were deployed against the Drugbank database for drug repositioning. The top-ranked compounds were further evaluated for their target engagement activity using molecular docking studies and accelerated Molecular Dynamics simulation. Lastly, their potential as INSTIs were also evaluated from our literature search. Our study offers the first example of a large-scale regression QSAR modelling effort for discovering highly active INSTIs to combat HIV infection.Communicated by Ramaswamy H. Sarma.
Collapse
Affiliation(s)
- Christopher Heng Xuan Ha
- Faculty of Engineering, Computing and Science, Swinburne University of Technology, Sarawak, Malaysia
| | - Nung Kion Lee
- Faculty of Cognitive Sciences and Human Development, Universiti Malaysia Sarawak, Sarawak, Malaysia
| | - Taufiq Rahman
- Department of Pharmacology, University of Cambridge, Cambridge, United Kingdom
| | - Siaw San Hwang
- Faculty of Engineering, Computing and Science, Swinburne University of Technology, Sarawak, Malaysia
| | - Wai Keat Yam
- Centre for Bioinformatics, School of Data Sciences, Perdana University, Kuala Lumpur, Malaysia
| | - Xavier Wezen Chee
- Faculty of Engineering, Computing and Science, Swinburne University of Technology, Sarawak, Malaysia
| |
Collapse
|
3
|
Mahboubi-Rabbani M, Abbasi M, Hajimahdi Z, Zarghi A. HIV-1 Reverse Transcriptase/Integrase Dual Inhibitors: A Review of Recent Advances and Structure-activity Relationship Studies. IRANIAN JOURNAL OF PHARMACEUTICAL RESEARCH : IJPR 2021; 20:333-369. [PMID: 34567166 PMCID: PMC8457747 DOI: 10.22037/ijpr.2021.115446.15370] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
Abstract
The significant threat to humanity is HIV infection, and it is uncertain whether a definitive treatment or a safe HIV vaccine is. HIV-1 is continually evolving and resistant to commonly used HIV-resistant medications, presenting significant obstacles to HIV infection management. The drug resistance adds to the need for new anti-HIV drugs; it chooses ingenious approaches to fight the emerging virus. Highly Active Antiretroviral Therapy (HAART), a multi-target approach for specific therapies, has proved effective in AIDS treatment. Therefore, it is a dynamic system with high prescription tension, increased risk of medication reactions, and adverse effects, leading to poor compliance with patients. In the HIV-1 lifecycle, two critical enzymes with high structural and functional analogies are reverse transcriptase (RT) and integrase (IN), which can be interpreted as druggable targets for modern dual-purpose inhibitors. Designed multifunctional ligand (DML) is a new technique that recruited many targets to be achieved by one chemical individual. A single chemical entity that acts for multiple purposes can be much more successful than a complex multidrug program. The production of these multifunctional ligands as antiretroviral drugs is valued with the advantage that the viral-replication process may end in two or more phases. This analysis will discuss the RT-IN dual-inhibitory scaffolds' developments documented so far.
Collapse
Affiliation(s)
- Mohammad Mahboubi-Rabbani
- Department of Pharmaceutical Chemistry, School of Pharmacy, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Maryam Abbasi
- Department of Medicinal Chemistry, Faculty of Pharmacy, Hormozgan University of Medical Sciences, Bandar Abbas, Iran
| | - Zahra Hajimahdi
- Department of Pharmaceutical Chemistry, School of Pharmacy, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| | - Afshin Zarghi
- Department of Pharmaceutical Chemistry, School of Pharmacy, Shahid Beheshti University of Medical Sciences, Tehran, Iran
| |
Collapse
|
4
|
Malliavin TE. Tandem domain structure determination based on a systematic enumeration of conformations. Sci Rep 2021; 11:16925. [PMID: 34413388 PMCID: PMC8376923 DOI: 10.1038/s41598-021-96370-z] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/23/2021] [Accepted: 08/04/2021] [Indexed: 12/03/2022] Open
Abstract
Protein structure determination is undergoing a change of perspective due to the larger importance taken in biology by the disordered regions of biomolecules. In such cases, the convergence criterion is more difficult to set up and the size of the conformational space is a obstacle to exhaustive exploration. A pipeline is proposed here to exhaustively sample protein conformations using backbone angle limits obtained by nuclear magnetic resonance (NMR), and then to determine the populations of conformations. The pipeline is applied to a tandem domain of the protein whirlin. An original approach, derived from a reformulation of the Distance Geometry Problem is used to enumerate the conformations of the linker connecting the two domains. Specifically designed procedure then permit to assemble the domains to the linker conformations and to optimize the tandem domain conformations with respect to two sets of NMR measurements: residual dipolar couplings and paramagnetic resonance enhancements. The relative populations of optimized conformations are finally determined by fitting small angle X-ray scattering (SAXS) data. The most populated conformation of the tandem domain is a semi-closed one, fully closed and more extended conformations being in minority, in agreement with previous observations. The SAXS and NMR data show different influences on the determination of populations.
Collapse
Affiliation(s)
- Thérèse E Malliavin
- Unité de Bioinformatique Structurale, Institut Pasteur, UMR 3528, CNRS, Paris, France.
- Center of Bioinformatics, Biostatistics and Integrative Biology, Institut Pasteur, USR 3756, CNRS, Paris, France.
| |
Collapse
|
5
|
Malliavin TE, Mucherino A, Lavor C, Liberti L. Systematic Exploration of Protein Conformational Space Using a Distance Geometry Approach. J Chem Inf Model 2019; 59:4486-4503. [PMID: 31442036 DOI: 10.1021/acs.jcim.9b00215] [Citation(s) in RCA: 19] [Impact Index Per Article: 3.2] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
The optimization approaches classically used during the determination of protein structure encounter various difficulties, especially when the size of the conformational space is large. Indeed, in such a case, algorithmic convergence criteria are more difficult to set up. Moreover, the size of the search space makes it difficult to achieve a complete exploration. The interval branch-and-prune (iBP) approach, based on the reformulation of the distance geometry problem (DGP) provides a theoretical frame for the generation of protein conformations, by systematically sampling the conformational space. When an appropriate subset of interatomic distances is known exactly, this worst-case exponential-time algorithm is provably complete and fixed-parameter tractable. These guarantees, however, immediately disappear as distance measurement errors are introduced. Here we propose an improvement of this approach: threading-augmented interval branch-and-prune (TAiBP), where the combinatorial explosion of the original iBP approach arising from its exponential complexity is alleviated by partitioning the input instances into consecutive peptide fragments and by using self-organizing maps (SOMs) to obtain clusters of similar solutions. A validation of the TAiBP approach is presented here on a set of proteins of various sizes and structures. The calculation inputs are a uniform covalent geometry extracted from force field covalent terms, the backbone dihedral angles with error intervals, and a few long-range distances. For most of the proteins smaller than 50 residues and interval widths of 20°, the TAiBP approach yielded solutions with RMSD values smaller than 3 Å with respect to the initial protein conformation. The efficiency of the TAiBP approach for proteins larger than 50 residues will require the use of nonuniform covalent geometry and may have benefits from the recent development of residue-specific force-fields.
Collapse
Affiliation(s)
- Thérèse E Malliavin
- Unité de Bioinformatique Structurale, UMR 3528, CNRS, and Departement de Bioinformatique, Biostatistique et Biologie Intégrative, USR 3756, CNRS , Institut Pasteur , 75015 Paris , France
| | | | - Carlile Lavor
- Applied Math Department , IMECC-University of Campinas , Campinas , SP 13083-970 , Brazil
| | - Leo Liberti
- LIX CNRS, Ecole Polytechnique , Institut Polytechnique de Paris , Route de Saclay , 91128 Palaiseau , France
| |
Collapse
|
6
|
Du W, Zuo K, Sun X, Liu W, Yan X, Liang L, Wan H, Chen F, Hu J. An effective HIV-1 integrase inhibitor screening platform: Rationality validation of drug screening, conformational mobility and molecular recognition analysis for PFV integrase complex with viral DNA. J Mol Graph Model 2017; 78:96-109. [PMID: 29055187 DOI: 10.1016/j.jmgm.2017.10.002] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/24/2017] [Revised: 10/01/2017] [Accepted: 10/03/2017] [Indexed: 01/26/2023]
Abstract
As an important target for the development of novel anti-AIDS drugs, HIV-1 integrase (IN) has been widely concerned. However, the lack of a complete accurate crystal structure of HIV-1 IN greatly blocks the discovery of novel inhibitors. In this work, an effective HIV-1 IN inhibitor screening platform, namely PFV IN, was filtered from all species of INs. Next, the 40.8% similarity with HIV-1 IN, as well as the high efficiency of virtual screening and the good agreement between calculated binding free energies and experimental ones all proved PFV IN is a promising screening platform for HIV-1 IN inhibitors. Then, the molecular recognition mechanism of PFV IN by its substrate viral DNA and six naphthyridine derivatives (NRDs) inhibitors was investigated through molecular docking, molecular dynamics simulations and water-mediated interactions analyses. The functional partition of NRDs IN inhibitors could be divided into hydrophobic and hydrophilic ones, and the Mg2+ ions, water molecules and conserved DDE motif residues all interacted with the hydrophilic partition, while the bases in viral DNA and residues like Tyr212, Pro214 interacted with the hydrophobic one. Finally, the free energy landscape (FEL) and cluster analyses were performed to explore the molecular motion of PFV IN-DNA system. It is found that the association with NRDs inhibitors would obviously decrease the motion amplitude of PFV IN-DNA, which may be one of the most potential mechanisms of IN inhibitors. This work will provide a theoretical basis for the inhibitor design based on the structure of HIV-1 IN.
Collapse
Affiliation(s)
- Wenyi Du
- College of Pharmacy and Biological Engineering, Sichuan Industrial Institute of Antibiotics, Key Laboratory of Medicinal and Edible Plants Resources Development, Chengdu University, Chengdu, China
| | - Ke Zuo
- College of Pharmacy and Biological Engineering, Sichuan Industrial Institute of Antibiotics, Key Laboratory of Medicinal and Edible Plants Resources Development, Chengdu University, Chengdu, China
| | - Xin Sun
- College of Pharmacy and Biological Engineering, Sichuan Industrial Institute of Antibiotics, Key Laboratory of Medicinal and Edible Plants Resources Development, Chengdu University, Chengdu, China
| | - Wei Liu
- College of Pharmacy and Biological Engineering, Sichuan Industrial Institute of Antibiotics, Key Laboratory of Medicinal and Edible Plants Resources Development, Chengdu University, Chengdu, China
| | - Xiao Yan
- College of Pharmacy and Biological Engineering, Sichuan Industrial Institute of Antibiotics, Key Laboratory of Medicinal and Edible Plants Resources Development, Chengdu University, Chengdu, China
| | - Li Liang
- College of Pharmacy and Biological Engineering, Sichuan Industrial Institute of Antibiotics, Key Laboratory of Medicinal and Edible Plants Resources Development, Chengdu University, Chengdu, China
| | - Hua Wan
- College of Mathematics and Informatics, South China Agricultural University, Guangzhou, China
| | - Fengzheng Chen
- Department of Chemistry, Leshan Normal University, Leshan, China
| | - Jianping Hu
- College of Pharmacy and Biological Engineering, Sichuan Industrial Institute of Antibiotics, Key Laboratory of Medicinal and Edible Plants Resources Development, Chengdu University, Chengdu, China.
| |
Collapse
|
7
|
Ercan S. Docking and Molecular Dynamics Calculations of Some Previously Studied and newly Designed Ligands to Catalytic Core Domain of HIV-1 Integrase and an Investigation to Effects of Conformational Changes of Protein on Docking Results. JOURNAL OF THE TURKISH CHEMICAL SOCIETY, SECTION A: CHEMISTRY 2016. [DOI: 10.18596/jotcsa.287327] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022] Open
|
8
|
Su M, Tan J, Lin CY. Development of HIV-1 integrase inhibitors: recent molecular modeling perspectives. Drug Discov Today 2015. [PMID: 26220090 DOI: 10.1016/j.drudis.2015.07.012] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/20/2022]
Abstract
Of the three viral enzymes essential to HIV replication, HIV-1 integrase (IN) is gaining popularity as a target for the antiviral therapy of AIDS. Substantial work focusing on IN has been done over the past three decades, which has facilitated and led to the approval of three drugs. Here, we discuss in detail the development of IN inhibitors between January 2012 and May 2014, with a particular focus on molecular simulation. We highlight controversial aspects of computational drug design and refer to alternative practices where appropriate. The analysis of these computational approaches provides some useful clues to the possible future discovery of novel IN inhibitors.
Collapse
Affiliation(s)
- Min Su
- College of Life Science and Bioengineering, Beijing University of Technology, Beijing 100124, China
| | - Jianjun Tan
- College of Life Science and Bioengineering, Beijing University of Technology, Beijing 100124, China.
| | - Chun-Yuan Lin
- Department of Computer Science and Information Engineering, Chang Gung University, Taoyuan 33302, Taiwan.
| |
Collapse
|
9
|
Cortes-Ciriano I, Bouvier G, Nilges M, Maragliano L, Malliavin TE. Temperature Accelerated Molecular Dynamics with Soft-Ratcheting Criterion Orients Enhanced Sampling by Low-Resolution Information. J Chem Theory Comput 2015; 11:3446-54. [PMID: 26575778 DOI: 10.1021/acs.jctc.5b00153] [Citation(s) in RCA: 10] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/29/2023]
Abstract
Many proteins exhibit an equilibrium between multiple conformations, some of them being characterized only by low-resolution information. Visiting all conformations is a demanding task for computational techniques performing enhanced but unfocused exploration of collective variable (CV) space. Otherwise, pulling a structure toward a target condition biases the exploration in a way difficult to assess. To address this problem, we introduce here the soft-ratcheting temperature-accelerated molecular dynamics (sr-TAMD), where the exploration of CV space by TAMD is coupled to a soft-ratcheting algorithm that filters the evolving CV values according to a predefined criterion. Any low resolution or even qualitative information can be used to orient the exploration. We validate this technique by exploring the conformational space of the inactive state of the catalytic domain of the adenyl cyclase AC from Bordetella pertussis. The domain AC gets activated by association with calmodulin (CaM), and the available crystal structure shows that in the complex the protein has an elongated shape. High-resolution data are not available for the inactive, CaM-free protein state, but hydrodynamic measurements have shown that the inactive AC displays a more globular conformation. Here, using as CVs several geometric centers, we use sr-TAMD to enhance CV space sampling while filtering for CV values that correspond to centers moving close to each other, and we thus rapidly visit regions of conformational space that correspond to globular structures. The set of conformations sampled using sr-TAMD provides the most extensive description of the inactive state of AC up to now, consistent with available experimental information.
Collapse
Affiliation(s)
- Isidro Cortes-Ciriano
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Structural Biology and Chemistry Department, Institut Pasteur , 25-28, rue Dr. Roux, 75 724 Paris, France
| | - Guillaume Bouvier
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Structural Biology and Chemistry Department, Institut Pasteur , 25-28, rue Dr. Roux, 75 724 Paris, France
| | - Michael Nilges
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Structural Biology and Chemistry Department, Institut Pasteur , 25-28, rue Dr. Roux, 75 724 Paris, France
| | - Luca Maragliano
- Department of Neuroscience and Brain Technologies, Istituto Italiano di Tecnologia , Genoa, Italy
| | - Thérèse E Malliavin
- Unité de Bioinformatique Structurale, CNRS UMR 3528, Structural Biology and Chemistry Department, Institut Pasteur , 25-28, rue Dr. Roux, 75 724 Paris, France
| |
Collapse
|
10
|
Harigua-Souiai E, Cortes-Ciriano I, Desdouits N, Malliavin TE, Guizani I, Nilges M, Blondel A, Bouvier G. Identification of binding sites and favorable ligand binding moieties by virtual screening and self-organizing map analysis. BMC Bioinformatics 2015; 16:93. [PMID: 25888251 PMCID: PMC4381396 DOI: 10.1186/s12859-015-0518-z] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2014] [Accepted: 02/24/2015] [Indexed: 11/24/2022] Open
Abstract
Background Identifying druggable cavities on a protein surface is a crucial step in structure based drug design. The cavities have to present suitable size and shape, as well as appropriate chemical complementarity with ligands. Results We present a novel cavity prediction method that analyzes results of virtual screening of specific ligands or fragment libraries by means of Self-Organizing Maps. We demonstrate the method with two thoroughly studied proteins where it successfully identified their active sites (AS) and relevant secondary binding sites (BS). Moreover, known active ligands mapped the AS better than inactive ones. Interestingly, docking a naive fragment library brought even more insight. We then systematically applied the method to the 102 targets from the DUD-E database, where it showed a 90% identification rate of the AS among the first three consensual clusters of the SOM, and in 82% of the cases as the first one. Further analysis by chemical decomposition of the fragments improved BS prediction. Chemical substructures that are representative of the active ligands preferentially mapped in the AS. Conclusion The new approach provides valuable information both on relevant BSs and on chemical features promoting bioactivity. Electronic supplementary material The online version of this article (doi:10.1186/s12859-015-0518-z) contains supplementary material, which is available to authorized users.
Collapse
Affiliation(s)
- Emna Harigua-Souiai
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France. .,Laboratory of Molecular Epidemiology and Experimental Pathology - LR11IPT04, Institut Pasteur de Tunis, Université Tunis el Manar - Tunisia, 13, Place Pasteur, Tunis, 1002, Tunisia. .,University of Carthage, Faculty of sciences of Bizerte - Tunisia, Jarzouna, 7021, Tunisia.
| | - Isidro Cortes-Ciriano
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Nathan Desdouits
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Thérèse E Malliavin
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Ikram Guizani
- Laboratory of Molecular Epidemiology and Experimental Pathology - LR11IPT04, Institut Pasteur de Tunis, Université Tunis el Manar - Tunisia, 13, Place Pasteur, Tunis, 1002, Tunisia.
| | - Michael Nilges
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Arnaud Blondel
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| | - Guillaume Bouvier
- Institut Pasteur, Unité de Bioinformatique Structurale, CNRS UMR 3528, Département de Biologie Structurale et Chimie, 25, rue du Dr Roux, Paris, 75015, France.
| |
Collapse
|
11
|
Cassioli A, Bardiaux B, Bouvier G, Mucherino A, Alves R, Liberti L, Nilges M, Lavor C, Malliavin TE. An algorithm to enumerate all possible protein conformations verifying a set of distance constraints. BMC Bioinformatics 2015; 16:23. [PMID: 25627244 PMCID: PMC4384350 DOI: 10.1186/s12859-015-0451-1] [Citation(s) in RCA: 9] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/09/2014] [Accepted: 01/05/2015] [Indexed: 11/15/2023] Open
Abstract
BACKGROUND The determination of protein structures satisfying distance constraints is an important problem in structural biology. Whereas the most common method currently employed is simulated annealing, there have been other methods previously proposed in the literature. Most of them, however, are designed to find one solution only. RESULTS In order to explore exhaustively the feasible conformational space, we propose here an interval Branch-and-Prune algorithm (iBP) to solve the Distance Geometry Problem (DGP) associated to protein structure determination. This algorithm is based on a discretization of the problem obtained by recursively constructing a search space having the structure of a tree, and by verifying whether the generated atomic positions are feasible or not by making use of pruning devices. The pruning devices used here are directly related to features of protein conformations. CONCLUSIONS We described the new algorithm iBP to generate protein conformations satisfying distance constraints, that would potentially allows a systematic exploration of the conformational space. The algorithm iBP has been applied on three α-helical peptides.
Collapse
Affiliation(s)
| | - Benjamin Bardiaux
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | - Guillaume Bouvier
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | | | - Rafael Alves
- LIX, Ecole Polytechnique, Palaiseau, 91128, France.
| | - Leo Liberti
- LIX, Ecole Polytechnique, Palaiseau, 91128, France. .,IBM TJ Watson Research Center, NY Yorktown Heights, 10598, USA.
| | - Michael Nilges
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| | - Carlile Lavor
- University of Campinas (IMECC-UNICAMP), Campinas-SP, 13083-859, Brasil.
| | - Thérèse E Malliavin
- Institut Pasteur, Structural Bioinformatics Unit, 25, rue du Dr Roux, Paris, 75015, France. .,CNRS UMR3528, 25, rue du Dr Roux, Paris, 75015, France.
| |
Collapse
|
12
|
Bouvier G, Desdouits N, Ferber M, Blondel A, Nilges M. An automatic tool to analyze and cluster macromolecular conformations based on self-organizing maps. ACTA ACUST UNITED AC 2014; 31:1490-2. [PMID: 25543048 DOI: 10.1093/bioinformatics/btu849] [Citation(s) in RCA: 23] [Impact Index Per Article: 2.1] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/26/2014] [Accepted: 12/21/2014] [Indexed: 11/12/2022]
Abstract
MOTIVATION Sampling the conformational space of biological macromolecules generates large sets of data with considerable complexity. Data-mining techniques, such as clustering, can extract meaningful information. Among them, the self-organizing maps (SOMs) algorithm has shown great promise; in particular since its computation time rises only linearly with the size of the data set. Whereas SOMs are generally used with few neurons, we investigate here their behavior with large numbers of neurons. RESULTS We present here a python library implementing the full SOM analysis workflow. Large SOMs can readily be applied on heavy data sets. Coupled with visualization tools they have very interesting properties. Descriptors for each conformation of a trajectory are calculated and mapped onto a 3D landscape, the U-matrix, reporting the distance between neighboring neurons. To delineate clusters, we developed the flooding algorithm, which hierarchically identifies local basins of the U-matrix from the global minimum to the maximum. AVAILABILITY AND IMPLEMENTATION The python implementation of the SOM library is freely available on github: https://github.com/bougui505/SOM. CONTACT michael.nilges@pasteur.fr or guillaume.bouvier@pasteur.fr SUPPLEMENTARY INFORMATION Supplementary data are available at Bioinformatics online.
Collapse
Affiliation(s)
- Guillaume Bouvier
- Institut Pasteur, Unité de Bioinformatique Structurale; CNRS UMR 3528; Département de Biologie Structurale et Chimie; F-75015, Paris, France
| | - Nathan Desdouits
- Institut Pasteur, Unité de Bioinformatique Structurale; CNRS UMR 3528; Département de Biologie Structurale et Chimie; F-75015, Paris, France
| | - Mathias Ferber
- Institut Pasteur, Unité de Bioinformatique Structurale; CNRS UMR 3528; Département de Biologie Structurale et Chimie; F-75015, Paris, France
| | - Arnaud Blondel
- Institut Pasteur, Unité de Bioinformatique Structurale; CNRS UMR 3528; Département de Biologie Structurale et Chimie; F-75015, Paris, France
| | - Michael Nilges
- Institut Pasteur, Unité de Bioinformatique Structurale; CNRS UMR 3528; Département de Biologie Structurale et Chimie; F-75015, Paris, France
| |
Collapse
|