1
|
Steinbeck C, Koepler O, Bach F, Herres-Pawlis S, Jung N, Liermann J, Neumann S, Razum M, Baldauf C, Biedermann F, Bocklitz T, Boehm F, Broda F, Czodrowski P, Engel T, Hicks M, Kast S, Kettner C, Koch W, Lanza G, Link A, Mata R, Nagel W, Porzel A, Schlörer N, Schulze T, Weinig HG, Wenzel W, Wessjohann L, Wulle S. NFDI4Chem - Towards a National Research Data Infrastructure for Chemistry in Germany. RESEARCH IDEAS AND OUTCOMES 2020. [DOI: 10.3897/rio.6.e55852] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/06/2023] Open
Abstract
The vision of NFDI4Chem is the digitalisation of all key steps in chemical research to support scientists in their efforts to collect, store, process, analyse, disclose and re-use research data. Measures to promote Open Science and Research Data Management (RDM) in agreement with the FAIR data principles are fundamental aims of NFDI4Chem to serve the chemistry community with a holistic concept for access to research data. To this end, the overarching objective is the development and maintenance of a national research data infrastructure for the research domain of chemistry in Germany, and to enable innovative and easy to use services and novel scientific approaches based on re-use of research data. NFDI4Chem intends to represent all disciplines of chemistry in academia. We aim to collaborate closely with thematically related consortia. In the initial phase, NFDI4Chem focuses on data related to molecules and reactions including data for their experimental and theoretical characterisation.
This overarching goal is achieved by working towards a number of key objectives:
Key Objective 1: Establish a virtual environment of federated repositories for storing, disclosing, searching and re-using research data across distributed data sources. Connect existing data repositories and, based on a requirements analysis, establish domain-specific research data repositories for the national research community, and link them to international repositories.
Key Objective 2: Initiate international community processes to establish minimum information (MI) standards for data and machine-readable metadata as well as open data standards in key areas of chemistry. Identify and recommend open data standards in key areas of chemistry, in order to support the FAIR principles for research data. Finally, develop standards, if there is a lack.
Key Objective 3: Foster cultural and digital change towards Smart Laboratory Environments by promoting the use of digital tools in all stages of research and promote subsequent Research Data Management (RDM) at all levels of academia, beginning in undergraduate studies curricula.
Key Objective 4: Engage with the chemistry community in Germany through a wide range of measures to create awareness for and foster the adoption of FAIR data management. Initiate processes to integrate RDM and data science into curricula. Offer a wide range of training opportunities for researchers.
Key Objective 5: Explore synergies with other consortia and promote cross-cutting development within the NFDI.
Key Objective 6: Provide a legally reliable framework of policies and guidelines for FAIR and open RDM.
Collapse
|
2
|
Pedraza-González L, Marín MDC, Jorge AN, Ruck TD, Yang X, Valentini A, Olivucci M, De Vico L. Web-ARM: A Web-Based Interface for the Automatic Construction of QM/MM Models of Rhodopsins. J Chem Inf Model 2020; 60:1481-1493. [PMID: 31909998 PMCID: PMC7101466 DOI: 10.1021/acs.jcim.9b00615] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
This article introduces Web-ARM, a specialized tool, online available, designed to build quantum mechanical/molecular mechanical models of rhodopsins, a widely spread family of light-responsive proteins. Web-ARM allows the rapidly building of models of rhodopsins with a documented quality and the prediction of trends in UV-vis absorption maximum wavelengths, based on their excitation energies computed at the CASPT2//CASSCF/Amber level of theory. Web-ARM builds upon the recently reported, python-based a-ARM protocol [J. Chem. Theory Comput., 2019, 15, 3134-3152] and, as such, necessitates only a crystallographic structure or a comparative model in PDB format and a very basic knowledge of the studied rhodopsin system. The user-friendly web interface uses such input to generate congruous, gas-phase models of rhodopsins and, if requested, their mutants. We present two possible applications of Web-ARM, which showcase how the interface can be employed to assist both research and educational activities in fields at the interface between chemistry and biology. The first application shows how, through Web-ARM, research projects (e.g., rhodopsin and rhodopsin mutant screening) can be carried out in significantly less time with respect to using the required computational photochemistry tools via a command line. The second application documents the use of Web-ARM in a real-life educational/training activity, through a hands-on experience illustrating the concepts of rhodopsin color tuning.
Collapse
Affiliation(s)
- Laura Pedraza-González
- Department of Biotechnology, Chemistry and Pharmacy, Università degli Studi di Siena, Via A. Moro 2, I-53100 Siena, Italy
| | - María Del Carmen Marín
- Department of Biotechnology, Chemistry and Pharmacy, Università degli Studi di Siena, Via A. Moro 2, I-53100 Siena, Italy
| | - Alejandro N Jorge
- Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, United States of America
| | - Tyler D Ruck
- Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, United States of America
| | - Xuchun Yang
- Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, United States of America
| | - Alessio Valentini
- Theoretical Physical Chemistry, Research Unit MolSys, Université de Liège, Allée du 6 Août, 11, 4000 Liège, Belgium
| | - Massimo Olivucci
- Department of Biotechnology, Chemistry and Pharmacy, Università degli Studi di Siena, Via A. Moro 2, I-53100 Siena, Italy
- Department of Chemistry, Bowling Green State University, Bowling Green, Ohio 43403, United States of America
| | - Luca De Vico
- Department of Biotechnology, Chemistry and Pharmacy, Università degli Studi di Siena, Via A. Moro 2, I-53100 Siena, Italy
| |
Collapse
|
3
|
Banegas-Luna AJ, Imbernón B, Llanes Castro A, Pérez-Garrido A, Cerón-Carrasco JP, Gesing S, Merelli I, D'Agostino D, Pérez-Sánchez H. Advances in distributed computing with modern drug discovery. Expert Opin Drug Discov 2018; 14:9-22. [PMID: 30484337 DOI: 10.1080/17460441.2019.1552936] [Citation(s) in RCA: 6] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
INTRODUCTION Computational chemistry dramatically accelerates the drug discovery process and high-performance computing (HPC) can be used to speed up the most expensive calculations. Supporting a local HPC infrastructure is both costly and time-consuming, and, therefore, many research groups are moving from in-house solutions to remote-distributed computing platforms. Areas covered: The authors focus on the use of distributed technologies, solutions, and infrastructures to gain access to HPC capabilities, software tools, and datasets to run the complex simulations required in computational drug discovery (CDD). Expert opinion: The use of computational tools can decrease the time to market of new drugs. HPC has a crucial role in handling the complex algorithms and large volumes of data required to achieve specificity and avoid undesirable side-effects. Distributed computing environments have clear advantages over in-house solutions in terms of cost and sustainability. The use of infrastructures relying on virtualization reduces set-up costs. Distributed computing resources can be difficult to access, although web-based solutions are becoming increasingly available. There is a trade-off between cost-effectiveness and accessibility in using on-demand computing resources rather than free/academic resources. Graphics processing unit computing, with its outstanding parallel computing power, is becoming increasingly important.
Collapse
Affiliation(s)
- Antonio Jesús Banegas-Luna
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Baldomero Imbernón
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Antonio Llanes Castro
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Alfonso Pérez-Garrido
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - José Pedro Cerón-Carrasco
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| | - Sandra Gesing
- b Center for Research Computing , University of Notre Dame , Notre Dame , IN , USA
| | - Ivan Merelli
- c Institute for Biomedical Technologies , National Research Council of Italy , Segrate (Milan) , Italy
| | - Daniele D'Agostino
- d Institute for Applied Mathematics and Information Technologies "E. Magenes" , National Research Council of Italy , Genoa , Italy
| | - Horacio Pérez-Sánchez
- a Bioinformatics and High Performance Computing Research Group (BIO-HPC) , Universidad Católica de Murcia (UCAM) , Murcia , Spain
| |
Collapse
|
4
|
Pedretti A, Mazzolari A, Vistoli G. WarpEngine, a Flexible Platform for Distributed Computing Implemented in the VEGA Program and Specially Targeted for Virtual Screening Studies. J Chem Inf Model 2018; 58:1154-1160. [DOI: 10.1021/acs.jcim.8b00086] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Alessandro Pedretti
- Dipartimento di Scienze Farmaceutiche, Facoltà di Farmacia, Università degli Studi di Milano, Via Luigi Mangiagalli, 25, I-20133 Milano, Italy
| | - Angelica Mazzolari
- Dipartimento di Scienze Farmaceutiche, Facoltà di Farmacia, Università degli Studi di Milano, Via Luigi Mangiagalli, 25, I-20133 Milano, Italy
| | - Giulio Vistoli
- Dipartimento di Scienze Farmaceutiche, Facoltà di Farmacia, Università degli Studi di Milano, Via Luigi Mangiagalli, 25, I-20133 Milano, Italy
| |
Collapse
|
5
|
Mohr C, Friedrich A, Wojnar D, Kenar E, Polatkan AC, Codrea MC, Czemmel S, Kohlbacher O, Nahnsen S. qPortal: A platform for data-driven biomedical research. PLoS One 2018; 13:e0191603. [PMID: 29352322 PMCID: PMC5774839 DOI: 10.1371/journal.pone.0191603] [Citation(s) in RCA: 17] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/04/2017] [Accepted: 01/08/2018] [Indexed: 01/16/2023] Open
Abstract
Modern biomedical research aims at drawing biological conclusions from large, highly complex biological datasets. It has become common practice to make extensive use of high-throughput technologies that produce big amounts of heterogeneous data. In addition to the ever-improving accuracy, methods are getting faster and cheaper, resulting in a steadily increasing need for scalable data management and easily accessible means of analysis. We present qPortal, a platform providing users with an intuitive way to manage and analyze quantitative biological data. The backend leverages a variety of concepts and technologies, such as relational databases, data stores, data models and means of data transfer, as well as front-end solutions to give users access to data management and easy-to-use analysis options. Users are empowered to conduct their experiments from the experimental design to the visualization of their results through the platform. Here, we illustrate the feature-rich portal by simulating a biomedical study based on publically available data. We demonstrate the software’s strength in supporting the entire project life cycle. The software supports the project design and registration, empowers users to do all-digital project management and finally provides means to perform analysis. We compare our approach to Galaxy, one of the most widely used scientific workflow and analysis platforms in computational biology. Application of both systems to a small case study shows the differences between a data-driven approach (qPortal) and a workflow-driven approach (Galaxy). qPortal, a one-stop-shop solution for biomedical projects offers up-to-date analysis pipelines, quality control workflows, and visualization tools. Through intensive user interactions, appropriate data models have been developed. These models build the foundation of our biological data management system and provide possibilities to annotate data, query metadata for statistics and future re-analysis on high-performance computing systems via coupling of workflow management systems. Integration of project and data management as well as workflow resources in one place present clear advantages over existing solutions.
Collapse
Affiliation(s)
- Christopher Mohr
- Applied Bioinformatics, Center for Bioinformatics Tübingen, University of Tübingen, Sand 14, 72076 Tübingen, Germany
- * E-mail:
| | - Andreas Friedrich
- Applied Bioinformatics, Center for Bioinformatics Tübingen, University of Tübingen, Sand 14, 72076 Tübingen, Germany
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - David Wojnar
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - Erhan Kenar
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - Aydin Can Polatkan
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - Marius Cosmin Codrea
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - Stefan Czemmel
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| | - Oliver Kohlbacher
- Applied Bioinformatics, Center for Bioinformatics Tübingen, University of Tübingen, Sand 14, 72076 Tübingen, Germany
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
- Max Planck Institute for Developmental Biology, Max–Planck–Ring 5, 72076 Tübingen, Germany
| | - Sven Nahnsen
- Quantitative Biology Center (QBiC), University of Tübingen, Auf der Morgenstelle 10, 72076 Tübingen, Germany
| |
Collapse
|
6
|
Metz A, McKeown P, Esser B, Gohlke C, Kröckert K, Laurini L, Scheckenbach M, McCormick SN, Oswald M, Hoffmann A, Jones MD, Herres-Pawlis S. ZnII
Chlorido Complexes with Aliphatic, Chiral Bisguanidine Ligands as Catalysts in the Ring-Opening Polymerisation of rac
-Lactide Using FT-IR Spectroscopy in Bulk. Eur J Inorg Chem 2017. [DOI: 10.1002/ejic.201701147] [Citation(s) in RCA: 19] [Impact Index Per Article: 2.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/05/2023]
Affiliation(s)
- Angela Metz
- Institute of Inorganic Chemistry; RWTH Aachen University; Landoltweg 1 52074 Germany
| | - Paul McKeown
- Centre for Sustainable Chemical Technologies; Department of Chemistry; University of Bath; Claverton Down BA2 7AY Bath United Kingdom
| | - Bastian Esser
- Institute of Inorganic Chemistry; RWTH Aachen University; Landoltweg 1 52074 Germany
| | - Clara Gohlke
- Institute of Inorganic Chemistry; RWTH Aachen University; Landoltweg 1 52074 Germany
| | - Konstantin Kröckert
- Institute of Inorganic Chemistry; RWTH Aachen University; Landoltweg 1 52074 Germany
| | - Larissa Laurini
- Institute of Inorganic Chemistry; RWTH Aachen University; Landoltweg 1 52074 Germany
| | - Michael Scheckenbach
- Department of Chemistry and Pharmacy; Ludwig-Maximlians University Munich; Butenandtstraße 5-13 81377 Munich Germany
| | - Strachan N. McCormick
- Centre for Sustainable Chemical Technologies; Department of Chemistry; University of Bath; Claverton Down BA2 7AY Bath United Kingdom
| | - Michaela Oswald
- Department of Chemistry and Pharmacy; Ludwig-Maximlians University Munich; Butenandtstraße 5-13 81377 Munich Germany
| | - Alexander Hoffmann
- Institute of Inorganic Chemistry; RWTH Aachen University; Landoltweg 1 52074 Germany
| | - Matthew D. Jones
- Centre for Sustainable Chemical Technologies; Department of Chemistry; University of Bath; Claverton Down BA2 7AY Bath United Kingdom
| | - Sonja Herres-Pawlis
- Institute of Inorganic Chemistry; RWTH Aachen University; Landoltweg 1 52074 Germany
| |
Collapse
|
7
|
Reactivity of Zinc Halide Complexes Containing Camphor-Derived Guanidine Ligands with Technical rac-Lactide. INORGANICS 2017. [DOI: 10.3390/inorganics5040085] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/16/2022] Open
|
8
|
Laloo JZA, Laloo N, Rhyman L, Ramasami P. ExcelAutomat: a tool for systematic processing of files as applied to quantum chemical calculations. J Comput Aided Mol Des 2017. [DOI: 10.1007/s10822-017-0031-8] [Citation(s) in RCA: 14] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/06/2023]
|
9
|
Multi-level meta-workflows: new concept for regularly occurring tasks in quantum chemistry. J Cheminform 2016; 8:58. [PMID: 27818709 PMCID: PMC5073744 DOI: 10.1186/s13321-016-0169-8] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/14/2016] [Accepted: 10/04/2016] [Indexed: 11/10/2022] Open
Abstract
Background
In Quantum Chemistry, many tasks are reoccurring frequently, e.g. geometry optimizations, benchmarking series etc. Here, workflows can help to reduce the time of manual job definition and output extraction. These workflows are executed on computing infrastructures and may require large computing and data resources. Scientific workflows hide these infrastructures and the resources needed to run them. It requires significant efforts and specific expertise to design, implement and test these workflows. Significance Many of these workflows are complex and monolithic entities that can be used for particular scientific experiments. Hence, their modification is not straightforward and it makes almost impossible to share them. To address these issues we propose developing atomic workflows and embedding them in meta-workflows. Atomic workflows deliver a well-defined research domain specific function. Publishing workflows in repositories enables workflow sharing inside and/or among scientific communities. We formally specify atomic and meta-workflows in order to define data structures to be used in repositories for uploading and sharing them. Additionally, we present a formal description focused at orchestration of atomic workflows into meta-workflows. Conclusions We investigated the operations that represent basic functionalities in Quantum Chemistry, developed the relevant atomic workflows and combined them into meta-workflows. Having these workflows we defined the structure of the Quantum Chemistry workflow library and uploaded these workflows in the SHIWA Workflow Repository.Meta-workflows and embedded workflows in the template representation ![]() Electronic supplementary material The online version of this article (doi:10.1186/s13321-016-0169-8) contains supplementary material, which is available to authorized users.
Collapse
|
10
|
Metz A, Plothe R, Glowacki B, Koszalkowski A, Scheckenbach M, Beringer A, Rösener T, Michaelis de Vasconcellos J, Haase R, Flörke U, Hoffmann A, Herres-Pawlis S. Zinc Chloride Complexes with Aliphatic and Aromatic Guanidine Hybrid Ligands and Their Activity in the Ring-Opening Polymerisation ofd,l-Lactide. Eur J Inorg Chem 2016. [DOI: 10.1002/ejic.201600870] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Angela Metz
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Ramona Plothe
- Technische Universität Dortmund; Fakultät Chemie und Chemische Biologie; Otto-Hahn-Str. 6 44227 Dortmund Germany
| | - Britta Glowacki
- Technische Universität Dortmund; Fakultät Chemie und Chemische Biologie; Otto-Hahn-Str. 6 44227 Dortmund Germany
| | - Andreas Koszalkowski
- Ludwig-Maximilians-Universität München; Department of Chemistry; Butenandtstr. 5-13 81377 München Germany
| | - Michael Scheckenbach
- Ludwig-Maximilians-Universität München; Department of Chemistry; Butenandtstr. 5-13 81377 München Germany
| | - Andreas Beringer
- Ludwig-Maximilians-Universität München; Department of Chemistry; Butenandtstr. 5-13 81377 München Germany
| | - Thomas Rösener
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | | | - Roxana Haase
- Universität Paderborn, Anorganische Chemie; Department Chemie; Warburger Str. 100 33098 Paderborn Germany
| | - Ulrich Flörke
- Universität Paderborn, Anorganische Chemie; Department Chemie; Warburger Str. 100 33098 Paderborn Germany
| | - Alexander Hoffmann
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Sonja Herres-Pawlis
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| |
Collapse
|
11
|
Hoffmann A, Wern M, Hoppe T, Witte M, Haase R, Liebhäuser P, Glatthaar J, Herres-Pawlis S, Schindler S. Hand in Hand: Experimental and Theoretical Investigations into the Reactions of Copper(I) Mono- and Bis(guanidine) Complexes with Dioxygen. Eur J Inorg Chem 2016. [DOI: 10.1002/ejic.201600906] [Citation(s) in RCA: 25] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/06/2022]
Affiliation(s)
- Alexander Hoffmann
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Miriam Wern
- Institut für Anorganische und Analytische Chemie; Justus-Liebig-Universität Gießen; Heinrich-Buff-Ring 17 35392 Gießen Germany
| | - Tobias Hoppe
- Institut für Anorganische und Analytische Chemie; Justus-Liebig-Universität Gießen; Heinrich-Buff-Ring 17 35392 Gießen Germany
| | - Matthias Witte
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Roxana Haase
- Department Chemie; Universität Paderborn; Warburger Str. 100 33098 Paderborn Germany
| | - Patricia Liebhäuser
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Jörg Glatthaar
- Institut für Organische Chemie; Heinrich-Buff-Ring 17 35392 Gießen Germany
| | - Sonja Herres-Pawlis
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Siegfried Schindler
- Institut für Anorganische und Analytische Chemie; Justus-Liebig-Universität Gießen; Heinrich-Buff-Ring 17 35392 Gießen Germany
| |
Collapse
|
12
|
Hoffmann A, Stanek J, Dicke B, Peters L, Grimm-Lebsanft B, Wetzel A, Jesser A, Bauer M, Gnida M, Meyer-Klaucke W, Rübhausen M, Herres-Pawlis S. Implications of Guanidine Substitution on Copper Complexes as Entatic-State Models. Eur J Inorg Chem 2016. [DOI: 10.1002/ejic.201600655] [Citation(s) in RCA: 34] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/09/2022]
Affiliation(s)
- Alexander Hoffmann
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Julia Stanek
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| | - Benjamin Dicke
- Universität Hamburg; Institut für Nanostruktur- und Festkörperphysik and Center for Free-Electron Laser Science; Notkestrasse 85 22607 Hamburg Germany
| | - Laurens Peters
- Department Chemie; Ludwig-Maximilians Universität München; Butenandtstraße 5-13 81377 München Germany
| | - Benjamin Grimm-Lebsanft
- Universität Hamburg; Institut für Nanostruktur- und Festkörperphysik and Center for Free-Electron Laser Science; Notkestrasse 85 22607 Hamburg Germany
| | - Alina Wetzel
- Universität Hamburg; Institut für Nanostruktur- und Festkörperphysik and Center for Free-Electron Laser Science; Notkestrasse 85 22607 Hamburg Germany
| | - Anton Jesser
- Department Chemie; Ludwig-Maximilians Universität München; Butenandtstraße 5-13 81377 München Germany
| | - Matthias Bauer
- Universität Paderborn; Department Chemie; Warburger Str. 100 33098 Paderborn Germany
| | - Manuel Gnida
- Universität Paderborn; Department Chemie; Warburger Str. 100 33098 Paderborn Germany
| | - Wolfram Meyer-Klaucke
- Universität Paderborn; Department Chemie; Warburger Str. 100 33098 Paderborn Germany
| | - Michael Rübhausen
- Universität Hamburg; Institut für Nanostruktur- und Festkörperphysik and Center for Free-Electron Laser Science; Notkestrasse 85 22607 Hamburg Germany
| | - Sonja Herres-Pawlis
- Institut für Anorganische Chemie; RWTH Aachen University; Landoltweg 1 52074 Aachen Germany
| |
Collapse
|
13
|
Pérez-Sánchez H, Rezaei V, Mezhuyev V, Man D, Peña-García J, den-Haan H, Gesing S. Developing science gateways for drug discovery in a grid environment. SPRINGERPLUS 2016; 5:1300. [PMID: 27547674 PMCID: PMC4978646 DOI: 10.1186/s40064-016-2914-x] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/21/2016] [Accepted: 07/26/2016] [Indexed: 11/16/2022]
Abstract
BACKGROUND Methods for in silico screening of large databases of molecules increasingly complement and replace experimental techniques to discover novel compounds to combat diseases. As these techniques become more complex and computationally costly we are faced with an increasing problem to provide the research community of life sciences with a convenient tool for high-throughput virtual screening on distributed computing resources. RESULTS To this end, we recently integrated the biophysics-based drug-screening program FlexScreen into a service, applicable for large-scale parallel screening and reusable in the context of scientific workflows. CONCLUSIONS Our implementation is based on Pipeline Pilot and Simple Object Access Protocol and provides an easy-to-use graphical user interface to construct complex workflows, which can be executed on distributed computing resources, thus accelerating the throughput by several orders of magnitude.
Collapse
Affiliation(s)
- Horacio Pérez-Sánchez
- Bioinformatics and High Performance Computing Research Group (BIO-HPC), Computer Engineering Department, Universidad Catolica San Antonio de Murcia (UCAM), Murcia, Spain
| | - Vahid Rezaei
- Department of Statistics, Faculty of Mathematics and Computer Sciences, Allameh Tabataba’i University, Tehran, Iran
| | - Vitaliy Mezhuyev
- Faculty of Computer Systems and Software Engineering, University Malaysia Pahang, Pekan, Malaysia
| | - Duhu Man
- Shenzhen Institutes of Advanced Technologies, Chinese Academy of Sciences, Shenzhen, P.R.China
| | - Jorge Peña-García
- Bioinformatics and High Performance Computing Research Group (BIO-HPC), Computer Engineering Department, Universidad Catolica San Antonio de Murcia (UCAM), Murcia, Spain
| | - Helena den-Haan
- Bioinformatics and High Performance Computing Research Group (BIO-HPC), Computer Engineering Department, Universidad Catolica San Antonio de Murcia (UCAM), Murcia, Spain
| | - Sandra Gesing
- Center for Research Computing, University of Notre Dame, Notre Dame, IN USA
| |
Collapse
|
14
|
Jaghoori MM, Bleijlevens B, Olabarriaga SD. 1001 Ways to run AutoDock Vina for virtual screening. J Comput Aided Mol Des 2016; 30:237-49. [PMID: 26897747 PMCID: PMC4801993 DOI: 10.1007/s10822-016-9900-9] [Citation(s) in RCA: 90] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/06/2015] [Accepted: 02/10/2016] [Indexed: 11/28/2022]
Abstract
Large-scale computing technologies have enabled high-throughput virtual screening involving thousands to millions of drug candidates. It is not trivial, however, for biochemical scientists to evaluate the technical alternatives and their implications for running such large experiments. Besides experience with the molecular docking tool itself, the scientist needs to learn how to run it on high-performance computing (HPC) infrastructures, and understand the impact of the choices made. Here, we review such considerations for a specific tool, AutoDock Vina, and use experimental data to illustrate the following points: (1) an additional level of parallelization increases virtual screening throughput on a multi-core machine; (2) capturing of the random seed is not enough (though necessary) for reproducibility on heterogeneous distributed computing systems; (3) the overall time spent on the screening of a ligand library can be improved by analysis of factors affecting execution time per ligand, including number of active torsions, heavy atoms and exhaustiveness. We also illustrate differences among four common HPC infrastructures: grid, Hadoop, small cluster and multi-core (virtual machine on the cloud). Our analysis shows that these platforms are suitable for screening experiments of different sizes. These considerations can guide scientists when choosing the best computing platform and set-up for their future large virtual screening experiments.
Collapse
Affiliation(s)
- Mohammad Mahdi Jaghoori
- />Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Center, University of Amsterdam, Amsterdam, Netherlands
| | - Boris Bleijlevens
- />Department of Medical Biochemistry, Academic Medical Center, University of Amsterdam, Amsterdam, Netherlands
| | - Silvia D. Olabarriaga
- />Department of Clinical Epidemiology, Biostatistics and Bioinformatics, Academic Medical Center, University of Amsterdam, Amsterdam, Netherlands
| |
Collapse
|
15
|
Pamidighantam S, Nakandala S, Abeysinghe E, Wimalasena C, Yodage SR, Marru S, Pierce M. Community Science Exemplars in SEAGrid Science Gateway: Apache Airavata Based Implementation of Advanced Infrastructure. ACTA ACUST UNITED AC 2016. [DOI: 10.1016/j.procs.2016.05.535] [Citation(s) in RCA: 31] [Impact Index Per Article: 3.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/21/2022]
|
16
|
Kabeshov MA, Śliwiński É, Fitzpatrick DE, Musio B, Newby JA, Blaylock WDW, Ley SV. Development of a web-based platform for studying lithiation reactions in silico. Chem Commun (Camb) 2015; 51:7172-5. [PMID: 25811168 DOI: 10.1039/c5cc00782h] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/21/2022]
Abstract
A novel integrated web-based system has been developed to rationalise and predict lithiation reactions in silico.
Collapse
Affiliation(s)
| | - Éric Śliwiński
- Department of Chemistry
- University of Cambridge
- Cambridge CB2 1EW
- UK
| | | | - Biagia Musio
- Department of Chemistry
- University of Cambridge
- Cambridge CB2 1EW
- UK
| | - James A. Newby
- Department of Chemistry
- University of Cambridge
- Cambridge CB2 1EW
- UK
| | | | - Steven V. Ley
- Department of Chemistry
- University of Cambridge
- Cambridge CB2 1EW
- UK
| |
Collapse
|
17
|
Hoffmann A, Rohrmüller M, Jesser A, dos Santos Vieira I, Schmidt WG, Herres-Pawlis S. Geometrical and optical benchmarking of copper(II) guanidine-quinoline complexes: insights from TD-DFT and many-body perturbation theory (part II). J Comput Chem 2014; 35:2146-61. [PMID: 25255876 DOI: 10.1002/jcc.23740] [Citation(s) in RCA: 31] [Impact Index Per Article: 2.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/08/2014] [Revised: 08/15/2014] [Accepted: 08/18/2014] [Indexed: 12/15/2022]
Abstract
Ground- and excited-state properties of copper(II) charge-transfer systems have been investigated starting from density-functional calculations with particular emphasis on the role of (i) the exchange and correlation functional, (ii) the basis set, (iii) solvent effects, and (iv) the treatment of dispersive interactions. Furthermore (v), the applicability of TD-DFT to excitations of copper(II) bis(chelate) charge-transfer systems is explored by performing many-body perturbation theory (GW + BSE), independent-particle approximation and ΔSCF calculations for a small model system that contains simple guanidine and imine groups. These results show that DFT and TD-DFT in particular in combination with hybrid functionals are well suited for the description of the structural and optical properties, respectively, of copper(II) bis(chelate) complexes. Furthermore, it is found an accurate theoretical geometrical description requires the use of dispersion correction with Becke-Johnson damping and triple-zeta basis sets while solvent effects are small. The hybrid functionals B3LYP and TPSSh yielded best performance. The optical description is best with B3LYP, whereby heavily mixed molecular transitions of MLCT and LLCT character are obtained which can be more easily understood using natural transition orbitals. An natural bond orbital analysis sheds light on the donor properties of the different donor functions and the intraguanidine stabilization during coordination to copper(I) and (II).
Collapse
Affiliation(s)
- Alexander Hoffmann
- Department Chemie, Ludwig-Maximilians-Universität München, Butenandtstr. 5-13, 81377, München, Germany
| | | | | | | | | | | |
Collapse
|
18
|
Managing, analysing, and integrating big data in medical bioinformatics: open problems and future perspectives. BIOMED RESEARCH INTERNATIONAL 2014; 2014:134023. [PMID: 25254202 PMCID: PMC4165507 DOI: 10.1155/2014/134023] [Citation(s) in RCA: 60] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 06/18/2014] [Accepted: 08/13/2014] [Indexed: 11/25/2022]
Abstract
The explosion of the data both in the biomedical research and in the healthcare systems demands urgent solutions. In particular, the research in omics sciences is moving from a hypothesis-driven to a data-driven approach. Healthcare is additionally always asking for a tighter integration with biomedical data in order to promote personalized medicine and to provide better treatments. Efficient analysis and interpretation of Big Data opens new avenues to explore molecular biology, new questions to ask about physiological and pathological states, and new ways to answer these open issues. Such analyses lead to better understanding of diseases and development of better and personalized diagnostics and therapeutics. However, such progresses are directly related to the availability of new solutions to deal with this huge amount of information. New paradigms are needed to store and access data, for its annotation and integration and finally for inferring knowledge and making it available to researchers. Bioinformatics can be viewed as the “glue” for all these processes. A clear awareness of present high performance computing (HPC) solutions in bioinformatics, Big Data analysis paradigms for computational biology, and the issues that are still open in the biomedical and healthcare fields represent the starting point to win this challenge.
Collapse
|
19
|
Hoffmann A, Grunzke R, Herres-Pawlis S. Insights into the influence of dispersion correction in the theoretical treatment of guanidine-quinoline copper(I) complexes. J Comput Chem 2014; 35:1943-50. [DOI: 10.1002/jcc.23706] [Citation(s) in RCA: 48] [Impact Index Per Article: 4.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/10/2014] [Revised: 07/14/2014] [Accepted: 07/16/2014] [Indexed: 01/22/2023]
Affiliation(s)
- Alexander Hoffmann
- Department of Chemistry; Ludwig-Maximilians-Universität München, Butenandtstr. 5 - 13; 81377 München Germany
| | - Richard Grunzke
- Zentrum für Informationsdienste und Hochleistungsrechnen; Technische Universität Dresden; Zellescher Weg 12-14 01062 Dresden Germany
| | - Sonja Herres-Pawlis
- Department of Chemistry; Ludwig-Maximilians-Universität München, Butenandtstr. 5 - 13; 81377 München Germany
| |
Collapse
|
20
|
Performance studies on distributed virtual screening. BIOMED RESEARCH INTERNATIONAL 2014; 2014:624024. [PMID: 25032219 PMCID: PMC4083208 DOI: 10.1155/2014/624024] [Citation(s) in RCA: 7] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/06/2014] [Revised: 05/17/2014] [Accepted: 05/19/2014] [Indexed: 11/17/2022]
Abstract
Virtual high-throughput screening (vHTS) is an invaluable method in modern drug discovery. It permits screening large datasets or databases of chemical structures for those structures binding possibly to a drug target. Virtual screening is typically performed by docking code, which often runs sequentially. Processing of huge vHTS datasets can be parallelized by chunking the data because individual docking runs are independent of each other. The goal of this work is to find an optimal splitting maximizing the speedup while considering overhead and available cores on Distributed Computing Infrastructures (DCIs).
We have conducted thorough performance studies accounting not only for the runtime of the docking itself, but also for structure preparation. Performance studies were conducted via the workflow-enabled science gateway MoSGrid (Molecular Simulation Grid). As input we used benchmark datasets for protein kinases. Our performance studies show that docking workflows can be made to scale almost linearly up to 500 concurrent processes distributed even over large DCIs, thus accelerating vHTS campaigns significantly.
Collapse
|