1
|
Sigmund LM, Assante M, Johansson MJ, Norrby PO, Jorner K, Kabeshov M. Computational tools for the prediction of site- and regioselectivity of organic reactions. Chem Sci 2025; 16:5383-5412. [PMID: 40070469 PMCID: PMC11891785 DOI: 10.1039/d5sc00541h] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/21/2025] [Accepted: 03/03/2025] [Indexed: 03/14/2025] Open
Abstract
The regio- and site-selectivity of organic reactions is one of the most important aspects when it comes to synthesis planning. Due to that, massive research efforts were invested into computational models for regio- and site-selectivity prediction, and the introduction of machine learning to the chemical sciences within the past decade has added a whole new dimension to these endeavors. This review article walks through the currently available predictive tools for regio- and site-selectivity with a particular focus on machine learning models while being organized along the individual reaction classes of organic chemistry. Respective featurization techniques and model architectures are described and compared to each other; applications of the tools to critical real-world examples are highlighted. This paper aims to serve as an overview of the field's status quo for both the intended users of the tools, that is synthetic chemists, as well as for developers to find potential new research avenues.
Collapse
Affiliation(s)
- Lukas M Sigmund
- Molecular AI, Discovery Sciences, R&D, AstraZeneca Gothenburg Pepparedsleden 1 43183 Mölndal Sweden
| | - Michele Assante
- Innovation Centre in Digital Molecular Technologies, Department of Chemistry, University of Cambridge Lensfield Rd Cambridge CB2 1EW UK
- Compound Synthesis & Management, The Discovery Centre, AstraZeneca Cambridge Cambridge Biomedical Campus, 1 Francis Crick Avenue CB2 0AA Cambridge UK
| | - Magnus J Johansson
- Medicinal Chemistry, Research and Early Development, Cardiovascular, Renal and Metabolism (CVRM), BioPharmaceuticals, R&D, AstraZeneca Gothenburg Pepparedsleden 1 43183 Mölndal Sweden
| | - Per-Ola Norrby
- Data Science & Modelling, Pharmaceutical Sciences, R&D, AstraZeneca Gothenburg Pepparedsleden 1 43183 Mölndal Sweden
| | - Kjell Jorner
- ETH Zürich, Institute of Chemical and Bioengineering, Department of Chemistry and Applied Biosciences Vladimir-Prelog-Weg 1 CH-8093 Zürich Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, ETH Zurich Zurich Switzerland
| | - Mikhail Kabeshov
- Molecular AI, Discovery Sciences, R&D, AstraZeneca Gothenburg Pepparedsleden 1 43183 Mölndal Sweden
| |
Collapse
|
2
|
Afridi MB, Sardar H, Serdaroğlu G, Shah SWA, Alsharif KF, Khan H. SwissADME studies and Density Functional Theory (DFT) approaches of methyl substituted curcumin derivatives. Comput Biol Chem 2024; 112:108153. [PMID: 39067349 DOI: 10.1016/j.compbiolchem.2024.108153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/06/2024] [Revised: 06/27/2024] [Accepted: 07/12/2024] [Indexed: 07/30/2024]
Abstract
Research suggests curcumin's safety and efficacy, prompting interest in its use for treating and preventing various human diseases. The current study aimed to predict drag ability of methyl substituted curcumin derivatives (BL1 to BL4) using SwissADME and Density Functional Theory (DFT) approaches. The curcumin derivatives investigated mostly adhere to Lipinski's rule of five, with molecular properties including MW, F. Csp3, nHBA, nHBD, and TPSA falling within acceptable limits. The compounds demonstrating high lipophilicity while poor water solubility. The pharmacokinetic evaluation revealed favorable gastrointestinal absorption and blood-brain barrier permeation while none were identified as substrates for P-glycoprotein, however, revealed inhibitory actions against various cytochrome P450 enzymes. Additionally, all derivatives exhibited a consistent bioavailability score of 0.55. Similarly, the DFT computations of the compounds of the curcumin derivatives were conducted at B3LYP/6-311 G** level to predict and then assess the key electronic characteristics underlying the bioactivity. Accordingly, the BL4 molecule (ΔEgap= 4.105 eV) would prefer to interact with the external molecular system more than the other molecules due to having the biggest energy gap. The ΔNmax (2.328 eV) and Δεback-donat. (-0.446 eV) scores implied that BL1 would have more charge transfer capability and the lowest stability via back donation among the compounds. In short, the derivative (BL1 to BL4) exhibited strong extrinsic therapeutic properties and therefore stand eligible for further in vitro and in vivo studies.
Collapse
Affiliation(s)
| | - Haseeba Sardar
- Department of Pharmacy, Abdul Wali Khan University Mardan, Mardan 23200, Pakistan.
| | | | | | - Khalaf F Alsharif
- Department of Clinical Laboratory Sciences, College of Applied Medical Sciences, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia.
| | - Haroon Khan
- Department of Pharmacy, Abdul Wali Khan University Mardan, Mardan 23200, Pakistan.
| |
Collapse
|
3
|
Paderni D, Voccia M, Macedi E, Formica M, Giorgi L, Caporaso L, Fusi V. A combined solid state, solution and DFT study of a dimethyl-cyclen-Pd(II) complex. Dalton Trans 2024; 53:14300-14314. [PMID: 39133309 DOI: 10.1039/d4dt01791a] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/13/2024]
Abstract
A new palladium(II) complex containing the previously synthesized 4,10-bis[(3-hydroxy-4-pyron-2-yl)methyl]-1,7-dimethyl-1,4,7,10-tetraazacyclododecane ligand maltonis was prepared and characterized both in solution and in the solid state. Hirshfeld surface and energy framework analyses were also performed. Because maltonis already showed antineoplastic activity, the complexation of Pd(II), chosen as an alternative to Pt(II), was investigated to study its possible biological activity. UV-vis and NMR studies confirmed the formation and stability of the complex in aqueous solution at physiological pH. X-ray diffraction data revealed a structure where the Pd(II) ion is lodged in the dimethyl-cyclen cavity, with maltol rings facing each other (closed shape) even if they are not involved in the coordination. DFT analysis was performed in order to understand the most stable shape of the complex. In view of evaluating its possible bioactive conformation, the DFT study suggested a slight energetic preference for the closed one. The resulting closed complex was stabilized in the X-ray structure by intermolecular interactions that replace the intramolecular interactions present in the optimized complex. According to the DFT calculated formation energies, notwithstanding its rarity, the Pd(II) complex of maltonis is the thermodynamically preferred one among analogous complexes containing different metal ions (Pt(II), Co(II), and Cu(II)). Finally, to study its possible biological activity, the interaction between the Pd(II) complex of maltonis and nucleosides was evaluated through NMR and DFT calculations, revealing a possible interaction with purines via the maltol moieties.
Collapse
Affiliation(s)
- Daniele Paderni
- Department of Pure and Applied Sciences, University of Urbino, via Ca' le Suore 2-4, 61029 Urbino, Italy.
| | - Maria Voccia
- Department of Chemistry and Biology, University of Salerno, via Giovanni Paolo II 132, 84084 Fisciano, Salerno, Italy
| | - Eleonora Macedi
- Department of Pure and Applied Sciences, University of Urbino, via Ca' le Suore 2-4, 61029 Urbino, Italy.
| | - Mauro Formica
- Department of Pure and Applied Sciences, University of Urbino, via Ca' le Suore 2-4, 61029 Urbino, Italy.
| | - Luca Giorgi
- Department of Pure and Applied Sciences, University of Urbino, via Ca' le Suore 2-4, 61029 Urbino, Italy.
| | - Lucia Caporaso
- Department of Chemistry and Biology, University of Salerno, via Giovanni Paolo II 132, 84084 Fisciano, Salerno, Italy
| | - Vieri Fusi
- Department of Pure and Applied Sciences, University of Urbino, via Ca' le Suore 2-4, 61029 Urbino, Italy.
| |
Collapse
|
4
|
Huynh H, Le K, Vu L, Nguyen T, Holcomb M, Forli S, Phan H. Synergy of machine learning and density functional theory calculations for predicting experimental Lewis base affinity and Lewis polybase binding atoms. J Comput Chem 2024; 45:1552-1561. [PMID: 38500409 PMCID: PMC11099847 DOI: 10.1002/jcc.27329] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/31/2023] [Revised: 01/26/2024] [Accepted: 01/31/2024] [Indexed: 03/20/2024]
Abstract
Investigation of Lewis acid-base interactions has been conducted by ab initio calculations and machine learning (ML) models. This study aims to resolve two critical tasks that have not been quantitatively investigated. First, ML models developed from density functional theory (DFT) calculations predict experimental BF3 affinity with Pearson correlation coefficients around 0.9 and mean absolute errors around 10 kJ mol-1. The ML models are trained by DFT-calculated BF3 affinity of more than 3000 adducts, with input features readily obtained by rdkit. Second, the ML models have the capability of predicting the relative strength of Lewis base binding atoms in Lewis polybases, which is either an extremely challenging task to conduct experimentally or a computationally expensive task for ab initio methods. The study demonstrates and solidifies the potential of combining DFT calculations and ML models to predict experimental properties, especially those that are scarce and impractical to empirically acquire.
Collapse
Affiliation(s)
- Hieu Huynh
- Fulbright University Vietnam, Ho Chi Minh city, Vietnam, Ho Chi Minh City 700000
| | - Khanh Le
- Fulbright University Vietnam, Ho Chi Minh city, Vietnam, Ho Chi Minh City 700000
| | - Linh Vu
- Fulbright University Vietnam, Ho Chi Minh city, Vietnam, Ho Chi Minh City 700000
| | - Trang Nguyen
- Fulbright University Vietnam, Ho Chi Minh city, Vietnam, Ho Chi Minh City 700000
| | - Matthew Holcomb
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037 USA
| | - Stefano Forli
- Department of Integrative Structural and Computational Biology, The Scripps Research Institute, La Jolla, CA 92037 USA
| | - Hung Phan
- Fulbright University Vietnam, Ho Chi Minh city, Vietnam, Ho Chi Minh City 700000
- Soka University of America, Aliso Viejo, California, United States, CA 92656
| |
Collapse
|
5
|
Luan Y, Li X, Kong D, Li W, Li W, Zhang Q, Pang A. Development and uniqueness test of highly selective atomic topological indices based on the number of attached hydrogen atoms. J Mol Graph Model 2024; 129:108752. [PMID: 38479237 DOI: 10.1016/j.jmgm.2024.108752] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/09/2023] [Accepted: 02/27/2024] [Indexed: 04/15/2024]
Abstract
On the basis of the atomic graph-theoretical index - aEAID (atomic Extended Adjacency matrix IDentification) and molecular adjacent topological index - ATID (Adjacent Topological IDentification) suggested by one of the authors (Zhang Q), a highly selective atomic topological index - aATID (atomic Adjacent Topological IDentification) index was suggested to identify the equivalent atoms in this study. The aATID index of an atom was derived from the number of the attached hydrogen atoms of the atom but omitting bond types. In this case, the suggested index can be used to identify equivalent atoms in chemistry but perhaps not equivalent in the molecular graph. To test the uniqueness of aATID indices, the virtual atomic data sets were derived from alkanes containing 15-20 carbon atoms and the isomers of Octogen, as well as a real data set was derived from the NCI database. Only four pairs of atoms from alkanes containing 20 carbons can't be discriminated by aATID, that is, four pairs of degenerates were found for this data set. To solve this problem, the aATID index was modified by introducing distance factors between atoms, and the 2-aATID index was suggested. Its uniqueness was examined by 5,939,902 atoms derived from alkanes containing 20 carbons and further 16,166,984 atoms from alkanes of 21 carbons, and no degenerates were found. In addition, another large real data set of 16,650,688 atoms derived from the PubChem database was also used to test the uniqueness of both aATID and 2-aATID. As a result, each atom was successfully discriminated by any of the two indices. Finally, the suggested aATID index was applied to the identification of duplicate atoms as data pretreatment for QSPR (Quantitative Structure-Property Relationships) studies.
Collapse
Affiliation(s)
- Yue Luan
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Xianlan Li
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Dingling Kong
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Wanli Li
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China
| | - Wei Li
- Science and Technology on Aerospace Chemical Power Laboratory, Hubei Institute of Aerospace Chemotechnology, Xiangyang, 441003, Hubei, China
| | - Qingyou Zhang
- Henan Engineering Research Center of Industrial Circulating Water Treatment, Henan Joint International Research Laboratory of Environmental Pollution Control Materials, Henan University, Kaifeng, 475004, China.
| | - Aimin Pang
- Science and Technology on Aerospace Chemical Power Laboratory, Hubei Institute of Aerospace Chemotechnology, Xiangyang, 441003, Hubei, China.
| |
Collapse
|
6
|
Gou Q, Liu J, Su H, Guo Y, Chen J, Zhao X, Pu X. Exploring an accurate machine learning model to quickly estimate stability of diverse energetic materials. iScience 2024; 27:109452. [PMID: 38523799 PMCID: PMC10960145 DOI: 10.1016/j.isci.2024.109452] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/18/2023] [Revised: 01/27/2024] [Accepted: 03/06/2024] [Indexed: 03/26/2024] Open
Abstract
High energy and low sensitivity have been the focus of developing new energetic materials (EMs). However, there has been a lack of a quick and accurate method for evaluating the stability of diverse EMs. Here, we develop a machine learning prediction model with high accuracy for bond dissociation energy (BDE) of EMs. A reliable and representative BDE dataset of EMs is constructed by collecting 778 experimental energetic compounds and quantum mechanics calculation. To sufficiently characterize the BDE of EMs, a hybrid feature representation is proposed by coupling the local target bond into the global structure characteristics. To alleviate the limitation of the low dataset, pairwise difference regression is utilized as a data augmentation with the advantage of reducing systematic errors and improving diversity. Benefiting from these improvements, the XGBoost model achieves the best prediction accuracy with R2 of 0.98 and MAE of 8.8 kJ mol-1, significantly outperforming competitive models.
Collapse
Affiliation(s)
- Qiaolin Gou
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Jing Liu
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Haoming Su
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Yanzhi Guo
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Jiayi Chen
- College of Chemistry, Sichuan University, Chengdu 610064, China
| | - Xueyan Zhao
- Institute of Chemical Materials, China Academy of Engineering Physics, Mianyang 621900, China
| | - Xuemei Pu
- College of Chemistry, Sichuan University, Chengdu 610064, China
| |
Collapse
|
7
|
Fliszkiewicz B, Sajdak M. Toward Quantum-Informed Atom Pairs. ACS OMEGA 2024; 9:5966-5971. [PMID: 38343973 PMCID: PMC10851243 DOI: 10.1021/acsomega.3c09744] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/06/2023] [Revised: 01/12/2024] [Accepted: 01/18/2024] [Indexed: 10/28/2024]
Abstract
In the following research, a new modification of traditional atom pairs is studied. The atom pairs are enriched with values originating from quantum chemistry calculations. A random forest machine learning algorithm is applied to model 10 different properties and biological activities based on different molecular representations, and it is evaluated via repeated cross-validation. The predictive power of modified atom pairs, quantum atom pairs, are compared to the predictive powers of traditional molecular representations known and widely applied in cheminformatics. The root mean squared error (RMSE), R2, area under the receiver operating characteristic curve (AUC) and balanced accuracy were used to evaluate the predictive power of the applied molecular representations. Research has shown that while performing regression tasks, quantum atom pairs provide better fits to the data than do their precursors.
Collapse
Affiliation(s)
- Bartłomiej Fliszkiewicz
- Faculty
of New Technologies and Chemistry, Military
University of Technology, Warsaw 00-908, Poland
| | - Marcin Sajdak
- Faculty
of Energy and Environmental Engineering Silesian University of Technology, Gliwice 44-100, Poland
- School
of Chemical Engineering, University of Birmingham, Edgbaston, Birmingham B15 2TT, United
Kingdom
| |
Collapse
|
8
|
García-Andrade X, García Tahoces P, Pérez-Ríos J, Martínez Núñez E. Barrier Height Prediction by Machine Learning Correction of Semiempirical Calculations. J Phys Chem A 2023; 127:2274-2283. [PMID: 36877614 PMCID: PMC10845151 DOI: 10.1021/acs.jpca.2c08340] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 02/19/2023] [Indexed: 03/07/2023]
Abstract
Different machine learning (ML) models are proposed in the present work to predict density functional theory-quality barrier heights (BHs) from semiempirical quantum mechanical (SQM) calculations. The ML models include a multitask deep neural network, gradient-boosted trees by means of the XGBoost interface, and Gaussian process regression. The obtained mean absolute errors are similar to those of previous models considering the same number of data points. The ML corrections proposed in this paper could be useful for rapid screening of the large reaction networks that appear in combustion chemistry or in astrochemistry. Finally, our results show that 70% of the features with the highest impact on model output are bespoke predictors. This custom-made set of predictors could be employed by future Δ-ML models to improve the quantitative prediction of other reaction properties.
Collapse
Affiliation(s)
| | - Pablo García Tahoces
- Department
of Electronics and Computer Science, University
of Santiago de Compostela, Santiago de Compostela 15782, Spain
| | - Jesús Pérez-Ríos
- Department
of Physics, Stony Brook University, Stony Brook, New York 11794, United States
- Institute
for Advanced Computational Science, Stony
Brook University, Stony
Brook, New York 11794-3800, United States
| | - Emilio Martínez Núñez
- Department
of Physical Chemistry, University of Santiago
de Compostela, Santiago
de Compostela 15782, Spain
| |
Collapse
|
9
|
Cameron AR, Proud AJ, Pearson JK. Machine Learned Composite Methods for Electronic Structure Theory. J Chem Theory Comput 2023; 19:51-60. [PMID: 36507875 DOI: 10.1021/acs.jctc.2c00564] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/15/2022]
Abstract
Because of the prohibitive scaling of ab initio techniques for modeling chemical species with high accuracy, they are not generally tractable for large systems. It is therefore of considerable interest to develop high-accuracy computational models with low computational cost that can afford predictions of electronic structure and properties of macromolecular species. Composite methods, as first introduced by Pople [Pople, J. A.; Head-Gordon, M.; Fox, D. J.; Raghavachari, K.; Curtiss, L. A. J. Chem. Phys.1989, 90, 5622.], are an intuitive solution to this problem as they seek to systematically increase accuracy in model chemistries by taking advantage of favorable error cancellation among reasonably low-cost models. By linearly combining a series of carefully chosen model chemistries, the result of a prohibitive-scaling correlated model chemistry with a large basis set may be approximated with relatively good fidelity. However, the full extent to which the choice of low-cost models dictates the predictive accuracy of composite methods is not known, and a full exploration of all model chemistries would be advantageous for the design and validation of a generalizable composite method for widespread application. Here, we show that remarkable accuracy can be generally achieved with composite methods that are more judiciously constructed, leading to increased accuracy with significantly reduced computational cost. By designing a systematic procedure for the automated generation and assessment of over 10 billion unique composite methods, we have extensively explored the space of modern model chemistries to elucidate important design principles in the construction of reliable composite procedures. We anticipate our work to be the starting point in the pursuit of creative approaches to modeling large chemical systems with high accuracy by using novel combinatorial modeling.
Collapse
Affiliation(s)
- Andrew R Cameron
- Institute for Quantum Computing, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada.,Department of Physics & Astronomy, University of Waterloo, Waterloo, OntarioN2L 3G1, Canada
| | - Adam J Proud
- Department of Chemistry, University of Prince Edward Island, 550 University Avenue, Charlottetown, Prince Edward IslandC1A 4P3, Canada
| | - Jason K Pearson
- Department of Chemistry, University of Prince Edward Island, 550 University Avenue, Charlottetown, Prince Edward IslandC1A 4P3, Canada
| |
Collapse
|
10
|
Li W, Luan Y, Zhang Q, Aires‐de‐Sousa J. Machine Learning to Predict Homolytic Dissociation Energies of C-H Bonds: Calibration of DFT-based Models with Experimental Data. Mol Inform 2023; 42:e2200193. [PMID: 36167940 PMCID: PMC10078411 DOI: 10.1002/minf.202200193] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/01/2022] [Accepted: 09/27/2022] [Indexed: 01/12/2023]
Abstract
Random Forest (RF) QSPR models were developed with a data set of homolytic bond dissociation energies (BDE) previously calculated by B3LYP/6-311++G(d,p)//DFTB for 2263 sp3C-H covalent bonds. The best set of attributes consisted in 114 descriptors of the carbon atom (counts of atom types in 5 spheres around the kernel atom and ring descriptors). The optimized model predicted the DFT-calculated BDE of an independent test set of 224 bonds with MAE=2.86 kcal/mol. A new data set of 409 bonds from the iBonD database (http://ibond.nankai.edu.cn) was predicted by the RF with a modest MAE (5.36 kcal/mol) but a relatively high R2 (0.75) against experimental energies. A prediction scheme was explored that corrects the RF prediction with the average deviation observed for the k nearest neighbours (KNN) in an additional memory of experimental data. The corrected predictions achieved MAE=2.22 kcal/mol for an independent test set of 145 bonds and the corresponding experimental bond energies.
Collapse
Affiliation(s)
- Wanli Li
- Henan Engineering Research Center of Industrial Circulating Water TreatmentHenan Joint International Research Laboratory of Environmental Pollution Control MaterialsHenan UniversityKaifeng475004P.R. China
| | - Yue Luan
- Henan Engineering Research Center of Industrial Circulating Water TreatmentHenan Joint International Research Laboratory of Environmental Pollution Control MaterialsHenan UniversityKaifeng475004P.R. China
| | - Qingyou Zhang
- Henan Engineering Research Center of Industrial Circulating Water TreatmentHenan Joint International Research Laboratory of Environmental Pollution Control MaterialsHenan UniversityKaifeng475004P.R. China
| | - Joao Aires‐de‐Sousa
- LAQV and REQUIMTEChemistry DepartmentNOVA School of Science and TechnologyUniversidade Nova de Lisboa2829-516CaparicaPortugal
| |
Collapse
|
11
|
Elsenety MM, Mohamed MBI, Sultan ME, Elsayed BA. Facile and highly precise pH-value estimation using common pH paper based on machine learning techniques and supported mobile devices. Sci Rep 2022; 12:22584. [PMID: 36585481 PMCID: PMC9803664 DOI: 10.1038/s41598-022-27054-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/10/2022] [Accepted: 12/23/2022] [Indexed: 12/31/2022] Open
Abstract
Numerous scientific, health care, and industrial applications are showing increasing interest in developing optical pH sensors with low-cost, high precision that cover a wide pH range. Although serious efforts, the development of high accuracy and cost-effectiveness, remains challenging. In this perspective, we present the implementation of the machine learning technique on the common pH paper for precise pH-value estimation. Further, we develop a simple, flexible, and free precise mobile application based on a machine learning algorithm to predict the accurate pH value of a solution using an available commercial pH paper. The common light conditions were studied under different light intensities of 350, 200, and 20 Lux. The models were trained using 2689 experimental values without a special instrument control. The pH range of 1: 14 is covered by an interval of ~ 0.1 pH value. The results show a significant relationship between pH values and both the red color and green color, in contrast to the poor correlation by the blue color. The K Neighbors Regressor model improves linearity and shows a significant coefficient of determination of 0.995 combined with the lowest errors. The free, publicly accessible online and mobile application was developed and enables the highly precise estimation of the pH value as a function of the RGB color code of typical pH paper. Our findings could replace higher expensive pH instruments using handheld pH detection, and an intelligent smartphone system for everyone, even the chef in the kitchen, without the need for additional costly and time-consuming experimental work.
Collapse
Affiliation(s)
- Mohamed M. Elsenety
- grid.411303.40000 0001 2155 6022Department of Chemistry, Faculty of Science, Al-Azhar University, Nasr City, Cairo 11884 Egypt
| | - Mahmoud Basseem I. Mohamed
- grid.411303.40000 0001 2155 6022Department of Chemistry, Faculty of Science, Al-Azhar University, Nasr City, Cairo 11884 Egypt
| | - Mohamed E. Sultan
- grid.411303.40000 0001 2155 6022Department of Chemistry, Faculty of Science, Al-Azhar University, Nasr City, Cairo 11884 Egypt
| | - Badr A. Elsayed
- grid.411303.40000 0001 2155 6022Department of Chemistry, Faculty of Science, Al-Azhar University, Nasr City, Cairo 11884 Egypt
| |
Collapse
|
12
|
Atwi R, Bliss M, Makeev M, Rajput NN. MISPR: an open-source package for high-throughput multiscale molecular simulations. Sci Rep 2022; 12:15760. [PMID: 36130978 PMCID: PMC9492707 DOI: 10.1038/s41598-022-20009-w] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2022] [Accepted: 09/07/2022] [Indexed: 11/09/2022] Open
Abstract
Computational tools provide a unique opportunity to study and design optimal materials by enhancing our ability to comprehend the connections between their atomistic structure and functional properties. However, designing materials with tailored functionalities is complicated due to the necessity to integrate various computational-chemistry software (not necessarily compatible with one another), the heterogeneous nature of the generated data, and the need to explore vast chemical and parameter spaces. The latter is especially important to avoid bias in scattered data points-based models and derive statistical trends only accessible by systematic datasets. Here, we introduce a robust high-throughput multi-scale computational infrastructure coined MISPR (Materials Informatics for Structure-Property Relationships) that seamlessly integrates classical molecular dynamics (MD) simulations with density functional theory (DFT). By enabling high-performance data analytics and coupling between different methods and scales, MISPR addresses critical challenges arising from the needs of automated workflow management and data provenance recording. The major features of MISPR include automated DFT and MD simulations, error handling, derivation of molecular and ensemble properties, and creation of output databases that organize results from individual calculations to enable reproducibility and transparency. In this work, we describe fully automated DFT workflows implemented in MISPR to compute various properties such as nuclear magnetic resonance chemical shift, binding energy, bond dissociation energy, and redox potential with support for multiple methods such as electron transfer and proton-coupled electron transfer reactions. The infrastructure also enables the characterization of large-scale ensemble properties by providing MD workflows that calculate a wide range of structural and dynamical properties in liquid solutions. MISPR employs the methodologies of materials informatics to facilitate understanding and prediction of phenomenological structure-property relationships, which are crucial to designing novel optimal materials for numerous scientific applications and engineering technologies.
Collapse
Affiliation(s)
- Rasha Atwi
- Department of Materials Science and Chemical Engineering, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Matthew Bliss
- Department of Materials Science and Chemical Engineering, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Maxim Makeev
- Department of Materials Science and Chemical Engineering, Stony Brook University, Stony Brook, NY, 11794, USA
| | - Nav Nidhi Rajput
- Department of Materials Science and Chemical Engineering, Stony Brook University, Stony Brook, NY, 11794, USA.
| |
Collapse
|
13
|
Boni YT, Cammarota RC, Liao K, Sigman MS, Davies HML. Leveraging Regio- and Stereoselective C(sp 3)-H Functionalization of Silyl Ethers to Train a Logistic Regression Classification Model for Predicting Site-Selectivity Bias. J Am Chem Soc 2022; 144:15549-15561. [PMID: 35977100 DOI: 10.1021/jacs.2c04383] [Citation(s) in RCA: 10] [Impact Index Per Article: 3.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The C-H functionalization of silyl ethers via carbene-induced C-H insertion represents an efficient synthetic disconnection strategy. In this work, site- and stereoselective C(sp3)-H functionalization at α, γ, δ, and even more distal positions to the siloxy group has been achieved using donor/acceptor carbene intermediates. By exploiting the predilections of Rh2(R-TCPTAD)4 and Rh2(S-2-Cl-5-BrTPCP)4 catalysts to target either more electronically activated or more spatially accessible C-H sites, respectively, divergent desired products can be formed with good diastereocontrol and enantiocontrol. Notably, the reaction can also be extended to enable desymmetrization of meso silyl ethers. Leveraging the broad substrate scope examined in this study, we have trained a machine learning classification model using logistic regression to predict the major C-H functionalization site based on intrinsic substrate reactivity and catalyst propensity for overriding it. This model enables prediction of the major product when applying these C-H functionalization methods to a new substrate of interest. Applying this model broadly, we have demonstrated its utility for guiding late-stage functionalization in complex settings and developed an intuitive visualization tool to assist synthetic chemists in such endeavors.
Collapse
Affiliation(s)
- Yannick T Boni
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, Georgia 30322, United States
| | - Ryan C Cammarota
- Department of Chemistry, University of Utah, 315 South 1400 East, Salt Lake City, Utah 84112, United States
| | - Kuangbiao Liao
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, Georgia 30322, United States
| | - Matthew S Sigman
- Department of Chemistry, University of Utah, 315 South 1400 East, Salt Lake City, Utah 84112, United States
| | - Huw M L Davies
- Department of Chemistry, Emory University, 1515 Dickey Drive, Atlanta, Georgia 30322, United States
| |
Collapse
|
14
|
Aziz M, Ejaz SA, Zargar S, Akhtar N, Aborode AT, A. Wani T, Batiha GES, Siddique F, Alqarni M, Akintola AA. Deep Learning and Structure-Based Virtual Screening for Drug Discovery against NEK7: A Novel Target for the Treatment of Cancer. Molecules 2022; 27:4098. [PMID: 35807344 PMCID: PMC9268522 DOI: 10.3390/molecules27134098] [Citation(s) in RCA: 28] [Impact Index Per Article: 9.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/26/2022] [Revised: 06/17/2022] [Accepted: 06/18/2022] [Indexed: 01/09/2023] Open
Abstract
NIMA-related kinase7 (NEK7) plays a multifunctional role in cell division and NLRP3 inflammasone activation. A typical expression or any mutation in the genetic makeup of NEK7 leads to the development of cancer malignancies and fatal inflammatory disease, i.e., breast cancer, non-small cell lung cancer, gout, rheumatoid arthritis, and liver cirrhosis. Therefore, NEK7 is a promising target for drug development against various cancer malignancies. The combination of drug repurposing and structure-based virtual screening of large libraries of compounds has dramatically improved the development of anticancer drugs. The current study focused on the virtual screening of 1200 benzene sulphonamide derivatives retrieved from the PubChem database by selecting and docking validation of the crystal structure of NEK7 protein (PDB ID: 2WQN). The compounds library was subjected to virtual screening using Auto Dock Vina. The binding energies of screened compounds were compared to standard Dabrafenib. In particular, compound 762 exhibited excellent binding energy of -42.67 kJ/mol, better than Dabrafenib (-33.89 kJ/mol). Selected drug candidates showed a reactive profile that was comparable to standard Dabrafenib. To characterize the stability of protein-ligand complexes, molecular dynamic simulations were performed, providing insight into the molecular interactions. The NEK7-Dabrafenib complex showed stability throughout the simulated trajectory. In addition, binding affinities, pIC50, and ADMET profiles of drug candidates were predicted using deep learning models. Deep learning models predicted the binding affinity of compound 762 best among all derivatives, which supports the findings of virtual screening. These findings suggest that top hits can serve as potential inhibitors of NEK7. Moreover, it is recommended to explore the inhibitory potential of identified hits compounds through in-vitro and in-vivo approaches.
Collapse
Affiliation(s)
- Mubashir Aziz
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan;
| | - Syeda Abida Ejaz
- Department of Pharmaceutical Chemistry, Faculty of Pharmacy, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan;
| | - Seema Zargar
- Department of Biochemistry, College of Science, King Saud University, P.O. Box 22452, Riyadh 11451, Saudi Arabia;
| | - Naveed Akhtar
- Department of Pharmaceutics, Faculty of Pharmacy, The Islamia University of Bahawalpur, Bahawalpur 63100, Pakistan;
| | | | - Tanveer A. Wani
- Department of Pharmaceutical Chemistry, College of Pharmacy, King Saud University, P.O. Box 2457, Riyadh 11451, Saudi Arabia
| | - Gaber El-Saber Batiha
- Department of Pharmacology and Therapeutics, Faculty of Veterinary Medicine, Damanhour University, Damanhour 22511, AlBeheira, Egypt;
| | - Farhan Siddique
- Laboratory of Organic Electronics, Department of Science and Technology, Linköping University, SE-60174 Norrköping, Sweden;
- Department of Pharmacy, Royal Institute of Medical Sciences (RIMS), Multan 60000, Pakistan
| | - Mohammed Alqarni
- Department of Pharmaceutical Chemistry, College of Pharmacy, Taif University, P.O. Box 11099, Taif 21944, Saudi Arabia;
| | - Ashraf Akintayo Akintola
- Department of Biomedical Convergence Science and Technology, Kyungpook National University, Daegu 41566, Korea;
| |
Collapse
|
15
|
Mamede R, Pereira F, Aires-de-Sousa J. Machine learning prediction of UV-Vis spectra features of organic compounds related to photoreactive potential. Sci Rep 2021; 11:23720. [PMID: 34887473 PMCID: PMC8660842 DOI: 10.1038/s41598-021-03070-9] [Citation(s) in RCA: 12] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/28/2021] [Accepted: 11/22/2021] [Indexed: 11/09/2022] Open
Abstract
Machine learning (ML) algorithms were explored for the classification of the UV-Vis absorption spectrum of organic molecules based on molecular descriptors and fingerprints generated from 2D chemical structures. Training and test data (~ 75 k molecules and associated UV-Vis data) were assembled from a database with lists of experimental absorption maxima. They were labeled with positive class (related to photoreactive potential) if an absorption maximum is reported in the range between 290 and 700 nm (UV/Vis) with molar extinction coefficient (MEC) above 1000 Lmol-1 cm-1, and as negative if no such a peak is in the list. Random forests were selected among several algorithms. The models were validated with two external test sets comprising 998 organic molecules, obtaining a global accuracy up to 0.89, sensitivity of 0.90 and specificity of 0.88. The ML output (UV-Vis spectrum class) was explored as a predictor of the 3T3 NRU phototoxicity in vitro assay for a set of 43 molecules. Comparable results were observed with the classification directly based on experimental UV-Vis data in the same format.
Collapse
Affiliation(s)
- Rafael Mamede
- LAQV-REQUIMTE, Department of Chemistry, NOVA School of Science and Technology, Universidade Nova de Lisboa, 2829-516, Caparica, Portugal
| | - Florbela Pereira
- LAQV-REQUIMTE, Department of Chemistry, NOVA School of Science and Technology, Universidade Nova de Lisboa, 2829-516, Caparica, Portugal
| | - João Aires-de-Sousa
- LAQV-REQUIMTE, Department of Chemistry, NOVA School of Science and Technology, Universidade Nova de Lisboa, 2829-516, Caparica, Portugal.
| |
Collapse
|
16
|
Prasad VK, Khalilian MH, Otero-de-la-Roza A, DiLabio GA. BSE49, a diverse, high-quality benchmark dataset of separation energies of chemical bonds. Sci Data 2021; 8:300. [PMID: 34815431 PMCID: PMC8611007 DOI: 10.1038/s41597-021-01088-2] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/06/2021] [Accepted: 11/01/2021] [Indexed: 01/23/2023] Open
Abstract
We present an extensive and diverse dataset of bond separation energies associated with the homolytic cleavage of covalently bonded molecules (A-B) into their corresponding radical fragments (A. and B.). Our dataset contains two different classifications of model structures referred to as "Existing" (molecules with associated experimental data) and "Hypothetical" (molecules with no associated experimental data). In total, the dataset consists of 4502 datapoints (1969 datapoints from the Existing and 2533 datapoints from the Hypothetical classes). The dataset covers 49 unique X-Y type single bonds (except H-H, H-F, and H-Cl), where X and Y are H, B, C, N, O, F, Si, P, S, and Cl atoms. All the reference data was calculated at the (RO)CBS-QB3 level of theory. The reference bond separation energies are non-relativistic ground-state energy differences and contain no zero-point energy corrections. This new dataset of bond separation energies (BSE49) is presented as a high-quality reference dataset for assessing and developing computational chemistry methods.
Collapse
Affiliation(s)
- Viki Kumar Prasad
- Department of Chemistry, University of British Columbia, Kelowna, British Columbia, V1V 1V7, Canada
| | - M Hossein Khalilian
- Department of Chemistry, University of British Columbia, Kelowna, British Columbia, V1V 1V7, Canada
| | - Alberto Otero-de-la-Roza
- Departamento de Química Física y Analítica, Facultad de Química, Universidad de Oviedo, MALTA Consolider Team, E-33006, Oviedo, Spain
| | - Gino A DiLabio
- Department of Chemistry, University of British Columbia, Kelowna, British Columbia, V1V 1V7, Canada.
| |
Collapse
|
17
|
Gallegos LC, Luchini G, St. John PC, Kim S, Paton RS. Importance of Engineered and Learned Molecular Representations in Predicting Organic Reactivity, Selectivity, and Chemical Properties. Acc Chem Res 2021; 54:827-836. [PMID: 33534534 DOI: 10.1021/acs.accounts.0c00745] [Citation(s) in RCA: 46] [Impact Index Per Article: 11.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/03/2023]
Abstract
Machine-readable chemical structure representations are foundational in all attempts to harness machine learning for the prediction of reactivities, selectivities, and chemical properties directly from molecular structure. The featurization of discrete chemical structures into a continuous vector space is a critical phase undertaken before model selection, and the development of new ways to quantitatively encode molecules is an active area of research. In this Account, we highlight the application and suitability of different representations, from expert-guided "engineered" descriptors to automatically "learned" features, in different prediction tasks relevant to organic and organometallic chemistry, where differing amounts of training data are available. These tasks include statistical models of stereo- and enantioselectivity, thermochemistry, and kinetics developed using experimental and quantum chemical data.The use of expert-guided molecular descriptors provides an opportunity to incorporate chemical knowledge, domain expertise, and physical constraints into statistical modeling. In applications to stereoselective organic and organometallic catalysis, where data sets may be relatively small and 3D-geometries and conformations play an important role, mechanistically informed features can be used successfully to obtain predictive statistical models that are also chemically interpretable. We provide an overview of several recent applications of this approach to obtain quantitative models for reactivity and selectivity, where topological descriptors, quantum mechanical calculations of electronic and steric properties, along with conformational ensembles, all feature as essential ingredients of the molecular representations used.Alternatively, more flexible, general-purpose molecular representations such as attributed molecular graphs can be used with machine learning approaches to learn the complex relationship between a structure and prediction target. This approach has the potential to out-perform more traditional representation methods such as "hand-crafted" molecular descriptors, particularly as data set sizes grow. One area where this is particularly relevant is in the use of large sets of quantum mechanical data to train quantitative structure-property relationships. A general approach toward curating useful data sets and training highly accurate graph neural network models is discussed in the context of organic bond dissociation enthalpies, where this strategy outperforms regression using precomputed descriptors.Finally, we describe how graph neural network predictions can be incorporated into mechanistically informed statistical models of chemical reactivity and selectivity. Once trained, this approach avoids the expensive computational overhead associated with quantum mechanical calculations, while maintaining chemical interpretability. We illustrate examples for which fast predictions of bond dissociation enthalpy and of the identities of radicals formed through cleavage of a molecule's weakest bond are used in simple physical models of site-selectivity and reactivity.
Collapse
Affiliation(s)
- Liliana C. Gallegos
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523, United States
| | - Guilian Luchini
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523, United States
| | - Peter C. St. John
- Biosciences Center, National Renewable Energy Laboratory, 15103 Denver West Parkway, Golden, Colorado 80401, United States
| | - Seonah Kim
- Biosciences Center, National Renewable Energy Laboratory, 15103 Denver West Parkway, Golden, Colorado 80401, United States
| | - Robert S. Paton
- Department of Chemistry, Colorado State University, Fort Collins, Colorado 80523, United States
| |
Collapse
|
18
|
Wen M, Blau SM, Spotte-Smith EWC, Dwaraknath S, Persson KA. BonDNet: a graph neural network for the prediction of bond dissociation energies for charged molecules. Chem Sci 2020; 12:1858-1868. [PMID: 34163950 PMCID: PMC8179073 DOI: 10.1039/d0sc05251e] [Citation(s) in RCA: 36] [Impact Index Per Article: 7.2] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/20/2020] [Accepted: 12/03/2020] [Indexed: 12/13/2022] Open
Abstract
A broad collection of technologies, including e.g. drug metabolism, biofuel combustion, photochemical decontamination of water, and interfacial passivation in energy production/storage systems rely on chemical processes that involve bond-breaking molecular reactions. In this context, a fundamental thermodynamic property of interest is the bond dissociation energy (BDE) which measures the strength of a chemical bond. Fast and accurate prediction of BDEs for arbitrary molecules would lay the groundwork for data-driven projections of complex reaction cascades and hence a deeper understanding of these critical chemical processes and, ultimately, how to reverse design them. In this paper, we propose a chemically inspired graph neural network machine learning model, BonDNet, for the rapid and accurate prediction of BDEs. BonDNet maps the difference between the molecular representations of the reactants and products to the reaction BDE. Because of the use of this difference representation and the introduction of global features, including molecular charge, it is the first machine learning model capable of predicting both homolytic and heterolytic BDEs for molecules of any charge. To test the model, we have constructed a dataset of both homolytic and heterolytic BDEs for neutral and charged (-1 and +1) molecules. BonDNet achieves a mean absolute error (MAE) of 0.022 eV for unseen test data, significantly below chemical accuracy (0.043 eV). Besides the ability to handle complex bond dissociation reactions that no previous model could consider, BonDNet distinguishes itself even in only predicting homolytic BDEs for neutral molecules; it achieves an MAE of 0.020 eV on the PubChem BDE dataset, a 20% improvement over the previous best performing model. We gain additional insight into the model's predictions by analyzing the patterns in the features representing the molecules and the bond dissociation reactions, which are qualitatively consistent with chemical rules and intuition. BonDNet is just one application of our general approach to representing and learning chemical reactivity, and it could be easily extended to the prediction of other reaction properties in the future.
Collapse
Affiliation(s)
- Mingjian Wen
- Department of Materials Science and Engineering, University of California Berkeley CA 94720 USA
- Energy Technologies Area, Lawrence Berkeley National Laboratory Berkeley CA 94720 USA
| | - Samuel M Blau
- Energy Technologies Area, Lawrence Berkeley National Laboratory Berkeley CA 94720 USA
| | - Evan Walter Clark Spotte-Smith
- Department of Materials Science and Engineering, University of California Berkeley CA 94720 USA
- Energy Technologies Area, Lawrence Berkeley National Laboratory Berkeley CA 94720 USA
| | - Shyam Dwaraknath
- Energy Technologies Area, Lawrence Berkeley National Laboratory Berkeley CA 94720 USA
| | - Kristin A Persson
- Department of Materials Science and Engineering, University of California Berkeley CA 94720 USA
- Molecular Foundry, Lawrence Berkeley National Laboratory Berkeley CA 94720 USA
| |
Collapse
|
19
|
Prediction of organic homolytic bond dissociation enthalpies at near chemical accuracy with sub-second computational cost. Nat Commun 2020; 11:2328. [PMID: 32393773 PMCID: PMC7214445 DOI: 10.1038/s41467-020-16201-z] [Citation(s) in RCA: 114] [Impact Index Per Article: 22.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/24/2019] [Accepted: 04/15/2020] [Indexed: 12/31/2022] Open
Abstract
Bond dissociation enthalpies (BDEs) of organic molecules play a fundamental role in determining chemical reactivity and selectivity. However, BDE computations at sufficiently high levels of quantum mechanical theory require substantial computing resources. In this paper, we develop a machine learning model capable of accurately predicting BDEs for organic molecules in a fraction of a second. We perform automated density functional theory (DFT) calculations at the M06-2X/def2-TZVP level of theory for 42,577 small organic molecules, resulting in 290,664 BDEs. A graph neural network trained on a subset of these results achieves a mean absolute error of 0.58 kcal mol-1 (vs DFT) for BDEs of unseen molecules. We further demonstrate the model on two applications: first, we rapidly and accurately predict major sites of hydrogen abstraction in the metabolism of drug-like molecules, and second, we determine the dominant molecular fragmentation pathways during soot formation.
Collapse
|
20
|
Theoretical O–CH3 bond dissociation enthalpies of selected aromatic and non-aromatic molecules. Theor Chem Acc 2020. [DOI: 10.1007/s00214-020-02592-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
21
|
Besora M, Olmos A, Gava R, Noverges B, Asensio G, Caballero A, Maseras F, Pérez PJ. A Quantitative Model for Alkane Nucleophilicity Based on C−H Bond Structural/Topological Descriptors. Angew Chem Int Ed Engl 2020. [DOI: 10.1002/ange.201914386] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
Affiliation(s)
- Maria Besora
- Institute of Chemical Research of Catalonia (ICIQ) The Barcelona Institute of Science and Technology Avgda. Països Catalans 16, 43007 Tarragona Spain
- Departament de Química Física i Inorgànica Universitat Rovira i Virgili 43007 Tarragona Spain
| | - Andrea Olmos
- Departamento de Química Orgánica, Facultad de Farmacia Universitat de València Burjassot 46100 València Spain
| | - Riccardo Gava
- Laboratorio de Catálisis Homogénea Unidad Asociada al CSIC, CIQSO-Centro de Investigación en Química Sostenible and Departamento de Química Universidad de Huelva 21007 Huelva Spain
| | - Bárbara Noverges
- Departamento de Química Orgánica, Facultad de Farmacia Universitat de València Burjassot 46100 València Spain
| | - Gregorio Asensio
- Departamento de Química Orgánica, Facultad de Farmacia Universitat de València Burjassot 46100 València Spain
| | - Ana Caballero
- Laboratorio de Catálisis Homogénea Unidad Asociada al CSIC, CIQSO-Centro de Investigación en Química Sostenible and Departamento de Química Universidad de Huelva 21007 Huelva Spain
| | - Feliu Maseras
- Institute of Chemical Research of Catalonia (ICIQ) The Barcelona Institute of Science and Technology Avgda. Països Catalans 16, 43007 Tarragona Spain
- Departament de Química Física i Inorgànica Universitat Rovira i Virgili 43007 Tarragona Spain
| | - Pedro J. Pérez
- Laboratorio de Catálisis Homogénea Unidad Asociada al CSIC, CIQSO-Centro de Investigación en Química Sostenible and Departamento de Química Universidad de Huelva 21007 Huelva Spain
| |
Collapse
|
22
|
Besora M, Olmos A, Gava R, Noverges B, Asensio G, Caballero A, Maseras F, Pérez PJ. A Quantitative Model for Alkane Nucleophilicity Based on C-H Bond Structural/Topological Descriptors. Angew Chem Int Ed Engl 2020; 59:3112-3116. [PMID: 31826300 DOI: 10.1002/anie.201914386] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/11/2019] [Indexed: 11/11/2022]
Abstract
A first quantitative model for calculating the nucleophilicity of alkanes is described. A statistical treatment was applied to the analysis of the reactivity of 29 different alkane C-H bonds towards in situ generated metal carbene electrophiles. The correlation of the recently reported experimental reactivity with two different sets of descriptors comprising a total of 86 parameters was studied, resulting in the quantitative descriptor-based alkane nucleophilicity (QDEAN) model. This model consists of an equation with only six structural/topological descriptors, and reproduces the relative reactivity of the alkane C-H bonds. This reactivity can be calculated from parameters emerging from the schematic drawing of the alkane and a simple set of sums.
Collapse
Affiliation(s)
- Maria Besora
- Institute of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology, Avgda. Països Catalans, 16, 43007, Tarragona, Spain.,Departament de Química Física i Inorgànica, Universitat Rovira i Virgili, 43007, Tarragona, Spain
| | - Andrea Olmos
- Departamento de Química Orgánica, Facultad de Farmacia, Universitat de València, Burjassot, 46100, València, Spain
| | - Riccardo Gava
- Laboratorio de Catálisis Homogénea, Unidad Asociada al CSIC, CIQSO-Centro de Investigación en Química Sostenible and Departamento de Química, Universidad de Huelva, 21007, Huelva, Spain
| | - Bárbara Noverges
- Departamento de Química Orgánica, Facultad de Farmacia, Universitat de València, Burjassot, 46100, València, Spain
| | - Gregorio Asensio
- Departamento de Química Orgánica, Facultad de Farmacia, Universitat de València, Burjassot, 46100, València, Spain
| | - Ana Caballero
- Laboratorio de Catálisis Homogénea, Unidad Asociada al CSIC, CIQSO-Centro de Investigación en Química Sostenible and Departamento de Química, Universidad de Huelva, 21007, Huelva, Spain
| | - Feliu Maseras
- Institute of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology, Avgda. Països Catalans, 16, 43007, Tarragona, Spain.,Departament de Química Física i Inorgànica, Universitat Rovira i Virgili, 43007, Tarragona, Spain
| | - Pedro J Pérez
- Laboratorio de Catálisis Homogénea, Unidad Asociada al CSIC, CIQSO-Centro de Investigación en Química Sostenible and Departamento de Química, Universidad de Huelva, 21007, Huelva, Spain
| |
Collapse
|
23
|
Guda AA, Guda SA, Lomachenko KA, Soldatov MA, Pankin IA, Soldatov AV, Braglia L, Bugaev AL, Martini A, Signorile M, Groppo E, Piovano A, Borfecchia E, Lamberti C. Quantitative structural determination of active sites from in situ and operando XANES spectra: From standard ab initio simulations to chemometric and machine learning approaches. Catal Today 2019. [DOI: 10.1016/j.cattod.2018.10.071] [Citation(s) in RCA: 35] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/19/2023]
|
24
|
Bauer CA, Schneider G, Göller AH. Machine learning models for hydrogen bond donor and acceptor strengths using large and diverse training data generated by first-principles interaction free energies. J Cheminform 2019; 11:59. [PMID: 33430967 PMCID: PMC6737620 DOI: 10.1186/s13321-019-0381-4] [Citation(s) in RCA: 14] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/04/2019] [Accepted: 08/10/2019] [Indexed: 02/06/2023] Open
Abstract
We present machine learning (ML) models for hydrogen bond acceptor (HBA) and hydrogen bond donor (HBD) strengths. Quantum chemical (QC) free energies in solution for 1:1 hydrogen-bonded complex formation to the reference molecules 4-fluorophenol and acetone serve as our target values. Our acceptor and donor databases are the largest on record with 4426 and 1036 data points, respectively. After scanning over radial atomic descriptors and ML methods, our final trained HBA and HBD ML models achieve RMSEs of 3.8 kJ mol-1 (acceptors), and 2.3 kJ mol-1 (donors) on experimental test sets, respectively. This performance is comparable with previous models that are trained on experimental hydrogen bonding free energies, indicating that molecular QC data can serve as substitute for experiment. The potential ramifications thereof could lead to a full replacement of wetlab chemistry for HBA/HBD strength determination by QC. As a possible chemical application of our ML models, we highlight our predicted HBA and HBD strengths as possible descriptors in two case studies on trends in intramolecular hydrogen bonding.
Collapse
Affiliation(s)
- Christoph A Bauer
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), 8093, Zurich, Switzerland
| | - Gisbert Schneider
- Department of Chemistry and Applied Biosciences, Swiss Federal Institute of Technology (ETH), 8093, Zurich, Switzerland.
| | | |
Collapse
|
25
|
Nakai H, Seino J, Nakamura K. Bond Energy Density Analysis Combined with Informatics Technique. J Phys Chem A 2019; 123:7777-7784. [DOI: 10.1021/acs.jpca.9b04030] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Affiliation(s)
- Hiromi Nakai
- Department of Chemistry and Biochemistry, School of Advanced Science and Engineering, Waseda University, Tokyo 169-8555, Japan
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan
- ESICB, Kyoto University, Kyotodaigaku-Katsura, Nishigyoku, Kyoto 615-8520, Japan
| | - Junji Seino
- Waseda Research Institute for Science and Engineering, Waseda University, Tokyo 169-8555, Japan
- PRESTO, Japan Science and Technology Agency, 4-1-8 Honcho, Kawaguchi, Saitama 332-0012, Japan
| | - Kairi Nakamura
- Department of Chemistry and Biochemistry, School of Advanced Science and Engineering, Waseda University, Tokyo 169-8555, Japan
| |
Collapse
|
26
|
Nikolaienko TY. The maximum occupancy condition for the localized property-optimized orbitals. Phys Chem Chem Phys 2019; 21:5285-5294. [PMID: 30778429 DOI: 10.1039/c8cp07276k] [Citation(s) in RCA: 5] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023]
Abstract
It is shown analytically that the Chemist's Localized Property-optimized Orbitals (CLPOs), which are the localized orbitals obtainable from the results of ab initio calculations by the open-source program JANPA (http://janpa.sourceforge.net/) according to the recently proposed optimal property partitioning condition, form the Lewis structure with nearly maximum possible total electron occupancy. The conditions required for this additional optimality to hold are discussed. In particular, when a single-determinant wavefunction is used to describe the molecular system without a noticeable electron delocalization, CLPOs derived from this wavefunction approximately optimize the same target quantity as the Natural Bond Orbitals (NBOs), establishing in this way the link between the two sets of localized orbitals. The performance of CLPO and NBO methods is compared by using a dataset containing 7101 small molecules, and the relevant methodological features of both methods are discussed.
Collapse
Affiliation(s)
- Tymofii Yu Nikolaienko
- Faculty of Physics of Taras Shevchenko National University of Kyiv, 64/13 Volodymyrska Str., Kyiv 01601, Ukraine.
| |
Collapse
|
27
|
Lakuntza O, Besora M, Maseras F. Searching for Hidden Descriptors in the Metal–Ligand Bond through Statistical Analysis of Density Functional Theory (DFT) Results. Inorg Chem 2018; 57:14660-14670. [DOI: 10.1021/acs.inorgchem.8b02372] [Citation(s) in RCA: 16] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
Affiliation(s)
- Oier Lakuntza
- Institute of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology, Avgda. Països Catalans, 16, 43007 Tarragona, Catalonia, Spain
| | - Maria Besora
- Institute of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology, Avgda. Països Catalans, 16, 43007 Tarragona, Catalonia, Spain
| | - Feliu Maseras
- Institute of Chemical Research of Catalonia (ICIQ), The Barcelona Institute of Science and Technology, Avgda. Països Catalans, 16, 43007 Tarragona, Catalonia, Spain
- Department de Química, Universitat Autònoma de Barcelona, 08193 Bellaterra, Catalonia, Spain
| |
Collapse
|
28
|
Li H, Collins C, Tanha M, Gordon GJ, Yaron DJ. A Density Functional Tight Binding Layer for Deep Learning of Chemical Hamiltonians. J Chem Theory Comput 2018; 14:5764-5776. [PMID: 30351008 DOI: 10.1021/acs.jctc.8b00873] [Citation(s) in RCA: 61] [Impact Index Per Article: 8.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Current neural networks for predictions of molecular properties use quantum chemistry only as a source of training data. This paper explores models that use quantum chemistry as an integral part of the prediction process. This is done by implementing self-consistent-charge Density-Functional-Tight-Binding (DFTB) theory as a layer for use in deep learning models. The DFTB layer takes, as input, Hamiltonian matrix elements generated from earlier layers and produces, as output, electronic properties from self-consistent field solutions of the corresponding DFTB Hamiltonian. Backpropagation enables efficient training of the model to target electronic properties. Two types of input to the DFTB layer are explored, splines and feed-forward neural networks. Because overfitting can cause models trained on smaller molecules to perform poorly on larger molecules, regularizations are applied that penalize nonmonotonic behavior and deviation of the Hamiltonian matrix elements from those of the published DFTB model used to initialize the model. The approach is evaluated on 15 700 hydrocarbons by comparing the root-mean-square error in energy and dipole moment, on test molecules with eight heavy atoms, to the error from the initial DFTB model. When trained on molecules with up to seven heavy atoms, the spline model reduces the test error in energy by 60% and in dipole moments by 42%. The neural network model performs somewhat better, with error reductions of 67% and 59%, respectively. Training on molecules with up to four heavy atoms reduces performance, with both the spline and neural net models reducing the test error in energy by about 53% and in dipole by about 25%.
Collapse
|
29
|
Collins CR, Gordon GJ, von Lilienfeld OA, Yaron DJ. Constant size descriptors for accurate machine learning models of molecular properties. J Chem Phys 2018; 148:241718. [DOI: 10.1063/1.5020441] [Citation(s) in RCA: 73] [Impact Index Per Article: 10.4] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/07/2023] Open
Affiliation(s)
- Christopher R. Collins
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - Geoffrey J. Gordon
- Machine Learning Department, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| | - O. Anatole von Lilienfeld
- Department of Chemistry, Institute of Physical Chemistry and National Center for Computational Design and Discovery of Novel Materials (MARVEL), University of Basel, 4056 Basel, Switzerland
| | - David J. Yaron
- Department of Chemistry, Carnegie Mellon University, Pittsburgh, Pennsylvania 15213, USA
| |
Collapse
|
30
|
Carpenter BK, Ezra GS, Farantos SC, Kramer ZC, Wiggins S. Empirical Classification of Trajectory Data: An Opportunity for the Use of Machine Learning in Molecular Dynamics. J Phys Chem B 2017; 122:3230-3241. [PMID: 28968092 DOI: 10.1021/acs.jpcb.7b08707] [Citation(s) in RCA: 13] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Abstract
Classical Hamiltonian trajectories are initiated at random points in phase space on a fixed energy shell of a model two degrees of freedom potential, consisting of two interacting minima in an otherwise flat energy plane of infinite extent. Below the energy of the plane, the dynamics are demonstrably chaotic. However, most of the work in this paper involves trajectories at a fixed energy that is 1% above that of the plane, in which regime the dynamics exhibit behavior characteristic of chaotic scattering. The trajectories are analyzed without reference to the potential, as if they had been generated in a typical direct molecular dynamics simulation. The questions addressed are whether one can recover useful information about the structures controlling the dynamics in phase space from the trajectory data alone, and whether, despite the at least partially chaotic nature of the dynamics, one can make statistically meaningful predictions of trajectory outcomes from initial conditions. It is found that key unstable periodic orbits, which can be identified on the analytical potential, appear by simple classification of the trajectories, and that the specific roles of these periodic orbits in controlling the dynamics are also readily discerned from the trajectory data alone. Two different approaches to predicting trajectory outcomes from initial conditions are evaluated, and it is shown that the more successful of them has ∼90% success. The results are compared with those from a simple neural network, which has higher predictive success (97%) but requires the information obtained from the "by-hand" analysis to achieve that level. Finally, the dynamics, which occur partly on the very flat region of the potential, show characteristics of the much-studied phenomenon called "roaming." On this potential, it is found that roaming trajectories are effectively "failed" periodic orbits and that angular momentum can be identified as a key controlling factor, despite the fact that it is not a strictly conserved quantity. It is also noteworthy that roaming on this potential occurs in the absence of a "roaming saddle," which has previously been hypothesized to be a necessary feature for roaming to occur.
Collapse
Affiliation(s)
- Barry K Carpenter
- School of Chemistry , Cardiff University , Cardiff CF10 3AT , United Kingdom
| | - Gregory S Ezra
- Department of Chemistry and Chemical Biology , Cornell University , Ithaca , New York 14853-1301 , United States
| | - Stavros C Farantos
- Institute of Electronic Structure and Laser, Foundation for Research and Technology - Hellas, and Department of Chemistry , University of Crete , Iraklion 711 10 , Greece
| | - Zeb C Kramer
- Department of Chemistry and Biochemistry , La Salle University , 1900 West Olney Avenue , Philadelphia , Pennsylvania 19141 , United States
| | - Stephen Wiggins
- School of Mathematics , University of Bristol , Bristol BS8 1TW , United Kingdom
| |
Collapse
|
31
|
Yao K, Herr JE, Brown SN, Parkhill J. Intrinsic Bond Energies from a Bonds-in-Molecules Neural Network. J Phys Chem Lett 2017; 8:2689-2694. [PMID: 28573865 DOI: 10.1021/acs.jpclett.7b01072] [Citation(s) in RCA: 76] [Impact Index Per Article: 9.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 05/14/2023]
Abstract
Neural networks are being used to make new types of empirical chemical models as inexpensive as force fields, but with accuracy similar to the ab initio methods used to build them. In this work, we present a neural network that predicts the energies of molecules as a sum of intrinsic bond energies. The network learns the total energies of the popular GDB9 database to a competitive MAE of 0.94 kcal/mol on molecules outside of its training set, is naturally linearly scaling, and applicable to molecules consisting of thousands of bonds. More importantly, it gives chemical insight into the relative strengths of bonds as a function of their molecular environment, despite only being trained on total energy information. We show that the network makes predictions of relative bond strengths in good agreement with measured trends and human predictions. A Bonds-in-Molecules Neural Network (BIM-NN) learns heuristic relative bond strengths like expert synthetic chemists, and compares well with ab initio bond order measures such as NBO analysis.
Collapse
Affiliation(s)
- Kun Yao
- Department of Chemistry and Biochemistry, The University of Notre Dame du Lac , Notre Dame, Indiana 46556, United States
| | - John E Herr
- Department of Chemistry and Biochemistry, The University of Notre Dame du Lac , Notre Dame, Indiana 46556, United States
| | - Seth N Brown
- Department of Chemistry and Biochemistry, The University of Notre Dame du Lac , Notre Dame, Indiana 46556, United States
| | - John Parkhill
- Department of Chemistry and Biochemistry, The University of Notre Dame du Lac , Notre Dame, Indiana 46556, United States
| |
Collapse
|
32
|
Pereira F, Xiao K, Latino DARS, Wu C, Zhang Q, Aires-de-Sousa J. Machine Learning Methods to Predict Density Functional Theory B3LYP Energies of HOMO and LUMO Orbitals. J Chem Inf Model 2016; 57:11-21. [PMID: 28033004 DOI: 10.1021/acs.jcim.6b00340] [Citation(s) in RCA: 87] [Impact Index Per Article: 9.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Machine learning algorithms were explored for the fast estimation of HOMO and LUMO orbital energies calculated by DFT B3LYP, on the basis of molecular descriptors exclusively based on connectivity. The whole project involved the retrieval and generation of molecular structures, quantum chemical calculations for a database with >111 000 structures, development of new molecular descriptors, and training/validation of machine learning models. Several machine learning algorithms were screened, and an applicability domain was defined based on Euclidean distances to the training set. Random forest models predicted an external test set of 9989 compounds achieving mean absolute error (MAE) up to 0.15 and 0.16 eV for the HOMO and LUMO orbitals, respectively. The impact of the quantum chemical calculation protocol was assessed with a subset of compounds. Inclusion of the orbital energy calculated by PM7 as an additional descriptor significantly improved the quality of estimations (reducing the MAE in >30%).
Collapse
Affiliation(s)
- Florbela Pereira
- LAQV and REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa , 2829-516 Caparica, Portugal
| | - Kaixia Xiao
- Henan Engineering Research Center of Industrial Circulating Water Treatment, College of Chemistry and Chemical Engineering, Henan University , Kaifeng, 475004, PR China
| | - Diogo A R S Latino
- LAQV and REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa , 2829-516 Caparica, Portugal
| | - Chengcheng Wu
- Henan Engineering Research Center of Industrial Circulating Water Treatment, College of Chemistry and Chemical Engineering, Henan University , Kaifeng, 475004, PR China
| | - Qingyou Zhang
- Henan Engineering Research Center of Industrial Circulating Water Treatment, College of Chemistry and Chemical Engineering, Henan University , Kaifeng, 475004, PR China
| | - Joao Aires-de-Sousa
- LAQV and REQUIMTE, Departamento de Química, Faculdade de Ciências e Tecnologia, Universidade Nova de Lisboa , 2829-516 Caparica, Portugal
| |
Collapse
|
33
|
Almas QL, Keefe BL, Profitt T, Pearson JK. Choosing an appropriate model chemistry in a big data context: Application to dative bonding. COMPUT THEOR CHEM 2016. [DOI: 10.1016/j.comptc.2016.04.005] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022]
|
34
|
Chemoinformatics: Achievements and Challenges, a Personal View. Molecules 2016; 21:151. [PMID: 26828468 PMCID: PMC6273366 DOI: 10.3390/molecules21020151] [Citation(s) in RCA: 47] [Impact Index Per Article: 5.2] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/20/2015] [Revised: 01/14/2016] [Accepted: 01/20/2016] [Indexed: 11/16/2022] Open
Abstract
Chemoinformatics provides computer methods for learning from chemical data and for modeling tasks a chemist is facing. The field has evolved in the past 50 years and has substantially shaped how chemical research is performed by providing access to chemical information on a scale unattainable by traditional methods. Many physical, chemical and biological data have been predicted from structural data. For the early phases of drug design, methods have been developed that are used in all major pharmaceutical companies. However, all domains of chemistry can benefit from chemoinformatics methods; many areas that are not yet well developed, but could substantially gain from the use of chemoinformatics methods. The quality of data is of crucial importance for successful results. Computer-assisted structure elucidation and computer-assisted synthesis design have been attempted in the early years of chemoinformatics. Because of the importance of these fields to the chemist, new approaches should be made with better hardware and software techniques. Society's concern about the impact of chemicals on human health and the environment could be met by the development of methods for toxicity prediction and risk assessment. In conjunction with bioinformatics, our understanding of the events in living organisms could be deepened and, thus, novel strategies for curing diseases developed. With so many challenging tasks awaiting solutions, the future is bright for chemoinformatics.
Collapse
|