1
|
Deng Q, Cao CT, Cao C. General Equation for Expressing the Physicochemical Properties of Aliphatic Alcohols. ACS OMEGA 2025; 10:1571-1580. [PMID: 39829593 PMCID: PMC11739981 DOI: 10.1021/acsomega.4c09457] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/16/2024] [Revised: 12/17/2024] [Accepted: 12/20/2024] [Indexed: 01/22/2025]
Abstract
In this work, two general equations were proposed to express the nonlinear and linear changes in physicochemical properties of aliphatic alcohols, involving boiling point, refractive index, critical temperature, critical volume, and so on. The two general equations all are expressed with the same six molecular descriptors. The results show that the linear and nonlinear change properties of aliphatic alcohols have good correlations with the same six molecular descriptors. Using the obtained correlation equations, various properties of aliphatic alcohols without experimental values were predicted, involving 15 normal boiling points, 96 refractive indexes, 105 critical temperatures, 109 critical pressures, 100 liquid densities, 136 heat capacities, 107 critical volumes, and 130 enthalpies of the formation of liquid, a total of 798 values. The relationship between the nonlinear and linear change properties can be deduced by using the obtained general equations, which connect different properties of aliphatic alcohols. In addition, this paper combined the general estimation models used in the properties of aliphatic alcohols and aliphatic primary amines, and then, by taking the boiling point and critical temperature as examples, a general estimation model with good correlation and high prediction accuracy was obtained via adding another two molecular structure characteristic parameters. It is a meaningful exploration for establishing general models for monosubstituted alkane RX compounds with different substituents in the future.
Collapse
Affiliation(s)
- Qing Deng
- Key Laboratory of Theoretical Organic
Chemistry and Function Molecule, Ministry of Education, School of
Chemistry and Chemical Engineering, Hunan
University of Science and Technology, Xiangtan 411201, China
| | - Chao-Tun Cao
- Key Laboratory of Theoretical Organic
Chemistry and Function Molecule, Ministry of Education, School of
Chemistry and Chemical Engineering, Hunan
University of Science and Technology, Xiangtan 411201, China
| | - Chenzhong Cao
- Key Laboratory of Theoretical Organic
Chemistry and Function Molecule, Ministry of Education, School of
Chemistry and Chemical Engineering, Hunan
University of Science and Technology, Xiangtan 411201, China
| |
Collapse
|
2
|
Panwar P, Yang Q, Martini A. Temperature-Dependent Density and Viscosity Prediction for Hydrocarbons: Machine Learning and Molecular Dynamics Simulations. J Chem Inf Model 2024; 64:2760-2774. [PMID: 37582234 DOI: 10.1021/acs.jcim.3c00231] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]
Abstract
Machine learning-based predictive models allow rapid and reliable prediction of material properties and facilitate innovative materials design. Base oils used in the formulation of lubricant products are complex hydrocarbons of varying sizes and structure. This study developed Gaussian process regression-based models to accurately predict the temperature-dependent density and dynamic viscosity of 305 complex hydrocarbons. In our approach, strongly correlated/collinear predictors were trimmed, important predictors were selected by least absolute shrinkage and selection operator (LASSO) regularization and prior domain knowledge, hyperparameters were systematically optimized by Bayesian optimization, and the models were interpreted. The approach provided versatile and quantitative structure-property relationship (QSPR) models with relatively simple predictors for determining the dynamic viscosity and density of complex hydrocarbons at any temperature. In addition, we developed molecular dynamics simulation-based descriptors and evaluated the feasibility and versatility of dynamic descriptors from simulations for predicting the material properties. It was found that the models developed using a comparably smaller pool of dynamic descriptors performed similarly in predicting density and viscosity to models based on many more static descriptors. The best models were shown to predict density and dynamic viscosity with coefficient of determination (R2) values of 99.6% and 97.7%, respectively, for all data sets, including a test data set of 45 molecules. Finally, partial dependency plots (PDPs), individual conditional expectation (ICE) plots, local interpretable model-agnostic explanation (LIME) values, and trimmed model R2 values were used to identify the most important static and dynamic predictors of the density and viscosity.
Collapse
Affiliation(s)
- Pawan Panwar
- Department of Mechanical Engineering, University of California Merced, 5200 North Lake Road, Merced, California 95343, United States
| | - Quanpeng Yang
- Department of Mechanical Engineering, University of California Merced, 5200 North Lake Road, Merced, California 95343, United States
| | - Ashlie Martini
- Department of Mechanical Engineering, University of California Merced, 5200 North Lake Road, Merced, California 95343, United States
| |
Collapse
|
3
|
Creton B, Barraud E, Nieto-Draghi C. Prediction of critical micelle concentration for per- and polyfluoroalkyl substances. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024; 35:309-324. [PMID: 38591134 DOI: 10.1080/1062936x.2024.2337011] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/11/2024] [Accepted: 03/26/2024] [Indexed: 04/10/2024]
Abstract
In this study, we focus on the development of Quantitative Structure-Property Relationship (QSPR) models to predict the critical micelle concentration (CMC) for per- and polyfluoroalkyl substances (PFASs). Experimental CMC values for both fluorinated and non-fluorinated compounds were meticulously compiled from existing literature sources. Our approach involved constructing two distinct types of models based on Support Vector Machine (SVM) algorithms applied to the dataset. Type (I) models were trained exclusively on CMC values for fluorinated compounds, while Type (II) models were developed utilizing the entire dataset, incorporating both fluorinated and non-fluorinated compounds. Comparative analyses were conducted against reference data, as well as between the two model types. Encouragingly, both types of models exhibited robust predictive capabilities and demonstrated high reliability. Subsequently, the model having the broadest applicability domain was selected to complement the existing experimental data, thereby enhancing our understanding of PFAS behaviour.
Collapse
Affiliation(s)
- B Creton
- Thermodynamics and Molecular Simulation, IFP Energies nouvelles, Rueil-Malmaison, France
| | - E Barraud
- Thermodynamics and Molecular Simulation, IFP Energies nouvelles, Rueil-Malmaison, France
| | - C Nieto-Draghi
- Thermodynamics and Molecular Simulation, IFP Energies nouvelles, Rueil-Malmaison, France
| |
Collapse
|
4
|
Chen J, Zhu L, Wang J. Quantitative structure-property relationship modelling on autoignition temperature: evaluation and comparative analysis. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2024; 35:199-218. [PMID: 38372083 DOI: 10.1080/1062936x.2024.2312527] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/06/2023] [Accepted: 01/25/2024] [Indexed: 02/20/2024]
Abstract
The autoignition temperature (AIT) serves as a crucial indicator for assessing the potential hazards associated with a chemical substance. In order to gain deeper insights into model performance and facilitate the establishment of effective methodological practices for AIT predictions, this study conducts a benchmark investigation on Quantitative Structure-Property Relationship (QSPR) modelling for AIT. As novelties of this work, three significant advancements are implemented in the AIT modelling process, including explicit consideration of data quality, utilization of state-of-the-art feature engineering workflows, and the innovative application of graph-based deep learning techniques, which are employed for the first time in AIT prediction. Specifically, three traditional QSPR models (multi-linear regression, support vector regression, and artificial neural networks) are evaluated, alongside the assessment of a deep-learning model employing message passing neural network architecture supplemented by graph-data augmentation techniques.
Collapse
Affiliation(s)
- J Chen
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou, China
| | - L Zhu
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou, China
| | - J Wang
- College of Chemical Engineering, Zhejiang University of Technology, Hangzhou, China
| |
Collapse
|
5
|
Cao CT, Chen S, Cao C. General Equation to Estimate the Physicochemical Properties of Aliphatic Amines. ACS OMEGA 2023; 8:49088-49097. [PMID: 38162734 PMCID: PMC10753552 DOI: 10.1021/acsomega.3c06992] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 09/13/2023] [Revised: 11/27/2023] [Accepted: 11/29/2023] [Indexed: 01/03/2024]
Abstract
Changes in various physicochemical properties (P(n)) of aliphatic amines (including primary, secondary, and tertiary amines) can be roughly divided into nonlinear (P(n)) and linear (PLC(n)) changes. In our previous paper, nonlinear and linear change properties of noncyclic alkanes all were correlated with four parameters, n, SCNE, ΔAOEI, and ΔAIMPI, indicating number of carbon atoms, sum of carbon number effects, average odd-even index difference, and average inner molecular polarizability index difference, respectively. To date, there has been no general equation to express changes in the properties of substituted alkanes. This work, based on the molecular structure characteristics of aliphatic amine molecules, proposes a general equation to express nonlinear changes in their physicochemical properties, named as the "NPAA equation" (eq 12), ln(P(n)) = a + b(n) + c(SCNE) + d(ΔAOEI) + e(PEI) + f(APEI) + g(GN), and proposes a general equation to express linear changes in the physicochemical properties of them, named as the "LPAA equation" (eq 13), PLC(n) = a + b(n) + c(SCNE) + d(ΔAOEI) + e(PEI) + f(APEI) + g(GN). In NPAA and LPAA equations, a, b, c, d, e, f, and g are coefficients, and PEI, APEI, and GN represent the polarizability effect index, average polarizability effect index, and N atomic influence factor, respectively. The results show that nonlinear and linear change properties of aliphatic amines all can be correlated with six parameters, n, SCNE, ΔAOEI, PEI, APEI, and GN. NPAA and LPAA equations have the advantages of uniform expression, high estimation accuracy, and usage of fewer parameters. Further, by employing the above six parameters, a quantitative correlation equation can be established between any two properties of aliphatic amines. Using the obtained equations as model equations, the property data of aliphatic amines were predicted, involving 107 normal boiling points, 10 refractive indexes, 11 liquid densities, 54 critical temperatures, 54 critical pressures, 62 liquid thermal conductivities, 59 surface tensions, 56 heat capacities, 55 critical volumes, 54 gas enthalpies of formation, and 57 gas Gibbs energies of formation, a total of 579 values, which have not been experimentally determined yet. This work not only provides a simple and convenient method for estimating or predicting the properties of aliphatic amines but can also provide new perspectives for quantitative structure-property relationships of substituted alkanes.
Collapse
Affiliation(s)
- Chao-Tun Cao
- Key Laboratory of Theoretical
Organic Chemistry and Function Molecule, Ministry of Education, School
of Chemistry and Chemical Engineering, Hunan
University of Science and Technology, Xiangtan 411201, China
| | - Shurui Chen
- Key Laboratory of Theoretical
Organic Chemistry and Function Molecule, Ministry of Education, School
of Chemistry and Chemical Engineering, Hunan
University of Science and Technology, Xiangtan 411201, China
| | - Chenzhong Cao
- Key Laboratory of Theoretical
Organic Chemistry and Function Molecule, Ministry of Education, School
of Chemistry and Chemical Engineering, Hunan
University of Science and Technology, Xiangtan 411201, China
| |
Collapse
|
6
|
Fayet G, Rotureau P. QSPR models to predict the physical hazards of mixtures: a state of art. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:745-764. [PMID: 37706255 DOI: 10.1080/1062936x.2023.2253150] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/22/2023] [Accepted: 08/24/2023] [Indexed: 09/15/2023]
Abstract
Physical hazards of chemical mixtures, associated for example with their fire or explosion risks, are generally characterized using experimental tools. These tests can be expensive, complex, long to perform and even dangerous for operators. Therefore, for several years and especially with the implementation of the REACH regulation, predictive methods like quantitative structure-property relationships have been encouraged as alternatives tests to determine (eco)toxicological but also physical hazards of chemical substances. Initially, these approaches were intended for pure products, by considering a molecular similarity principle. However, additional to those for pure products, QSPR models for mixtures recently appeared and represent an increasing field of research. This study proposes a state of the art of existing QSPR models specifically dedicated to the prediction of the physical hazards of mixtures. Identified models have been analysed on the key elements of model development (experimental data and fields of application, descriptors used, development and validation methods). It draws up an overview of the potential and limitations of current models as well as areas of progress towards enlarged deployment as a complement to experimental characterizations, for example in the search for safer substances (according to safety-by-design concepts).
Collapse
Affiliation(s)
- G Fayet
- Ineris, Parc Technologique Alata, Verneuil-en-Halatte, France
| | - P Rotureau
- Ineris, Parc Technologique Alata, Verneuil-en-Halatte, France
| |
Collapse
|
7
|
Moreno Jimenez R, Creton B, Marre S. Machine learning-based models for accessing thermal conductivity of liquids at different temperature conditions. SAR AND QSAR IN ENVIRONMENTAL RESEARCH 2023; 34:605-617. [PMID: 37642367 DOI: 10.1080/1062936x.2023.2244410] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/21/2023] [Accepted: 07/29/2023] [Indexed: 08/31/2023]
Abstract
Combating global warming-related climate change demands prompt actions to reduce greenhouse gas emissions, particularly carbon dioxide. Biomass-based biofuels represent a promising alternative fossil energy source. To convert biomass into energy, numerous conversion processes are performed at high pressure and temperature conditions, and the design and dimensioning of such processes requires thermophysical property data, particularly thermal conductivity, which are not always available in the literature. In this paper, we proposed the application of Chemoinformatics methodologies to investigate the prediction of thermal conductivity for hydrocarbons and oxygenated compounds. A compilation of experimental data followed by a careful data curation were performed to establish a database. The support vector machine algorithm has been applied to the database leading to models with good predictive abilities. The support vector regression (SVR) model has then been applied to an external set of compounds, i.e. not considered during the training of models. It showed that our SVR model can be used for the prediction of thermal conductivity values for temperatures and/or compounds that are not covered experimentally in the literature.
Collapse
Affiliation(s)
- R Moreno Jimenez
- IFP Energies nouvelles, Rueil-Malmaison, France
- CNRS, University of Bordeaux, ICMCB, UMR 5026, 33600 Pessac, France
| | - B Creton
- IFP Energies nouvelles, Rueil-Malmaison, France
| | - S Marre
- CNRS, University of Bordeaux, ICMCB, UMR 5026, 33600 Pessac, France
| |
Collapse
|
8
|
Tillotson MJ, Diamantonis NI, Buda C, Bolton LW, Müller EA. Molecular modelling of the thermophysical properties of fluids: expectations, limitations, gaps and opportunities. Phys Chem Chem Phys 2023; 25:12607-12628. [PMID: 37114325 DOI: 10.1039/d2cp05423j] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/29/2023]
Abstract
This manuscript provides an overview of the current state of the art in terms of the molecular modelling of the thermophysical properties of fluids. It is intended to manage the expectations and serve as guidance to practising physical chemists, chemical physicists and engineers in terms of the scope and accuracy of the more commonly available intermolecular potentials along with the peculiarities of the software and methods employed in molecular simulations while providing insights on the gaps and opportunities available in this field. The discussion is focused around case studies which showcase both the precision and the limitations of frequently used workflows.
Collapse
Affiliation(s)
- Marcus J Tillotson
- Department of Chemical Engineering, Imperial College London, London, UK.
| | | | | | | | - Erich A Müller
- Department of Chemical Engineering, Imperial College London, London, UK.
| |
Collapse
|
9
|
Cao CT, Cao C. New Method of NPOH Equation-Based to Estimate the Physicochemical Properties of Noncyclic Alkanes. ACS OMEGA 2023; 8:6492-6506. [PMID: 36844565 PMCID: PMC9948200 DOI: 10.1021/acsomega.2c06856] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 10/24/2022] [Accepted: 01/19/2023] [Indexed: 06/18/2023]
Abstract
Changes in various physicochemical properties (P (n)) of noncyclic alkanes can be roughly classified as linear and nonlinear changes. In our previous study, the NPOH equation was proposed to express nonlinear changes in the properties of organic homologues. Until now, there has been no general equation to express nonlinear changes in the properties of noncyclic alkanes involving linear and branched alkane isomers. This work, on the basis of NPOH equation, proposes a general equation to express nonlinear changes in the physicochemical properties of noncyclic alkanes, including a total of 12 properties, boiling point, critical temperature, critical pressure, acentric factor, heat capacity, liquid viscosity, and flash point, named as the "NPNA equation", as follows: ln(P (n)) = a + b(n - 1) + c(S CNE) + d (ΔAOEI) + f(ΔAIMPI), where a, b, c, and f are coefficients, and P (n) represents the property of the alkane with n carbon atom number. n, S CNE, ΔAOEI, and ΔAIMPI are number of carbon atoms, sum of carbon number effects, average odd-even index difference, and average inner molecular polarizability index difference, respectively. The obtained results show that various nonlinear changes in the properties of noncyclic alkanes can be expressed by the NPNA equation. Nonlinear and linear change properties of noncyclic alkanes can be correlated with four parameters, n, S CNE, ΔAOEI, and ΔAIMPI. The NPNA equation has the advantages of uniform expression, usage of fewer parameters, and high estimation accuracy. Furthermore, using the above four parameters, a quantitative correlation equation can be established between any two properties of noncyclic alkanes. Employing the obtained equations as model equations, the property data of noncyclic alkanes, involving 142 critical temperatures, 142 critical pressures, 115 acentric factors, 116 flash points, 174 heat capacities, 142 critical volumes, and 155 gas enthalpies of formation, a total of 986 values, were predicted, which have not be experimentally measured. NPNA equation not only can provide a simple and convenient estimation or prediction method for the properties of noncyclic alkanes but also can provide new perspectives for studying quantitative structure-property relationships of branched organic compounds.
Collapse
|
10
|
Gupta S, Elliott JR, Anderko A, Crosthwaite J, Chapman WG, Lira CT. Current Practices and Continuing Needs in Thermophysical Properties for the Chemical Industry. Ind Eng Chem Res 2023. [DOI: 10.1021/acs.iecr.2c03153] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/22/2023]
Affiliation(s)
- Sumnesh Gupta
- The Dow Chemical Company, 1254 Enclave Parkway, Houston, Texas 77077, United States
| | - J. Richard Elliott
- Chemical, Biomolecular, and Corrosion Engineering Department, University of Akron, Akron, Ohio 44325-3906, United States
| | - Andrzej Anderko
- OLI Systems, Inc., 2 Gatehall Drive, Suite 1D, Parsippany, New Jersey 07054, United States
| | - Jacob Crosthwaite
- The Dow Chemical Company, 1897 Building, Midland, Michigan 48667, United States
| | - Walter G. Chapman
- Chemical and Biomolecular Engineering Department, Rice University, 6100 Main Street, Houston, Texas 77005, United States
| | - Carl T. Lira
- Chemical Engineering & Materials Science, Michigan State University, East Lansing, Michigan 48824-2288, United States
| |
Collapse
|
11
|
Iftakher A, Monjur MS, Hasan MMF. An Overview of Computer‐aided Molecular and Process Design. CHEM-ING-TECH 2023. [DOI: 10.1002/cite.202200172] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/31/2023]
Affiliation(s)
- Ashfaq Iftakher
- Texas A&M University Artie McFerrin Department of Chemical Engineering 100 Spence St. TX 77843-3122 College Station USA
| | - Mohammed Sadaf Monjur
- Texas A&M University Artie McFerrin Department of Chemical Engineering 100 Spence St. TX 77843-3122 College Station USA
| | - M. M. Faruque Hasan
- Texas A&M University Artie McFerrin Department of Chemical Engineering 100 Spence St. TX 77843-3122 College Station USA
| |
Collapse
|
12
|
Lansford JL, Barnes BC, Rice BM, Jensen KF. Building Chemical Property Models for Energetic Materials from Small Datasets Using a Transfer Learning Approach. J Chem Inf Model 2022; 62:5397-5410. [PMID: 36240441 DOI: 10.1021/acs.jcim.2c00841] [Citation(s) in RCA: 6] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/13/2022]
Abstract
For many experimentally measured chemical properties that cannot be directly computed from first-principles, the existing physics-based models do not extrapolate well to out-of-sample molecules, and experimental datasets themselves are too small for traditional machine learning (ML) approaches. To overcome these limitations, we apply a transfer learning approach, whereby we simultaneously train a multi-target regression model on a small number of molecules with experimentally measured values and a large number of molecules with related computed properties. We demonstrate this methodology on predicting the experimentally measured impact sensitivity of energetic crystals, finding that both characteristics of the computed dataset and model architecture are important to prediction accuracy of the small experimental dataset. Our directed-message passing neural network (D-MPNN) ML model using transfer learning outperforms direct-ML and physics-based models on a diverse test set, and the new methods described here are widely applicable to modeling many other structure-property relationships.
Collapse
Affiliation(s)
- Joshua L Lansford
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States.,Department of Chemical Engineering, MIT, Cambridge, Massachusetts 02139, United States
| | - Brian C Barnes
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Betsy M Rice
- U.S. Army Combat Capabilities Development Command (DEVCOM) Army Research Laboratory, Aberdeen Proving Ground, Maryland 21005, United States
| | - Klavs F Jensen
- Department of Chemical Engineering, MIT, Cambridge, Massachusetts 02139, United States
| |
Collapse
|
13
|
Pal R, Patra SG, Chattaraj PK. Quantitative Structure-Toxicity Relationship in Bioactive Molecules from a Conceptual DFT Perspective. Pharmaceuticals (Basel) 2022; 15:1383. [PMID: 36355555 PMCID: PMC9695291 DOI: 10.3390/ph15111383] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2022] [Revised: 11/01/2022] [Accepted: 11/07/2022] [Indexed: 10/29/2023] Open
Abstract
The preclinical drug discovery stage often requires a large amount of costly and time-consuming experiments using huge sets of chemical compounds. In the last few decades, this process has undergone significant improvements by the introduction of quantitative structure-activity relationship (QSAR) modelling that uses a certain percentage of experimental data to predict the biological activity/property of compounds with similar structural skeleton and/or containing a particular functional group(s). The use of machine learning tools along with it has made life even easier for pharmaceutical researchers. Here, we discuss the toxicity of certain sets of bioactive compounds towards Pimephales promelas and Tetrahymena pyriformis in terms of the global conceptual density functional theory (CDFT)-based descriptor, electrophilicity index (ω). We have compared the results with those obtained by using the commonly used hydrophobicity parameter, logP (where P is the n-octanol/water partition coefficient), considering the greater ease of computing the ω descriptor. The Human African trypanosomiasis (HAT) curing activity of 32 pyridyl benzamide derivatives is also studied against Tryphanosoma brucei. In this review article, we summarize these multiple linear regression (MLR)-based QSAR studies in terms of electrophilicity (ω, ω2) and hydrophobicity (logP, (logP)2) parameters.
Collapse
Affiliation(s)
- Ranita Pal
- Advanced Technology Development Centre, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Shanti Gopal Patra
- Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| | - Pratim Kumar Chattaraj
- Department of Chemistry, Indian Institute of Technology Kharagpur, Kharagpur 721302, India
| |
Collapse
|
14
|
Fang X, Li Y, Kua YL, Chew ZL, Gan S, Tan KW, Lee TZE, Cheng WK, Lau HLN. Insights on the potential of natural deep eutectic solvents (NADES) to fine-tune durian seed gum for use as edible food coating. Food Hydrocoll 2022. [DOI: 10.1016/j.foodhyd.2022.107861] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/16/2023]
|
15
|
Delforce L, Duprat F, Ploix JL, Ontiveros JF, Goussard V, Nardello-Rataj V, Aubry JM. Fast Prediction of the Equivalent Alkane Carbon Number Using Graph Machines and Neural Networks. ACS OMEGA 2022; 7:38869-38881. [PMID: 36340160 PMCID: PMC9631404 DOI: 10.1021/acsomega.2c04592] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 07/20/2022] [Accepted: 08/09/2022] [Indexed: 06/16/2023]
Abstract
The hydrophobicity of oils is a key parameter to design surfactant/oil/water (SOW) macro-, micro-, or nano-dispersed systems with the desired features. This essential physicochemical characteristic is quantitatively expressed by the equivalent alkane carbon number (EACN) whose experimental determination is tedious since it requires knowledge of the phase behavior of the SOW systems at different temperatures and for different surfactant concentrations. In this work, two mathematical models are proposed for the rapid prediction of the EACN of oils. They have been designed using artificial intelligence (machine-learning) methods, namely, neural networks (NN) and graph machines (GM). While the GM model is implemented from the SMILES codes of a 111-molecule training set of known EACN values, the NN model is fed with some σ-moment descriptors computed with the COSMOtherm software for the 111-molecule set. In a preliminary step, the leave-one-out algorithm is used to select, given the available data, the appropriate complexity of the two models. A comparison of the EACNs of liquids of a fresh set of 10 complex cosmetic and perfumery molecules shows that the two approaches provide comparable results in terms of accuracy and reliability. Finally, the NN and GM models are applied to nine series of homologous compounds, for which the GM model results are in better agreement with the experimental EACN trends than the NN model predictions. The results obtained by the GMs and by the NN based on σ-moments can be duplicated with the demonstration tool available for download as detailed in the Supporting Information.
Collapse
Affiliation(s)
- Lucie Delforce
- University
of Lille, CNRS, Centrale Lille, Université d′Artois,
UMR 8181—UCCS—Unité de Catalyse et Chimie du
Solide, F-59000Lille, France
| | - François Duprat
- Laboratoire
de Chimie Organique, CNRS, ESPCI Paris,
PSL Research University, 10 rue Vauquelin, 75005Paris, France
| | - Jean-Luc Ploix
- Laboratoire
de Chimie Organique, CNRS, ESPCI Paris,
PSL Research University, 10 rue Vauquelin, 75005Paris, France
| | - Jesus Fermín Ontiveros
- University
of Lille, CNRS, Centrale Lille, Université d′Artois,
UMR 8181—UCCS—Unité de Catalyse et Chimie du
Solide, F-59000Lille, France
| | - Valentin Goussard
- University
of Lille, CNRS, Centrale Lille, Université d′Artois,
UMR 8181—UCCS—Unité de Catalyse et Chimie du
Solide, F-59000Lille, France
| | - Véronique Nardello-Rataj
- University
of Lille, CNRS, Centrale Lille, Université d′Artois,
UMR 8181—UCCS—Unité de Catalyse et Chimie du
Solide, F-59000Lille, France
| | - Jean-Marie Aubry
- University
of Lille, CNRS, Centrale Lille, Université d′Artois,
UMR 8181—UCCS—Unité de Catalyse et Chimie du
Solide, F-59000Lille, France
| |
Collapse
|
16
|
de Hemptinne JC, Kontogeorgis GM, Dohrn R, Economou IG, ten Kate A, Kuitunen S, Fele Žilnik L, De Angelis MG, Vesovic V. A View on the Future of Applied Thermodynamics. Ind Eng Chem Res 2022. [DOI: 10.1021/acs.iecr.2c01906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
| | - Georgios M. Kontogeorgis
- Center for Energy Resources Engineering (CERE), Department of Chemical and Biochemical Engineering, Technical University of Denmark, Lyngby DK-2800, Denmark
| | - Ralf Dohrn
- Bayer AG, Process Technologies, Building E41, Leverkusen 51368, Germany
| | - Ioannis G. Economou
- Chemical Engineering Program, Texas A&M University at Qatar, Doha P.O. Box 23874, Qatar
| | | | - Susanna Kuitunen
- Neste Engineering Solutions Oy, P.O. Box 310, Porvoo FI-06101, Finland
| | - Ljudmila Fele Žilnik
- Department of Catalysis and Chemical Reaction Engineering, National Institute of Chemistry, Hajdrihova 19, Ljubljana 1001, Slovenia
| | - Maria Grazia De Angelis
- Institute for Materials and Processes, School of Engineering, University of Edinburgh, Sanderson Building, Edinburgh EH9 3FB, UK
- Department of Civil, Chemical, Environmental and Materials Engineering University of Bologna, Bologna 40131 Italy
| | - Velisa Vesovic
- Department of Earth Science and Engineering, Imperial College London, South Kensington Campus, London SW7 2AZ, United Kingdom
| |
Collapse
|
17
|
Cao CT, Cao C. General Equation to Express Changes in the Physicochemical Properties of Organic Homologues. ACS OMEGA 2022; 7:26670-26679. [PMID: 35936486 PMCID: PMC9352247 DOI: 10.1021/acsomega.2c02828] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/13/2022] [Accepted: 07/12/2022] [Indexed: 06/15/2023]
Abstract
Changes in various physicochemical properties (P (n)) of organic compounds with the number of carbon atoms (n) can be roughly divided into linear and nonlinear changes. To date, there has been no general equation to express nonlinear changes in the properties of organic homologues. This study proposes a general equation expressing nonlinear changes in the physicochemical properties of organic homologues, including boiling point, viscosity, ionization potential, and vapor pressure, named the "NPOH equation", as follows: P (n) = P (1) α n - 1 e ∑i=2 n(β/(i - 1)) where α and β are adjustable parameters, and P (1) represents the property of the starting compound (pseudo-value at n = 1) of each homologue. The results show that various nonlinear changes in the properties of homologues can be expressed by the NPOH equation. Linear and nonlinear changes in the properties of homologues can all be correlated with n and the "sum of carbon number effects", ∑i=2 n(1/i - 1). Using these two parameters, a quantitative correlation equation can be established between any two properties of each homologue, providing convenient mutual estimation of the properties of a homologue series. The NPOH equation can also be used in property correlation for structures with functionality located elsewhere along a linear alkyl chain as well as for branched organic compounds. This work can provide new perspectives for studying quantitative structure-property relationships.
Collapse
|
18
|
Bio-based alternatives to volatile silicones: Relationships between chemical structure, physicochemical properties and functional performances. Adv Colloid Interface Sci 2022; 304:102679. [PMID: 35512559 DOI: 10.1016/j.cis.2022.102679] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.3] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2022] [Revised: 04/08/2022] [Accepted: 04/13/2022] [Indexed: 11/23/2022]
Abstract
Emollient oils are ubiquitous ingredients of personal care products, especially skin care and hair care formulations. They offer excellent spreading properties and give end-use products a soft, pleasant and non-sticky after-feel. Emollients belong to various petro- or bio-based chemical families among which silicone oils, hydrocarbons and esters are the most prominent. Silicones have exceptional physicochemical and sensory properties but their high chemical stability results in very low biodegradability and a high bioaccumulation potential. Nowadays, consumers are increasingly responsive to environmental issues and demand more environmentally friendly products. This awareness strongly encourages cosmetics industries to develop bio-based alternatives to silicone oils. Finding effective silicon-free emollients requires understanding the molecular origin of emollience. This review details the relationships between the molecular structures of emollients and their physicochemical properties as well as the resulting functional performances in order to facilitate the design of alternative oils with suitable physicochemical and sensory properties. The molecular profile of an ideal emollient in terms of chemical function (alkane, ether, ester, carbonate, alcohol), optimal number of carbons and branching is established to obtain an odourless oil with good spreading on the skin. Since none of the carbon-based emollients alone can imitate the non-sticky and dry feel of silicone oils, it is judicious to blend alkanes and esters to significantly improve both the sensory properties and the solubilizing properties of the synergistic mixture towards polar ingredients (sun filters, antioxidants, fragrances). Finally, it is shown how modelling tools (QSPR, COSMO-RS and neural networks) can predict in silico the key properties of hundreds of virtual candidate molecules in order to synthesize only the most promising whose predicted properties are close to the specifications.
Collapse
|
19
|
Lađarević JM, Božić BĐ, Vitnik VD, Matović LR, Mijin DŽ, Vitnik ŽJ. Improvement of theoretical UV-Vis spectra calculations by empirical solvatochromic parameters: Case study of 5-arylazo-3-cyano-1-ethyl-6-hydroxy-4-methyl-2-pyridones. SPECTROCHIMICA ACTA. PART A, MOLECULAR AND BIOMOLECULAR SPECTROSCOPY 2022; 272:120978. [PMID: 35151162 DOI: 10.1016/j.saa.2022.120978] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 08/06/2021] [Revised: 01/03/2022] [Accepted: 01/28/2022] [Indexed: 06/14/2023]
Abstract
In order to improve the performance of theoretical UV-Vis spectra predictions, a theoretical and experimental study of solvatochromic properties of ten azo pyridone dyes has been performed. For quantitative estimation of intermolecular solvent-solute interactions, a concept of the linear solvation energy relationships has been applied using Kamlet-Taft and Catalán models. Theoretical UV-Vis spectra for all dyes have been calculated using four TD-DFT methods in nine different solvents with the aim to define the most reliable model. Finally, new polylinear equations for more accurate theoretical prediction of UV-Vis maxima are developed using empirical Kamlet-Taft and Catalán solvent parameters as additive corrections for specific and nonspecific solvent-solute interactions.
Collapse
Affiliation(s)
- Jelena M Lađarević
- Faculty of Technology and Metallurgy, University of Belgrade, Karnegijeva 4, Belgrade, Serbia.
| | - Bojan Đ Božić
- Institute of Physiology and Biochemistry "Ivan Djaja", Faculty of Biology, University of Belgrade, Studentski trg 16, Belgrade, Serbia
| | - Vesna D Vitnik
- Department of Chemistry, Institute of Chemistry, Technology and Metallurgy, University of Belgrade, Studentski trg 12-16, Belgrade, Serbia
| | - Luka R Matović
- Innovation Centre of the Faculty of Technology and Metallurgy, University of Belgrade, Karnegijeva 4, Belgrade, Serbia
| | - Dušan Ž Mijin
- Faculty of Technology and Metallurgy, University of Belgrade, Karnegijeva 4, Belgrade, Serbia
| | - Željko J Vitnik
- Department of Chemistry, Institute of Chemistry, Technology and Metallurgy, University of Belgrade, Studentski trg 12-16, Belgrade, Serbia
| |
Collapse
|
20
|
Rosenberger D, Barros K, Germann TC, Lubbers N. Machine learning of consistent thermodynamic models using automatic differentiation. Phys Rev E 2022; 105:045301. [PMID: 35590626 DOI: 10.1103/physreve.105.045301] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2021] [Accepted: 03/09/2022] [Indexed: 06/15/2023]
Abstract
We propose a data-driven method to describe consistent equations of state (EOS) for arbitrary systems. Complex EOS are traditionally obtained by fitting suitable analytical expressions to thermophysical data. A key aspect of EOS is that the relationships between state variables are given by derivatives of the system free energy. In this work, we model the free energy with an artificial neural network and utilize automatic differentiation to directly learn the derivatives of the free energy. We demonstrate this approach on two different systems, the analytic van der Waals EOS and published data for the Lennard-Jones fluid, and we show that it is advantageous over direct learning of thermodynamic properties (i.e., not as derivatives of the free energy but as independent properties), in terms of both accuracy and the exact preservation of the Maxwell relations. Furthermore, the method implicitly provides the free energy of a system without explicit integration.
Collapse
Affiliation(s)
- David Rosenberger
- Los Alamos National Laboratory, Theoretical Division, Physics and Chemistry of Materials Group, Los Alamos, New Mexico 87545, USA
| | - Kipton Barros
- Los Alamos National Laboratory, Theoretical Division, Physics and Chemistry of Materials Group, Los Alamos, New Mexico 87545, USA
| | - Timothy C Germann
- Los Alamos National Laboratory, Theoretical Division, Physics and Chemistry of Materials Group, Los Alamos, New Mexico 87545, USA
| | - Nicholas Lubbers
- Los Alamos National Laboratory, Computer, Computational & Statistical Sciences Division, Information Sciences Group, Los Alamos, New Mexico 87545, USA
| |
Collapse
|
21
|
Dupeux T, Gaudin T, Marteau‐Roussy C, Aubry J, Nardello‐Rataj V. COSMO‐RS as an effective tool for predicting the physicochemical properties of fragrance raw materials. FLAVOUR FRAG J 2022. [DOI: 10.1002/ffj.3690] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/14/2022]
Affiliation(s)
- Tristan Dupeux
- Univ. LilleCNRSCentrale LilleUniv. ArtoisUMR 8181 – UCCS – Unité de Catalyse et Chimie du Solide Lille France
- International Flavors & Fragrances (Fragrance Beauty Care) Neuilly‐sur‐Seine France
| | - Théophile Gaudin
- Univ. LilleCNRSCentrale LilleUniv. ArtoisUMR 8181 – UCCS – Unité de Catalyse et Chimie du Solide Lille France
| | | | - Jean‐Marie Aubry
- Univ. LilleCNRSCentrale LilleUniv. ArtoisUMR 8181 – UCCS – Unité de Catalyse et Chimie du Solide Lille France
| | - Véronique Nardello‐Rataj
- Univ. LilleCNRSCentrale LilleUniv. ArtoisUMR 8181 – UCCS – Unité de Catalyse et Chimie du Solide Lille France
| |
Collapse
|
22
|
Gantzer P, Creton B, Nieto-Draghi C. Comparisons of Molecular Structure Generation Methods Based on Fragment Assemblies and Genetic Graphs. J Chem Inf Model 2021; 61:4245-4258. [PMID: 34405674 DOI: 10.1021/acs.jcim.1c00803] [Citation(s) in RCA: 4] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The use of quantitative structure-property relationships (QSPRs) helps in predicting molecular properties for several decades, while the automatic design of new molecular structures is still emerging. The choice of algorithms to generate molecules is not obvious and is related to several factors such as the desired chemical diversity (according to an initial dataset's content) and the level of construction (the use of atoms, fragments, pattern-based methods). In this paper, we address the problem of molecular structure generation by revisiting two approaches: fragment-based methods (FMs) and genetic-based methods (GMs). We define a set of indices to compare generation methods on a specific task. New indices inform about the explored data space (coverage), compare how the data space is explored (representativeness), and quantifies the ratio of molecules satisfying requirements (generation specificity) without the use of a database composed of real chemicals as a reference. These indices were employed to compare generations of molecules fulfilling the desired property criterion, evaluated by QSPR.
Collapse
Affiliation(s)
- Philippe Gantzer
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Benoit Creton
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| | - Carlos Nieto-Draghi
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852 Rueil-Malmaison, France
| |
Collapse
|
23
|
Bittner JP, Huang L, Zhang N, Kara S, Jakobtorweihen S. Comparison and Validation of Force Fields for Deep Eutectic Solvents in Combination with Water and Alcohol Dehydrogenase. J Chem Theory Comput 2021; 17:5322-5341. [PMID: 34232662 DOI: 10.1021/acs.jctc.1c00274] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/24/2022]
Abstract
Deep eutectic solvents (DESs) have become popular as environmental-friendly solvents for biocatalysis. Molecular dynamics (MD) simulations offer an in-depth analysis of enzymes in DESs, but their performance depends on the force field chosen. Here, we present a comprehensive validation of three biomolecular force fields (CHARMM, Amber, and OPLS) for simulations of alcohol dehydrogenase (ADH) in DESs composed of choline chloride and glycerol/ethylene glycol with varying water contents. Different properties (e.g., protein structure and flexibility, solvation layer, and H-bonds) were used for validation. For two properties (viscosity and water activity) also experiments were performed. The viscosity was calculated with the periodic perturbation method, whereby its parameter dependency is disclosed. A modification of Amber was identified as the best-performing model for low water contents, whereas CHARMM outperforms the other models at larger water concentrations. An analysis of ADH's structure and interactions with the DESs revealed similar predictions for Amber and CHARMM.
Collapse
Affiliation(s)
- Jan Philipp Bittner
- Institute of Thermal Separation Processes, Hamburg University of Technology, Eißendorfer Straße 38, 21073 Hamburg, Germany
| | - Lei Huang
- Department of Biological and Chemical Engineering, Biocatalysis and Bioprocessing Group, Aarhus University, Gustav Wieds Vej 10, 8000 Aarhus, Denmark
| | - Ningning Zhang
- Department of Biological and Chemical Engineering, Biocatalysis and Bioprocessing Group, Aarhus University, Gustav Wieds Vej 10, 8000 Aarhus, Denmark
| | - Selin Kara
- Department of Biological and Chemical Engineering, Biocatalysis and Bioprocessing Group, Aarhus University, Gustav Wieds Vej 10, 8000 Aarhus, Denmark
| | - Sven Jakobtorweihen
- Institute of Thermal Separation Processes, Hamburg University of Technology, Eißendorfer Straße 38, 21073 Hamburg, Germany.,Department for Chemical Reaction Engineering, Hamburg University of Technology, Eißendorfer Straße 38, 21073 Hamburg, Germany
| |
Collapse
|
24
|
Barthel C, Massiot G, Lavaud C. An easy-to-use and general nuclear magnetic resonance method for the determination of partition coefficients of drugs and natural products. MAGNETIC RESONANCE IN CHEMISTRY : MRC 2021; 59:835-843. [PMID: 33818813 DOI: 10.1002/mrc.5159] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/10/2021] [Revised: 03/27/2021] [Accepted: 03/29/2021] [Indexed: 06/12/2023]
Abstract
The lipophilicity of a drug is an important parameter for its eventual development by the pharmaceutical industry. It is usually measured by HPLC following partition of the compound between water and 1-octanol. We present here an alternative, simple, sensitive and quantitative 1 H nuclear magnetic resonance (NMR) method for the experimental measurement of partition coefficients of natural compounds and pharmaceutical drugs. It is based on measuring concentrations in the water phase, before and after partitioning and equilibration between water and octanol, using the ERETIC (Electronic Reference To Access In Vivo Concentration) technique. The signal to noise ratio is improved by a Water Suppression by Excitation Sculpting sequence. Quantification is based on an electronic reference signal and does not need addition of a reference compound. The log P values of 22 natural metabolites and four pharmaceutical drugs were determined and the experimental results are in excellent agreement with literature data. The experiments were run on ~2 mg material. This technique proved to be robust, reproducible and suitable for log P values between -2 and +2.
Collapse
Affiliation(s)
- Cédric Barthel
- Institut de Chimie Moléculaire de Reims, Université de Reims Champagne-Ardenne, UMR CNRS 7312, UFR de Pharmacie, Reims, France
| | - Georges Massiot
- Institut de Chimie Moléculaire de Reims, Université de Reims Champagne-Ardenne, UMR CNRS 7312, Case postale 44, UFR des Sciences Exactes et Naturelles, Reims, France
| | - Catherine Lavaud
- Institut de Chimie Moléculaire de Reims, Université de Reims Champagne-Ardenne, UMR CNRS 7312, UFR de Pharmacie, Reims, France
| |
Collapse
|
25
|
Honarasa F, Yousefinejad S, Nekoeinia M. Structure–solubility and solvation energy relationships for propanol in different solvents using structural and empirical scales. J CHIN CHEM SOC-TAIP 2021. [DOI: 10.1002/jccs.202100215] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/30/2022]
Affiliation(s)
- Fatemeh Honarasa
- Department of Chemistry, Shiraz Branch Islamic Azad University Shiraz Iran
- Department of Applied Researches, Chemical, Petroleum & Polymer Engineering Research Center Shiraz Branch, Islamic Azad University Shiraz Iran
| | - Saeed Yousefinejad
- Research Center for Health Sciences, Institute of Health, Department of Occupational Health Engineering, School of Health Shiraz University of Medical Sciences Shiraz Iran
| | - Mohsen Nekoeinia
- Department of Chemistry Payame Noor University (PNU) Tehran Iran
| |
Collapse
|
26
|
Falcioni F, Kalayan J, Henchman RH. Energy-entropy prediction of octanol-water logP of SAMPL7 N-acyl sulfonamide bioisosters. J Comput Aided Mol Des 2021; 35:831-840. [PMID: 34244906 PMCID: PMC8295089 DOI: 10.1007/s10822-021-00401-w] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/04/2021] [Accepted: 06/17/2021] [Indexed: 12/23/2022]
Abstract
Partition coefficients quantify a molecule's distribution between two immiscible liquid phases. While there are many methods to compute them, there is not yet a method based on the free energy of each system in terms of energy and entropy, where entropy depends on the probability distribution of all quantum states of the system. Here we test a method in this class called Energy Entropy Multiscale Cell Correlation (EE-MCC) for the calculation of octanol-water logP values for 22 N-acyl sulfonamides in the SAMPL7 Physical Properties Challenge (Statistical Assessment of the Modelling of Proteins and Ligands). EE-MCC logP values have a mean error of 1.8 logP units versus experiment and a standard error of the mean of 1.0 logP units for three separate calculations. These errors are primarily due to getting sufficiently converged energies to give accurate differences of large numbers, particularly for the large-molecule solvent octanol. However, this is also an issue for entropy, and approximations in the force field and MCC theory also contribute to the error. Unique to MCC is that it explains the entropy contributions over all the degrees of freedom of all molecules in the system. A gain in orientational entropy of water is the main favourable entropic contribution, supported by small gains in solute vibrational and orientational entropy but offset by unfavourable changes in the orientational entropy of octanol, the vibrational entropy of both solvents, and the positional and conformational entropy of the solute.
Collapse
Affiliation(s)
- Fabio Falcioni
- Manchester Institute of Biotechnology, The University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK.
- School of Chemistry, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK.
| | - Jas Kalayan
- Manchester Institute of Biotechnology, The University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK
- School of Chemistry, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK
| | - Richard H Henchman
- Manchester Institute of Biotechnology, The University of Manchester, 131 Princess Street, Manchester, M1 7DN, UK.
- School of Chemistry, The University of Manchester, Oxford Road, Manchester, M13 9PL, UK.
- Sydney Medical School, The University of Sydney, Sydney, NSW, 2006, Australia.
| |
Collapse
|
27
|
Chen CC, Guo YC. Prediction of minimum ignition energy using quantitative structure activity relationships approach. J Loss Prev Process Ind 2021. [DOI: 10.1016/j.jlp.2021.104443] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
28
|
Gaudin T, Ma H. Substructure shock-friction theory for molecular transport in liquids. J Mol Liq 2021. [DOI: 10.1016/j.molliq.2021.115655] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/22/2022]
|
29
|
Rozanska X, Wimmer E, de Meyer F. Quantitative Kinetic Model of CO 2 Absorption in Aqueous Tertiary Amine Solvents. J Chem Inf Model 2021; 61:1814-1824. [PMID: 33709702 DOI: 10.1021/acs.jcim.0c01386] [Citation(s) in RCA: 10] [Impact Index Per Article: 2.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/30/2022]
Abstract
Aqueous tertiary amine solutions are increasingly used in industrial CO2 capture operations because they are more energy-efficient than primary or secondary amines and demonstrate higher CO2 absorption capacity. Yet, tertiary amine solutions have a significant drawback in that they tend to have lower CO2 absorption rates. To identify tertiary amines that absorb CO2 faster, it would be efficacious to have a quantitative and predictive model of the rate-controlling processes. Despite numerous attempts to date, this goal has been elusive. The present computational approach achieves this goal by focusing on the reaction of CO2 with OH- forming HCO3-. The performance of the resulting model is demonstrated for a consistent experimental data set of the absorption rates of CO2 for 24 different aqueous tertiary amine solvents. The key to the new model's success is the manner in which the free energy barrier for the reaction of CO2 with OH- is evaluated from the differences among the solvation free energies of CO2, OH-, and HCO3-, while the pKa of the amines controls the concentration of OH-. These solvation energies are obtained from molecular dynamics simulations. The experimental value of the free energy of reaction of CO2 with pure water is combined with information about measured rates of absorption of CO2 in an aqueous amine solvent in order to calibrate the absorption rate model. This model achieves a relative accuracy better than 0.1 kJ mol-1 for the free energies of activation for CO2 absorption in aqueous amine solutions and 0.07 g L-1 min-1 for the absorption rate of CO2. Such high accuracies are necessary to predict the correct experimental ranking of CO2 absorption rates, thus providing a quantitative approach of practical interest.
Collapse
Affiliation(s)
- Xavier Rozanska
- Materials Design SARL, 42 avenue Verdier, 92120 Montrouge, France
| | - Erich Wimmer
- Materials Design SARL, 42 avenue Verdier, 92120 Montrouge, France
| | - Frédérick de Meyer
- TOTAL SE, Total Exploration Production, Liquefied Natural Gas - Acid Gas Entity, CCUS R&D Program, 2 Place Jean Milier, 92078 Paris, France.,MINES ParisTech, PSL University, Centre de thermodynamique des procédés (CTP), 35 rue St Honoré, 77300 Fontainebleau, France
| |
Collapse
|
30
|
Chemoinformatics and QSAR. Adv Bioinformatics 2021. [DOI: 10.1007/978-981-33-6191-1_10] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/25/2022] Open
|
31
|
Cai G, Liu Z, Zhang L, Shi Q, Zhao S, Xu C. Systematic performance evaluation of gasoline molecules based on quantitative structure-property relationship models. Chem Eng Sci 2021. [DOI: 10.1016/j.ces.2020.116077] [Citation(s) in RCA: 8] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/24/2022]
|
32
|
Bouarab AF, Harvey JP, Robelin C. Viscosity models for ionic liquids and their mixtures. Phys Chem Chem Phys 2021; 23:733-752. [DOI: 10.1039/d0cp05787h] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.8] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/29/2022]
Abstract
Review of principles and limitations of viscosity models for ionic liquids and their mixtures focusing on the use of inappropriate mixing rules for molten salts.
Collapse
Affiliation(s)
- Anya F. Bouarab
- Centre for Research in Computational Thermochemistry (CRCT)
- Department of Chemical Engineering
- Polytechnique Montréal
- Montréal
- Canada
| | - Jean-Philippe Harvey
- Centre for Research in Computational Thermochemistry (CRCT)
- Department of Chemical Engineering
- Polytechnique Montréal
- Montréal
- Canada
| | - Christian Robelin
- Centre for Research in Computational Thermochemistry (CRCT)
- Department of Chemical Engineering
- Polytechnique Montréal
- Montréal
- Canada
| |
Collapse
|
33
|
Fayet G, Rotureau P. Chemoinformatics for the Safety of Energetic and Reactive Materials at Ineris. Mol Inform 2020; 41:e2000190. [PMID: 33283975 DOI: 10.1002/minf.202000190] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2020] [Accepted: 12/06/2020] [Indexed: 11/07/2022]
Abstract
The characterization of physical hazards of substances is a key information to manage the risks associated to their use, storage and transport. With decades of work in this area, Ineris develops and implements cutting-edge experimental facilities allowing such characterizations at different scales and under various conditions to study all of the dreaded accident scenarios. This review presents the efforts engaged by Ineris more recently in the field of chemoinformatics to develop and use new predictive methods for the anticipation and management of industrials risks associated to energetic and reactive materials as a complement to experiments. An overview of the methods used for the development of Quantitative Structure-Property Relationships for physical hazards are presented and discussed regarding the specificities associated to this class of properties. A review of models developed at Ineris is also provided from the first tentative models on the explosivity of nitro compounds to the successful application to the flammability of organic mixtures. Then, a discussion is proposed on the use of QSPR models. Good practices for robust use for QSPR models are recalled with specific comments related to physical hazards, notably for regulatory purpose. Dissemination and training efforts engaged by Ineris are also presented. The potential offered by these predictive methods in terms of in silico design and for the development of new intrinsically safer technologies in safety-by-design strategies is finally discussed. At last, challenges and perspectives to extend the application of chemoinformatics in the field of safety and in particular for the physical hazards of energetic and reactive substances are proposed.
Collapse
Affiliation(s)
- Guillaume Fayet
- Ineris, Accidental Risk Division, Parc Technologique Alata, 60550, Verneuil-en-Halatte, France
| | - Patricia Rotureau
- Ineris, Accidental Risk Division, Parc Technologique Alata, 60550, Verneuil-en-Halatte, France
| |
Collapse
|
34
|
Zhu K, Müller EA. Generating a Machine-Learned Equation of State for Fluid Properties. J Phys Chem B 2020; 124:8628-8639. [DOI: 10.1021/acs.jpcb.0c05806] [Citation(s) in RCA: 13] [Impact Index Per Article: 2.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Kezheng Zhu
- Department of Chemical Engineering, Imperial College London, London SW7 2AZ, United Kingdom
| | - Erich A. Müller
- Department of Chemical Engineering, Imperial College London, London SW7 2AZ, United Kingdom
| |
Collapse
|
35
|
A quantitative structure activity relationship model for predicting minimum ignition energy of organic substance. J Loss Prev Process Ind 2020. [DOI: 10.1016/j.jlp.2020.104227] [Citation(s) in RCA: 6] [Impact Index Per Article: 1.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/22/2022]
|
36
|
Postnikov EB, Jasiok B, Melent'ev VV, Ryshkova OS, Korotkovskii VI, Radchenko AK, Lowe AR, Chorążewski M. Prediction of high pressure properties of complex mixtures without knowledge of their composition as a problem of thermodynamic linear analysis. J Mol Liq 2020. [DOI: 10.1016/j.molliq.2020.113016] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
37
|
Abstract
The Solvent Similarity Index (SSI) is a quantitative parameter we introduce for the comparison of the solvation properties of any solvent or solvent mixture. The Surface Site Interaction Model for Liquids at Equilibrium (SSIMPLE) was used to calculate the free energy of solvation of a single Surface Site Interaction Point (SSIP) on a solute. The SSIP representation of molecular surfaces was used to calculate the free energy of solvation for all possible solute polarities, generating a unique solvation profile for any solvent or solvent mixture. Quantitative comparison of the solvation profiles of two solvents was used as the basis for calculating the solvation similarity index. Values of SSI were calculated for all pairwise comparisons of 261 pure solvents at 298 K, and the results were used to classify solvents into groups according to their solvation properties. Applications to understanding the solvation properties of binary solvent mixtures and for identification of alternative solvents are illustrated.
Collapse
Affiliation(s)
- Mark D Driver
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.
| | - Christopher A Hunter
- Department of Chemistry, University of Cambridge, Lensfield Road, Cambridge CB2 1EW, UK.
| |
Collapse
|
38
|
Sin KR, Ko SG, Kim CJ, Pak SH, Kim HC, Kim CU. Quantum chemical investigation on interaction of 5-fluorouracil with cucurbiturils. MONATSHEFTE FUR CHEMIE 2020. [DOI: 10.1007/s00706-020-02599-1] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/28/2022]
|
39
|
In Silico Prediction of Critical Micelle Concentration (CMC) of Classic and Extended Anionic Surfactants from Their Molecular Structural Descriptors. ARABIAN JOURNAL FOR SCIENCE AND ENGINEERING 2020. [DOI: 10.1007/s13369-020-04598-0] [Citation(s) in RCA: 3] [Impact Index Per Article: 0.6] [Reference Citation Analysis] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/18/2022]
|
40
|
Estimation of Vapor Pressures of Solvent + Salt Systems with Quadratic Solvation Relationships. J SOLUTION CHEM 2020. [DOI: 10.1007/s10953-020-00983-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 10/24/2022]
|
41
|
Goussard V, Duprat F, Ploix JL, Dreyfus G, Nardello-Rataj V, Aubry JM. A New Machine-Learning Tool for Fast Estimation of Liquid Viscosity. Application to Cosmetic Oils. J Chem Inf Model 2020; 60:2012-2023. [PMID: 32250628 DOI: 10.1021/acs.jcim.0c00083] [Citation(s) in RCA: 15] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The viscosities of pure liquids are estimated at 25 °C, from their molecular structures, using three modeling approaches: group contributions, COSMO-RS σ-moment-based neural networks, and graph machines. The last two are machine-learning methods, whereby models are designed and trained from a database of viscosities of 300 molecules at 25 °C. Group contributions and graph machines make use of the 2D-structures only (the SMILES codes of the molecules), while neural networks estimations are based on a set of five descriptors: COSMO-RS σ-moments. For the first time, leave-one-out is used for graph machine selection, and it is shown that it can be replaced with the much faster virtual leave-one-out algorithm. The database covers a wide diversity of chemical structures, namely, alkanes, ethers, esters, ketones, carbonates, acids, alcohols, silanes, and siloxanes, as well as different chemical backbone, i.e., straight, branched, or cyclic chains. A comparison of the viscosities of liquids of an independent set of 22 cosmetic oils shows that the graph machine approach provides the most accurate results given the available data. The results obtained by the neural network based on sigma-moments and by the graph machines can be duplicated easily by using a demonstration tool based on the Docker technology, available for download as explained in the Supporting Information. This demonstration also allows the reader to predict, at 25 °C, the viscosity of any liquid of moderate molecular size (M < 600 Da) that contains C, H, O, or Si atoms, starting either from its SMILES code or from its σ-moments computed with the COSMOtherm software.
Collapse
Affiliation(s)
- Valentin Goussard
- Université de Lille, CNRS, ENSCL, UMR 8181, UCCS-Unité de Catalyse et de Chimie du Solide, 59655 Villeneuve d'Ascq, France
| | - François Duprat
- Chimie Moléculaire, Macromoléculaire, Matériaux, ESPCI Paris, CNRS, PSL University, 10 rue Vauquelin, 75005 Paris, France
| | - Jean-Luc Ploix
- Chimie Moléculaire, Macromoléculaire, Matériaux, ESPCI Paris, CNRS, PSL University, 10 rue Vauquelin, 75005 Paris, France
| | - Gérard Dreyfus
- Chimie Moléculaire, Macromoléculaire, Matériaux, ESPCI Paris, CNRS, PSL University, 10 rue Vauquelin, 75005 Paris, France
| | - Véronique Nardello-Rataj
- Université de Lille, CNRS, ENSCL, UMR 8181, UCCS-Unité de Catalyse et de Chimie du Solide, 59655 Villeneuve d'Ascq, France
| | - Jean-Marie Aubry
- Université de Lille, CNRS, ENSCL, UMR 8181, UCCS-Unité de Catalyse et de Chimie du Solide, 59655 Villeneuve d'Ascq, France
| |
Collapse
|
42
|
Işık M, Bergazin TD, Fox T, Rizzi A, Chodera JD, Mobley DL. Assessing the accuracy of octanol-water partition coefficient predictions in the SAMPL6 Part II log P Challenge. J Comput Aided Mol Des 2020; 34:335-370. [PMID: 32107702 PMCID: PMC7138020 DOI: 10.1007/s10822-020-00295-0] [Citation(s) in RCA: 29] [Impact Index Per Article: 5.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/07/2019] [Accepted: 01/24/2020] [Indexed: 12/12/2022]
Abstract
The SAMPL Challenges aim to focus the biomolecular and physical modeling community on issues that limit the accuracy of predictive modeling of protein-ligand binding for rational drug design. In the SAMPL5 log D Challenge, designed to benchmark the accuracy of methods for predicting drug-like small molecule transfer free energies from aqueous to nonpolar phases, participants found it difficult to make accurate predictions due to the complexity of protonation state issues. In the SAMPL6 log P Challenge, we asked participants to make blind predictions of the octanol-water partition coefficients of neutral species of 11 compounds and assessed how well these methods performed absent the complication of protonation state effects. This challenge builds on the SAMPL6 p[Formula: see text] Challenge, which asked participants to predict p[Formula: see text] values of a superset of the compounds considered in this log P challenge. Blind prediction sets of 91 prediction methods were collected from 27 research groups, spanning a variety of quantum mechanics (QM) or molecular mechanics (MM)-based physical methods, knowledge-based empirical methods, and mixed approaches. There was a 50% increase in the number of participating groups and a 20% increase in the number of submissions compared to the SAMPL5 log D Challenge. Overall, the accuracy of octanol-water log P predictions in SAMPL6 Challenge was higher than cyclohexane-water log D predictions in SAMPL5, likely because modeling only the neutral species was necessary for log P and several categories of method benefited from the vast amounts of experimental octanol-water log P data. There were many highly accurate methods: 10 diverse methods achieved RMSE less than 0.5 log P units. These included QM-based methods, empirical methods, and mixed methods with physical modeling supported with empirical corrections. A comparison of physical modeling methods showed that QM-based methods outperformed MM-based methods. The average RMSE of the most accurate five MM-based, QM-based, empirical, and mixed approach methods based on RMSE were 0.92 ± 0.13, 0.48 ± 0.06, 0.47 ± 0.05, and 0.50 ± 0.06, respectively.
Collapse
Affiliation(s)
- Mehtap Işık
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA.
- Tri-Institutional PhD Program in Chemical Biology, Weill Cornell Graduate School of Medical Sciences, Cornell University, New York, NY, 10065, USA.
| | | | - Thomas Fox
- Computational Chemistry, Medicinal Chemistry, Boehringer Ingelheim Pharma GmbH & Co KG, 88397, Biberach, Germany
| | - Andrea Rizzi
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
- Tri-Institutional Training Program in Computational Biology and Medicine, New York, NY, 10065, USA
| | - John D Chodera
- Computational and Systems Biology Program, Sloan Kettering Institute, Memorial Sloan Kettering Cancer Center, New York, NY, 10065, USA
| | - David L Mobley
- Department of Pharmaceutical Sciences, University of California, Irvine, CA, 92697, USA
- Department of Chemistry, University of California, Irvine, CA, 92697, USA
| |
Collapse
|
43
|
NguyenHuynh D, Luu MT, Mai CTQ, Tran STK. Free-volume theory coupled with modified group-contribution PC-SAFT for predicting viscosities of 1-alkenes. KOREAN J CHEM ENG 2020. [DOI: 10.1007/s11814-019-0473-x] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.2] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
44
|
Computer-aided molecular and processes design based on quantum chemistry: current status and future prospects. Curr Opin Chem Eng 2020. [DOI: 10.1016/j.coche.2019.11.007] [Citation(s) in RCA: 35] [Impact Index Per Article: 7.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/07/2023]
|
45
|
Sun X, Krakauer NJ, Politowicz A, Chen WT, Li Q, Li Z, Shao X, Sunaryo A, Shen M, Wang J, Morgan D. Assessing Graph-based Deep Learning Models for Predicting Flash Point. Mol Inform 2020; 39:e1900101. [PMID: 32077235 DOI: 10.1002/minf.201900101] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.4] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2019] [Accepted: 01/28/2020] [Indexed: 11/06/2022]
Abstract
Flash points of organic molecules play an important role in preventing flammability hazards and large databases of measured values exist, although millions of compounds remain unmeasured. To rapidly extend existing data to new compounds many researchers have used quantitative structure-property relationship (QSPR) analysis to effectively predict flash points. In recent years graph-based deep learning (GBDL) has emerged as a powerful alternative method to traditional QSPR. In this paper, GBDL models were implemented in predicting flash point for the first time. We assessed the performance of two GBDL models, message-passing neural network (MPNN) and graph convolutional neural network (GCNN), by comparing against 12 previous QSPR studies using more traditional methods. Our result shows that MPNN both outperforms GCNN and yields slightly worse but comparable performance with previous QSPR studies. The average R 2 and Mean Absolute Error (MAE) scores of MPNN are, respectively, 2.3 % lower and 2.0 K higher than previous comparable studies. To further explore GBDL models, we collected the largest flash point dataset to date, which contains 10575 unique molecules. The optimized MPNN gives a test data R 2 of 0.803 and MAE of 17.8 K on the complete dataset. We also extracted 5 datasets from our integrated dataset based on molecular types (acids, organometallics, organogermaniums, organosilicons, and organotins) and explore the quality of the model in these classes.
Collapse
Affiliation(s)
- Xiaoyu Sun
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Nathaniel J Krakauer
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Alexander Politowicz
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Wei-Ting Chen
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Qiying Li
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Zuoyi Li
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Xianjia Shao
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Alfred Sunaryo
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Mingren Shen
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - James Wang
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| | - Dane Morgan
- Dept. of Materials Science and Engineering, 244 MSE, University of Wisconsin, Madison, 53562
| |
Collapse
|
46
|
Shirmohammadi M, Mohammadinasab E, Bayat Z. Prediction of Lipophilicity of some Quinolone Derivatives by using Quantitative Structure-Activity Relationship. Curr Drug Discov Technol 2019; 18:83-94. [PMID: 31701848 DOI: 10.2174/1570163816666191108145026] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/28/2019] [Revised: 07/24/2019] [Accepted: 09/27/2019] [Indexed: 11/22/2022]
Abstract
OBJECTIVES Quantitative structure activity relationship (QSAR) was used to study the partition coefficient of some quinolones and their derivatives. METHODS These molecules are broad-spectrum antibiotic pharmaceutics. First, data were divided into two categories of train and test (validation) sets using a random selection method. Second, three approaches, including stepwise selection (STS) (forward), genetic algorithm (GA), and simulated annealing (SA) were used to select the descriptors, to examine the effect feature selection methods. To find the relation between descriptors and partition coefficient, multiple linear regression (MLR), principal component regression (PCR) and partial least squares (PLS) were used. RESULTS QSAR study showed that both regression and descriptor selection methods have a vital role in the results. Different statistical metrics showed that the MLR-SA approach with (r2=0.96, q2=0.91, pred_r2=0.95) gives the best outcome. CONCLUSION The proposed expression by the MLR-SA approach can be used in the better design of novel quinolones and their derivatives.
Collapse
Affiliation(s)
| | | | - Zakiyeh Bayat
- Department of Chemistry, Quchan Branch, Islamic Azad University, Quchan, Iran
| |
Collapse
|
47
|
Gantzer P, Creton B, Nieto-Draghi C. Inverse-QSPR for de novo Design: A Review. Mol Inform 2019; 39:e1900087. [PMID: 31682079 DOI: 10.1002/minf.201900087] [Citation(s) in RCA: 12] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/17/2019] [Accepted: 11/04/2019] [Indexed: 11/09/2022]
Abstract
The use of computer tools to solve chemistry-related problems has given rise to a large and increasing number of publications these last decades. This new field of science is now well recognized and labelled Chemoinformatics. Among all chemoinformatics techniques, the use of statistical based approaches for property predictions has been the subject of numerous research reflecting both new developments and many cases of applications. The so obtained predictive models relating a property to molecular features - descriptors - are gathered under the acronym QSPR, for Quantitative Structure Property Relationships. Apart from the obvious use of such models to predict property values for new compounds, their use to virtually synthesize new molecules - de novo design - is currently a high-interest subject. Inverse-QSPR (i-QSPR) methods have hence been developed to accelerate the discovery of new materials that meet a set of specifications. In the proposed manuscript, we review existing i-QSPR methodologies published in the open literature in a way to highlight developments, applications, improvements and limitations of each.
Collapse
Affiliation(s)
- Philippe Gantzer
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852, Rueil-Malmaison, France
| | - Benoit Creton
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852, Rueil-Malmaison, France
| | - Carlos Nieto-Draghi
- IFP Energies nouvelles, 1 et 4 avenue de Bois-Préau, 92852, Rueil-Malmaison, France
| |
Collapse
|
48
|
Batool M, Ahmad B, Choi S. A Structure-Based Drug Discovery Paradigm. Int J Mol Sci 2019; 20:ijms20112783. [PMID: 31174387 PMCID: PMC6601033 DOI: 10.3390/ijms20112783] [Citation(s) in RCA: 312] [Impact Index Per Article: 52.0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/10/2019] [Revised: 05/31/2019] [Accepted: 06/04/2019] [Indexed: 12/14/2022] Open
Abstract
Structure-based drug design is becoming an essential tool for faster and more cost-efficient lead discovery relative to the traditional method. Genomic, proteomic, and structural studies have provided hundreds of new targets and opportunities for future drug discovery. This situation poses a major problem: the necessity to handle the “big data” generated by combinatorial chemistry. Artificial intelligence (AI) and deep learning play a pivotal role in the analysis and systemization of larger data sets by statistical machine learning methods. Advanced AI-based sophisticated machine learning tools have a significant impact on the drug discovery process including medicinal chemistry. In this review, we focus on the currently available methods and algorithms for structure-based drug design including virtual screening and de novo drug design, with a special emphasis on AI- and deep-learning-based methods used for drug discovery.
Collapse
Affiliation(s)
- Maria Batool
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, Korea.
| | - Bilal Ahmad
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, Korea.
| | - Sangdun Choi
- Department of Molecular Science and Technology, Ajou University, Suwon 16499, Korea.
| |
Collapse
|
49
|
A Way towards Reliable Predictive Methods for the Prediction of Physicochemical Properties of Chemicals Using the Group Contribution and other Methods. APPLIED SCIENCES-BASEL 2019. [DOI: 10.3390/app9081700] [Citation(s) in RCA: 4] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/17/2022]
Abstract
Physicochemical properties of chemicals as referred to in this review include, for example, thermodynamic properties such as heat of formation, boiling point, toxicity of molecules and the fate of molecules whenever undergoing or accelerating (catalytic) a chemical reaction and therewith about chemical equilibrium, that is, the equilibrium in chemical reactions. All such properties have been predicted in literature by a variety of methods. However, for the experimental scientist for whom such predictions are of relevance, the accuracies are often far from sufficient for reliable application We discuss current practices and suggest how one could arrive at better, that is sufficiently accurate and reliable, predictive methods. Some recently published examples have shown this to be possible in practical cases. In summary, this review focuses on methodologies to obtain the required accuracies for the chemical practitioner and process technologist designing chemical processes. Finally, something almost never explicitly mentioned is the fact that whereas for some practical cases very accurate predictions are required, for other cases a qualitatively correct picture with relatively low correlation coefficients can be sufficient as a valuable predictive tool. Requirements for acceptable predictive methods can therefore be significantly different depending on the actual application, which are illustrated using real-life examples, primarily with industrial relevance. Furthermore, for specific properties such as the octanol-water partition coefficient more close collaboration between research groups using different methods would greatly facilitate progress in the field of predictive modelling.
Collapse
|
50
|
Pan Y, Ji X, Ding L, Jiang J. Prediction of Lower Flammability Limits for Binary Hydrocarbon Gases by Quantitative Structure-A Property Relationship Approach. Molecules 2019; 24:E748. [PMID: 30791456 PMCID: PMC6413142 DOI: 10.3390/molecules24040748] [Citation(s) in RCA: 11] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/12/2019] [Revised: 02/13/2019] [Accepted: 02/15/2019] [Indexed: 12/15/2022] Open
Abstract
The lower flammability limit (LFL) is one of the most important parameters for evaluating the fire and explosion hazards of flammable gases or vapors. This study proposed quantitative structure-property relationship (QSPR) models to predict the LFL of binary hydrocarbon gases from their molecular structures. Twelve different mixing rules were employed to derive mixture descriptors for describing the structures characteristics of a series of 181 binary hydrocarbon mixtures. Genetic algorithm (GA)-based multiple linear regression (MLR) was used to select the most statistically effective mixture descriptors on the LFL of binary hydrocarbon gases. A total of 12 multilinear models were obtained based on the different mathematical formulas. The best model, issued from the norm of the molar contribution formula, was achieved as a six-parameter model. The best model was then rigorously validated using multiple strategies and further extensively compared to the previously published model. The results demonstrated the robustness, validity, and satisfactory predictivity of the proposed model. The applicability domain (AD) of the model was defined as well. The proposed best model would be expected to present an alternative to predict the LFL values of existing or new binary hydrocarbon gases, and provide some guidance for prioritizing the design of safer blended gases with desired properties.
Collapse
Affiliation(s)
- Yong Pan
- Jiangsu Key Laboratory of Hazardous Chemicals Safety and Control, College of Safety Science and Engineering, Nanjing Tech University, Nanjing 210009, China.
| | - Xianke Ji
- Jiangsu Key Laboratory of Hazardous Chemicals Safety and Control, College of Safety Science and Engineering, Nanjing Tech University, Nanjing 210009, China.
| | - Li Ding
- Jiangsu Key Laboratory of Hazardous Chemicals Safety and Control, College of Safety Science and Engineering, Nanjing Tech University, Nanjing 210009, China.
| | - Juncheng Jiang
- Jiangsu Key Laboratory of Hazardous Chemicals Safety and Control, College of Safety Science and Engineering, Nanjing Tech University, Nanjing 210009, China.
| |
Collapse
|