1
|
Panwar P, Yang Q, Martini A. Temperature-Dependent Density and Viscosity Prediction for Hydrocarbons: Machine Learning and Molecular Dynamics Simulations. J Chem Inf Model 2024; 64:2760-2774. [PMID: 37582234 DOI: 10.1021/acs.jcim.3c00231] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/17/2023]
Abstract
Machine learning-based predictive models allow rapid and reliable prediction of material properties and facilitate innovative materials design. Base oils used in the formulation of lubricant products are complex hydrocarbons of varying sizes and structure. This study developed Gaussian process regression-based models to accurately predict the temperature-dependent density and dynamic viscosity of 305 complex hydrocarbons. In our approach, strongly correlated/collinear predictors were trimmed, important predictors were selected by least absolute shrinkage and selection operator (LASSO) regularization and prior domain knowledge, hyperparameters were systematically optimized by Bayesian optimization, and the models were interpreted. The approach provided versatile and quantitative structure-property relationship (QSPR) models with relatively simple predictors for determining the dynamic viscosity and density of complex hydrocarbons at any temperature. In addition, we developed molecular dynamics simulation-based descriptors and evaluated the feasibility and versatility of dynamic descriptors from simulations for predicting the material properties. It was found that the models developed using a comparably smaller pool of dynamic descriptors performed similarly in predicting density and viscosity to models based on many more static descriptors. The best models were shown to predict density and dynamic viscosity with coefficient of determination (R2) values of 99.6% and 97.7%, respectively, for all data sets, including a test data set of 45 molecules. Finally, partial dependency plots (PDPs), individual conditional expectation (ICE) plots, local interpretable model-agnostic explanation (LIME) values, and trimmed model R2 values were used to identify the most important static and dynamic predictors of the density and viscosity.
Collapse
Affiliation(s)
- Pawan Panwar
- Department of Mechanical Engineering, University of California Merced, 5200 North Lake Road, Merced, California 95343, United States
| | - Quanpeng Yang
- Department of Mechanical Engineering, University of California Merced, 5200 North Lake Road, Merced, California 95343, United States
| | - Ashlie Martini
- Department of Mechanical Engineering, University of California Merced, 5200 North Lake Road, Merced, California 95343, United States
| |
Collapse
|
2
|
Zhu Q, Gu Y, Hu L, Gaudin T, Fan M, Ma J. Shear viscosity prediction of alcohols, hydrocarbons, halogenated, carbonyl, nitrogen-containing, and sulfur compounds using the variable force fields. J Chem Phys 2021; 154:074502. [PMID: 33607909 DOI: 10.1063/5.0038267] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.7] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/15/2022] Open
Abstract
Viscosity of organic liquids is an important physical property in applications of printing, pharmaceuticals, oil extracting, engineering, and chemical processes. Experimental measurement is a direct but time-consuming process. Accurately predicting the viscosity with a broad range of chemical diversity is still a great challenge. In this work, a protocol named Variable Force Field (VaFF) was implemented to efficiently vary the force field parameters, especially λvdW, for the van der Waals term for the shear viscosity prediction of 75 organic liquid molecules with viscosity ranging from -9 to 0 in their nature logarithm and containing diverse chemical functional groups, such as alcoholic hydroxyl, carbonyl, and halogenated groups. Feature learning was applied for the viscosity prediction, and the selected features indicated that the hydrogen bonding interactions and the number of atoms and rings play important roles in the property of viscosity. The shear viscosity prediction of alcohols is very difficult owing to the existence of relative strong intermolecular hydrogen bonding interaction as reflected by density functional theory binding energies. From radial and spatial distribution functions of methanol, we found that the van der Waals related parameters λvdW are more crucial to the viscosity prediction than the rotation related parameters, λtor. With the variable λvdW-based all-atom optimized potentials for liquid simulations force field, a great improvement was observed in the viscosity prediction for alcohols. The simplicity and uniformity of VaFF make it an efficient tool for the prediction of viscosity and other related properties in the rational design of materials with the specific properties.
Collapse
Affiliation(s)
- Qiang Zhu
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People's Republic of China
| | - Yuming Gu
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People's Republic of China
| | - Limu Hu
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People's Republic of China
| | - Théophile Gaudin
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People's Republic of China
| | - Mengting Fan
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People's Republic of China
| | - Jing Ma
- Key Laboratory of Mesoscopic Chemistry of Ministry of Education Institute of Theoretical and Computational Chemistry School of Chemistry and Chemical Engineering, Nanjing University, Nanjing 210023, People's Republic of China
| |
Collapse
|
3
|
Goussard V, Duprat F, Ploix JL, Dreyfus G, Nardello-Rataj V, Aubry JM. A New Machine-Learning Tool for Fast Estimation of Liquid Viscosity. Application to Cosmetic Oils. J Chem Inf Model 2020; 60:2012-2023. [PMID: 32250628 DOI: 10.1021/acs.jcim.0c00083] [Citation(s) in RCA: 9] [Impact Index Per Article: 2.3] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
The viscosities of pure liquids are estimated at 25 °C, from their molecular structures, using three modeling approaches: group contributions, COSMO-RS σ-moment-based neural networks, and graph machines. The last two are machine-learning methods, whereby models are designed and trained from a database of viscosities of 300 molecules at 25 °C. Group contributions and graph machines make use of the 2D-structures only (the SMILES codes of the molecules), while neural networks estimations are based on a set of five descriptors: COSMO-RS σ-moments. For the first time, leave-one-out is used for graph machine selection, and it is shown that it can be replaced with the much faster virtual leave-one-out algorithm. The database covers a wide diversity of chemical structures, namely, alkanes, ethers, esters, ketones, carbonates, acids, alcohols, silanes, and siloxanes, as well as different chemical backbone, i.e., straight, branched, or cyclic chains. A comparison of the viscosities of liquids of an independent set of 22 cosmetic oils shows that the graph machine approach provides the most accurate results given the available data. The results obtained by the neural network based on sigma-moments and by the graph machines can be duplicated easily by using a demonstration tool based on the Docker technology, available for download as explained in the Supporting Information. This demonstration also allows the reader to predict, at 25 °C, the viscosity of any liquid of moderate molecular size (M < 600 Da) that contains C, H, O, or Si atoms, starting either from its SMILES code or from its σ-moments computed with the COSMOtherm software.
Collapse
Affiliation(s)
- Valentin Goussard
- Université de Lille, CNRS, ENSCL, UMR 8181, UCCS-Unité de Catalyse et de Chimie du Solide, 59655 Villeneuve d'Ascq, France
| | - François Duprat
- Chimie Moléculaire, Macromoléculaire, Matériaux, ESPCI Paris, CNRS, PSL University, 10 rue Vauquelin, 75005 Paris, France
| | - Jean-Luc Ploix
- Chimie Moléculaire, Macromoléculaire, Matériaux, ESPCI Paris, CNRS, PSL University, 10 rue Vauquelin, 75005 Paris, France
| | - Gérard Dreyfus
- Chimie Moléculaire, Macromoléculaire, Matériaux, ESPCI Paris, CNRS, PSL University, 10 rue Vauquelin, 75005 Paris, France
| | - Véronique Nardello-Rataj
- Université de Lille, CNRS, ENSCL, UMR 8181, UCCS-Unité de Catalyse et de Chimie du Solide, 59655 Villeneuve d'Ascq, France
| | - Jean-Marie Aubry
- Université de Lille, CNRS, ENSCL, UMR 8181, UCCS-Unité de Catalyse et de Chimie du Solide, 59655 Villeneuve d'Ascq, France
| |
Collapse
|
4
|
Kim SY, Lee B. Adaptive prediction model for fluidized catalytic cracking processes based on the PLS method. ASIA-PAC J CHEM ENG 2018. [DOI: 10.1002/apj.2191] [Citation(s) in RCA: 2] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/07/2022]
Affiliation(s)
- Sung Young Kim
- Department of Chemical Engineering, College of Engineering; Dong-A University; Saha-gu Busan 49315 South Korea
| | - Bomsock Lee
- Department of Chemical Engineering, College of Engineering; Kyung Hee University; Yongin-si Kyunggi-do 17104 South Korea
| |
Collapse
|
5
|
Application of a General Computer Algorithm Based on the Group-Additivity Method for the Calculation of Two Molecular Descriptors at Both Ends of Dilution: Liquid Viscosity and Activity Coefficient in Water at Infinite Dilution. Molecules 2017; 23:molecules23010005. [PMID: 29267187 PMCID: PMC5943952 DOI: 10.3390/molecules23010005] [Citation(s) in RCA: 8] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/10/2017] [Revised: 12/16/2017] [Accepted: 12/19/2017] [Indexed: 11/17/2022] Open
Abstract
The application of a commonly used computer algorithm based on the group-additivity method for the calculation of the liquid viscosity coefficient at 293.15 K and the activity coefficient at infinite dilution in water at 298.15 K of organic molecules is presented. The method is based on the complete breakdown of the molecules into their constituting atoms, further subdividing them by their immediate neighborhood. A fast Gauss–Seidel fitting method using experimental data from literature is applied for the calculation of the atom groups’ contributions. Plausibility tests have been carried out on each of the calculations using a ten-fold cross-validation procedure which confirms the excellent predictive quality of the method. The goodness of fit (Q2) and the standard deviation (σ) of the cross-validation calculations for the viscosity coefficient, expressed as log(η), was 0.9728 and 0.11, respectively, for 413 test molecules, and for the activity coefficient log(γ)∞ the corresponding values were 0.9736 and 0.31, respectively, for 621 test compounds. The present approach has proven its versatility in that it enabled the simultaneous evaluation of the liquid viscosity of normal organic compounds as well as of ionic liquids.
Collapse
|
6
|
|
7
|
Katritzky AR, Kuanar M, Slavov S, Hall CD, Karelson M, Kahn I, Dobchev DA. Quantitative Correlation of Physical and Chemical Properties with Chemical Structure: Utility for Prediction. Chem Rev 2010; 110:5714-89. [DOI: 10.1021/cr900238d] [Citation(s) in RCA: 386] [Impact Index Per Article: 27.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
Affiliation(s)
- Alan R. Katritzky
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - Minati Kuanar
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - Svetoslav Slavov
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - C. Dennis Hall
- Center for Heterocyclic Compounds, Department of Chemistry, University of Florida, Gainesville, Florida 32611
| | - Mati Karelson
- Institute of Chemistry, Tallinn University of Technology, Akadeemia tee 15, Tallinn 19086, Estonia, and MolCode, Ltd., Soola 8, Tartu 51013, Estonia
| | - Iiris Kahn
- Institute of Chemistry, Tallinn University of Technology, Akadeemia tee 15, Tallinn 19086, Estonia, and MolCode, Ltd., Soola 8, Tartu 51013, Estonia
| | - Dimitar A. Dobchev
- Institute of Chemistry, Tallinn University of Technology, Akadeemia tee 15, Tallinn 19086, Estonia, and MolCode, Ltd., Soola 8, Tartu 51013, Estonia
| |
Collapse
|
8
|
|
9
|
Prediction of the surface tension of binary systems based on the partial least squares method. KOREAN J CHEM ENG 2009. [DOI: 10.1007/s11814-009-0058-1] [Citation(s) in RCA: 8] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/26/2022]
|
10
|
Rajappan R, Shingade PD, Natarajan R, Jayaraman VK. Quantitative Structure−Property Relationship (QSPR) Prediction of Liquid Viscosities of Pure Organic Compounds Employing Random Forest Regression. Ind Eng Chem Res 2009. [DOI: 10.1021/ie8018406] [Citation(s) in RCA: 16] [Impact Index Per Article: 1.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 01/13/2023]
Affiliation(s)
- Remya Rajappan
- Centre for Mathematical Sciences Pala Campus, Arunapuram, Kerala, India 686 574, Chemical Engineering and Process Development Division, National Chemical Laboratory, Pune, India 411 008, and Department of Chemical Engineering, Lakehead University, 955 Oliver Road Thunder Bay, ON, Canada P7B 5E1
| | - Prashant D. Shingade
- Centre for Mathematical Sciences Pala Campus, Arunapuram, Kerala, India 686 574, Chemical Engineering and Process Development Division, National Chemical Laboratory, Pune, India 411 008, and Department of Chemical Engineering, Lakehead University, 955 Oliver Road Thunder Bay, ON, Canada P7B 5E1
| | - Ramanathan Natarajan
- Centre for Mathematical Sciences Pala Campus, Arunapuram, Kerala, India 686 574, Chemical Engineering and Process Development Division, National Chemical Laboratory, Pune, India 411 008, and Department of Chemical Engineering, Lakehead University, 955 Oliver Road Thunder Bay, ON, Canada P7B 5E1
| | - Valadi K. Jayaraman
- Centre for Mathematical Sciences Pala Campus, Arunapuram, Kerala, India 686 574, Chemical Engineering and Process Development Division, National Chemical Laboratory, Pune, India 411 008, and Department of Chemical Engineering, Lakehead University, 955 Oliver Road Thunder Bay, ON, Canada P7B 5E1
| |
Collapse
|
11
|
Multivariate Statistical Analysis Approach to Predict the Reactor Properties and the Product Quality of a Direct Esterification Reactor for PET Synthesis. ACTA ACUST UNITED AC 2005. [DOI: 10.5302/j.icros.2005.11.6.550] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.1] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/05/2022]
|
12
|
Suzuki T, Ebert RU, Schüürmann G. Application of neural networks to modeling and estimating temperature-dependent liquid viscosity of organic compounds. JOURNAL OF CHEMICAL INFORMATION AND COMPUTER SCIENCES 2001; 41:776-90. [PMID: 11410058 DOI: 10.1021/ci000154y] [Citation(s) in RCA: 20] [Impact Index Per Article: 0.9] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Abstract
Back-propagation neural network models for correlating and predicting the viscosity-temperature behavior of a large variety of organic liquids were developed. Experimental values for the liquid viscosity for 1229 data points from 440 compounds containing C, H, N, O, S, and all halogens have been collected from the literature. The data ranges covered are from -120 to 160 degrees C for temperature and from 0.164 (trans-2-pentene at 20 degrees C) to 1.34 x 10(5) (glycerol at -20 degrees C) mPa.s for viscosity value. After dividing the total database of 440 compounds into training (237 with 673 data points), validation (124 with 423 data points), and test (79 with 133 data points) sets, the modeling performance of two separate neural network models with different architectures, one based on a compound-specific temperature dependence and the second based on a compound-independent one, has been examined. The resulting former model showed somewhat better modeling performance than latter, and the model gave squared correlation coefficients of 0.956, 0.932, and 0.884 and root mean-squares errors of 0.122, 0.134, and 0.148 log units for the training, validation, and test sets, respectively. The input descriptors include molar refraction, critical temperature, molar magnetic susceptibility, cohesive energy, temperatures, and five kinds of indicator variables for functionalities, alcohols/phenols, nitriles, amines, amides, and aliphatic ring. The reliability of the proposed model was assessed by comparing the results against calculated viscosities by two existing group-contribution approaches, the method of van Velzen et al. and the Joback and Reid method.
Collapse
Affiliation(s)
- T Suzuki
- Chemical Resources Laboratory, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226-8503, Japan.
| | | | | |
Collapse
|
13
|
Katritzky AR, Chen K, Wang Y, Karelson M, Lucic B, Trinajstic N, Suzuki T, Sch��rmann G. Prediction of liquid viscosity for organic compounds by a quantitative structure-property relationship. J PHYS ORG CHEM 2000. [DOI: 10.1002/(sici)1099-1395(200001)13:1<80::aid-poc179>3.0.co;2-8] [Citation(s) in RCA: 43] [Impact Index Per Article: 1.8] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/10/2022]
|
14
|
Ivanciuc O, Ivanciuc T, Filip PA, Cabrol-Bass D. Estimation of the Liquid Viscosity of Organic Compounds with a Quantitative Structure−Property Model. ACTA ACUST UNITED AC 1999. [DOI: 10.1021/ci980117v] [Citation(s) in RCA: 41] [Impact Index Per Article: 1.6] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/28/2022]
|
15
|
Suzuki T, Ebert RU, Schüürmann G. Development of Both Linear and Nonlinear Methods To Predict the Liquid Viscosity at 20 °C of Organic Compounds. ACTA ACUST UNITED AC 1997. [DOI: 10.1021/ci9704468] [Citation(s) in RCA: 27] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/29/2022]
Affiliation(s)
- Takahiro Suzuki
- Research Laboratory of Resources Utilization, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226, Japan, and Department of Chemical Ecotoxicology, UFZ Centre for Environmental Research, Permoserstrasse 15, D-04318 Leipzig, Germany
| | - Ralf-Uwe Ebert
- Research Laboratory of Resources Utilization, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226, Japan, and Department of Chemical Ecotoxicology, UFZ Centre for Environmental Research, Permoserstrasse 15, D-04318 Leipzig, Germany
| | - Gerrit Schüürmann
- Research Laboratory of Resources Utilization, Tokyo Institute of Technology, 4259 Nagatsuta, Midori-ku, Yokohama 226, Japan, and Department of Chemical Ecotoxicology, UFZ Centre for Environmental Research, Permoserstrasse 15, D-04318 Leipzig, Germany
| |
Collapse
|