1
|
Cheng Y, Araki T, Kamimura N, Masai E, Nakamura M, Manzhos S, Michinobu T. Accelerated Development of Novel Biomass-Based Polyurethane Adhesives via Machine Learning. ACS APPLIED MATERIALS & INTERFACES 2025; 17:15959-15968. [PMID: 40017372 PMCID: PMC11912195 DOI: 10.1021/acsami.4c20371] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/20/2024] [Revised: 01/18/2025] [Accepted: 02/20/2025] [Indexed: 03/01/2025]
Abstract
2-Pyrone-4,6-dicarboxylic acid (PDC) can be produced on a large scale from lignin by transformed bacteria, and its use as a bifunctional monomer to synthesize biomass-based polymers has been reported. Recently, excellent adhesive properties of the resulting biomass-based polymers were also reported, but their performance has not yet been optimized. In this study, we focus on improving the adhesive properties of PDC-based polyurethanes (PUs) by combining experiments and machine learning (ML). We synthesized an initial data set of 25 adhesive samples from different polyols and isocyanates with different isocyanate-to-hydroxyl ratios (r). Adhesive strengths were measured after hot-pressing at varying temperatures (Theat, °C) and durations (theat, h), following a Taguchi L25 orthogonal design. Gaussian process-based Bayesian optimization (BO) was employed to identify an optimal PDC-based PU adhesive as a function of polyol type, isocyanate type, r ratio, heating temperature, and time with an improved adhesive strength of 10.04 ± 1.26 MPa after only five iterations. This approach highlights the effectiveness of BO in guiding experimental conditions for an enhanced performance. Random Forest regression was also used as an alternative ML approach and supported the conclusions. Overall, this study demonstrates the potential of the BO in accelerating the development and optimization of novel adhesive materials.
Collapse
Affiliation(s)
- Ye Cheng
- Department of Materials Science and Engineering, Institute of Science Tokyo, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8552, Japan
| | - Takuma Araki
- Department of Forest Resource Chemistry, Forestry and Forest Products Research Institute, Tsukuba, Ibaraki 305-8687, Japan
| | - Naofumi Kamimura
- Department of Materials Science and Bioengineering, Nagaoka University of Technology, Nagaoka, Niigata 940-2188, Japan
| | - Eiji Masai
- Department of Materials Science and Bioengineering, Nagaoka University of Technology, Nagaoka, Niigata 940-2188, Japan
| | - Masaya Nakamura
- Department of Forest Resource Chemistry, Forestry and Forest Products Research Institute, Tsukuba, Ibaraki 305-8687, Japan
| | - Sergei Manzhos
- Department of Chemical Science and Engineering, Institute of Science Tokyo, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8550, Japan
| | - Tsuyoshi Michinobu
- Department of Materials Science and Engineering, Institute of Science Tokyo, 2-12-1 Ookayama, Meguro-ku, Tokyo 152-8552, Japan
| |
Collapse
|
2
|
Koch D, Pavanello M, Shao X, Ihara M, Ayers PW, Matta CF, Jenkins S, Manzhos S. The Analysis of Electron Densities: From Basics to Emergent Applications. Chem Rev 2024; 124:12661-12737. [PMID: 39545704 DOI: 10.1021/acs.chemrev.4c00297] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 11/17/2024]
Abstract
The electron density determines all properties of a system of nuclei and electrons. It is both computable and observable. Its topology allows gaining insight into the mechanisms of bonding and other phenomena in a way that is complementary to and beyond that available from the molecular orbital picture and the formal oxidation state (FOS) formalism. The ability to derive mechanistic insight from electron density is also important with methods where orbitals are not available, such as orbital-free density functional theory (OF-DFT). While density topology-based analyses such as QTAIM (quantum theory of atoms-in-molecules) have been widely used, novel, vector-based techniques recently emerged such as next-generation (NG) QTAIM. Density-dependent quantities are also actively used in machine learning (ML)-based methods, in particular, for ML DFT functional development, including machine-learnt kinetic energy functionals. We review QTAIM and its recent extensions such as NG-QTAIM and localization-delocalization matrices (LDM) and their uses in the analysis of bonding, conformations, mechanisms of redox reactions excitations, as well as ultrafast phenomena. We review recent research showing that direct density analysis can circumvent certain pitfalls of the FOS formalism, in particular in the description of anionic redox, and of the widely used (spherically) projected density of states analysis. We discuss uses of density-based quantities for the construction of DFT functionals and prospects of applications of analyses of density topology to get mechanistic insight with OF-DFT and recently developed time-dependent OF-DFT.
Collapse
Affiliation(s)
- Daniel Koch
- Centre Énergie Matériaux Télécommunications, Institut National de la Recherche Scientifique, 1650 boulevard Lionel-Boulet, Varennes, Quebec J3X 1S2, Canada
| | - Michele Pavanello
- Department of Physics, Rutgers University, 101 Warren Street, Newark, New Jersey 07102, United States
- Department of Chemistry, Rutgers University, 73 Warren Street, Newark, New Jersey 07102, United States
| | - Xuecheng Shao
- Department of Chemistry, Rutgers University, 73 Warren Street, Newark, New Jersey 07102, United States
| | - Manabu Ihara
- School of Materials and Chemical Technology, Institute of Science Tokyo, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan
| | - Paul W Ayers
- Department of Chemistry and Chemical Biology, McMaster University, 25-1280 Main Street West, Hamilton, Ontario L8S 4M1, Canada
| | - Chérif F Matta
- Department of Chemistry and Physics, Mount Saint Vincent University, 166 Bedford Highway, Halifax, Nova Scotia B3M 2J6, Canada
| | - Samantha Jenkins
- College of Chemistry and Chemical Engineering, Hunan Normal University, 36 Lushan Road, Changsha, Hunan 410081, People's Republic of China
| | - Sergei Manzhos
- School of Materials and Chemical Technology, Institute of Science Tokyo, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan
| |
Collapse
|
3
|
Zhao R, Wang G, Li F, Wang J, Zhang Y, Li D, Liu S, Li J, Song J, Wei F, Wang C. Developing Machine Learning-Based Predictive Models for Hallux Valgus Recurrence Based on Measurements From Radiographs. Foot Ankle Int 2024; 45:1000-1008. [PMID: 38872342 DOI: 10.1177/10711007241256648] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Indexed: 06/15/2024]
Abstract
BACKGROUND Machine learning (ML) is increasingly used to predict the prognosis of numerous diseases. This retrospective analysis aimed to develop a prediction model using ML algorithms and to identify predictors associated with the recurrence of hallux valgus (HV) following surgery. METHODS A total of 198 symptomatic feet that underwent chevron osteotomy combined with a distal soft tissue procedure were enrolled and analyzed from 2 independent medical centers. The feet were grouped according to nonrecurrence or recurrence based on 1-year follow-up outcomes. Preoperative weightbearing radiographs and immediate postoperative nonweightbearing radiographs were obtained for each HV foot. Radiographic measurements (eg, HV angle and intermetatarsal angle) were acquired and used for ML model training. A total of 9 commonly used ML models were trained on the data obtained from one institute (108 feet), and tested on the other data set from another independent institute (90 feet) for external validation. Optimal feature sets for each model were identified based on a 2000-resample bootstrap-based internal validation via an exhaustive search. The performance of each model was then tested on the external validation set. The area under the curve (AUC), classification accuracy, sensitivity, and specificity of each model were calculated to evaluate the performance of each model. RESULTS The support vector machine (SVM) model showed the highest predictive accuracy compared to other methods, with an AUC of 0.88 and an accuracy of 75.6%. Preoperative hallux valgus angle, tibial sesamoid position, postoperative intermetatarsal angle, and postoperative tibial sesamoid position were identified as the most selected features by several ML models. CONCLUSION ML classifiers such as SVM could predict the recurrence of HV (an HVA >20 degrees) at a 1-year follow-up while identifying associated predictors in a multivariate manner. This study holds the potential for foot and ankle surgeons to effectively identify individuals at higher risk of HV recurrence postsurgery.
Collapse
Affiliation(s)
- Rui Zhao
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Guobin Wang
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Fengtan Li
- Department of Radiology, Tianjin Medical University General Hospital, Tianjin, China
| | - Jinchan Wang
- Department of Dermatology, Tianjin Medical University General Hospital, Tianjin, China
| | - Yuan Zhang
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Dong Li
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Shen Liu
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Jie Li
- Graduate School, Tianjin Medical University, Tianjin, China
| | - Jiajun Song
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| | - Fangyuan Wei
- Department of Hand and Foot Surgery, Beijing University of Chinese Medicine Third Affiliated Hospital, Beijing, China
- Engineering Research Center of Chinese Orthopaedic and Sports Rehabilitation Artificial Intelligent, Ministry of Education, Beijing, China
| | - Chenguang Wang
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin, China
| |
Collapse
|
4
|
Gallegos M, Isamura BK, Popelier PLA, Martín Pendás Á. An Unsupervised Machine Learning Approach for the Automatic Construction of Local Chemical Descriptors. J Chem Inf Model 2024; 64:3059-3079. [PMID: 38498942 PMCID: PMC11040729 DOI: 10.1021/acs.jcim.3c01906] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/27/2023] [Revised: 03/06/2024] [Accepted: 03/07/2024] [Indexed: 03/20/2024]
Abstract
Condensing the many physical variables defining a chemical system into a fixed-size array poses a significant challenge in the development of chemical Machine Learning (ML). Atom Centered Symmetry Functions (ACSFs) offer an intuitive featurization approach by means of a tedious and labor-intensive selection of tunable parameters. In this work, we implement an unsupervised ML strategy relying on a Gaussian Mixture Model (GMM) to automatically optimize the ACSF parameters. GMMs effortlessly decompose the vastness of the chemical and conformational spaces into well-defined radial and angular clusters, which are then used to build tailor-made ACSFs. The unsupervised exploration of the space has demonstrated general applicability across a diverse range of systems, spanning from various unimolecular landscapes to heterogeneous databases. The impact of the sampling technique and temperature on space exploration is also addressed, highlighting the particularly advantageous role of high-temperature Molecular Dynamics (MD) simulations. The reliability of the resulting features is assessed through the estimation of the atomic charges of a prototypical capped amino acid and a heterogeneous collection of CHON molecules. The automatically constructed ACSFs serve as high-quality descriptors, consistently yielding typical prediction errors below 0.010 electrons bound for the reported atomic charges. Altering the spatial distribution of the functions with respect to the cluster highlights the critical role of symmetry rupture in achieving significantly improved features. More specifically, using two separate functions to describe the lower and upper tails of the cluster results in the best performing models with errors as low as 0.006 electrons. Finally, the effectiveness of finely tuned features was checked across different architectures, unveiling the superior performance of Gaussian Process (GP) models over Feed Forward Neural Networks (FFNNs), particularly in low-data regimes, with nearly a 2-fold increase in prediction quality. Altogether, this approach paves the way toward an easier construction of local chemical descriptors, while providing valuable insights into how radial and angular spaces should be mapped. Finally, this work opens the possibility of encoding many-body information beyond angular terms into upcoming ML features.
Collapse
Affiliation(s)
- Miguel Gallegos
- Department
of Analytical and Physical Chemistry, University
of Oviedo, Oviedo E-33006, Spain
| | | | - Paul L. A. Popelier
- Department
of Chemistry, The University of Manchester, Oxford Road, Manchester M13 9PL, U.K.
| | - Ángel Martín Pendás
- Department
of Analytical and Physical Chemistry, University
of Oviedo, Oviedo E-33006, Spain
| |
Collapse
|
5
|
Manzhos S, Ihara M. Degeneration of kernel regression with Matern kernels into low-order polynomial regression in high dimension. J Chem Phys 2024; 160:021101. [PMID: 38189605 DOI: 10.1063/5.0187867] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/16/2023] [Accepted: 12/17/2023] [Indexed: 01/09/2024] Open
Abstract
Kernel methods such as kernel ridge regression and Gaussian process regression with Matern-type kernels have been increasingly used, in particular, to fit potential energy surfaces (PES) and density functionals, and for materials informatics. When the dimensionality of the feature space is high, these methods are used with necessarily sparse data. In this regime, the optimal length parameter of a Matern-type kernel may become so large that the method effectively degenerates into a low-order polynomial regression and, therefore, loses any advantage over such regression. This is demonstrated theoretically as well as numerically in the examples of six- and fifteen-dimensional molecular PES using squared exponential and simple exponential kernels. The results shed additional light on the success of polynomial approximations such as PIP for medium-size molecules and on the importance of orders-of-coupling-based models for preserving the advantages of kernel methods with Matern-type kernels of on the use of physically motivated (reproducing) kernels.
Collapse
Affiliation(s)
- Sergei Manzhos
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan
| | - Manabu Ihara
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan
| |
Collapse
|
6
|
Song J, Li J, Zhao R, Chu X. Developing predictive models for surgical outcomes in patients with degenerative cervical myelopathy: a comparison of statistical and machine learning approaches. Spine J 2024; 24:57-67. [PMID: 37531977 DOI: 10.1016/j.spinee.2023.07.021] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 05/15/2023] [Revised: 07/16/2023] [Accepted: 07/26/2023] [Indexed: 08/04/2023]
Abstract
BACKGROUND CONTEXT Machine learning (ML) is widely used to predict the prognosis of numerous diseases. PURPOSE This retrospective analysis aimed to develop a prognostic prediction model using ML algorithms and identify predictors associated with poor surgical outcomes in patients with degenerative cervical myelopathy (DCM). STUDY DESIGN A retrospective study. PATIENT SAMPLE A total of 406 symptomatic DCM patients who underwent surgical decompression were enrolled and analyzed from three independent medical centers. OUTCOME MEASURES We calculated the area under the curve (AUC), classification accuracy, sensitivity, and specificity of each model. METHODS The Japanese Orthopedic Association (JOA) score was obtained before and 1 year following decompression surgery, and patients were grouped into good and poor outcome groups based on a cut-off value of 60% based on a previous study. Two datasets were fused for training, 1 dataset was held out as an external validation set. Optimal feature-subset and hyperparameters for each model were adjusted based on a 2,000-resample bootstrap-based internal validation via exhaustive search and grid search. The performance of each model was then tested on the external validation set. RESULTS The Support Vector Machine (SVM) model showed the highest predictive accuracy compared to other methods, with an AUC of 0.82 and an accuracy of 75.7%. Age, sex, disease duration, and preoperative JOA score were identified as the most commonly selected features by both the ML and statistical models. Grid search optimization for hyperparameters successfully enhanced the predictive performance of each ML model, and the SVM model still had the best performance with an AUC of 0.93 and an accuracy of 86.4%. CONCLUSIONS Overall, the study demonstrated that ML classifiers such as SVM can effectively predict surgical outcomes for patients with DCM while identifying associated predictors in a multivariate manner.
Collapse
Affiliation(s)
- Jiajun Song
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin 300052, China
| | - Jie Li
- Department of Minimally Invasive Spine Surgery, Tianjin Hospital, Tianjin 300211, China
| | - Rui Zhao
- Department of Orthopedic Surgery, Tianjin Medical University General Hospital, Tianjin 300052, China
| | - Xu Chu
- Department of Orthopedic Surgery, Honghui Hospital, Xi'an Jiaotong University, Xi'an 710054, China.
| |
Collapse
|
7
|
Manzhos S, Lüder J, Ihara M. Machine learning of kinetic energy densities with target and feature smoothing: Better results with fewer training data. J Chem Phys 2023; 159:234115. [PMID: 38112506 DOI: 10.1063/5.0175689] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/08/2023] [Accepted: 11/27/2023] [Indexed: 12/21/2023] Open
Abstract
Machine learning (ML) of kinetic energy functionals (KEFs), in particular kinetic energy density (KED) functionals, is a promising way to construct KEFs for orbital-free density functional theory (DFT). Neural networks and kernel methods including Gaussian process regression (GPR) have been used to learn Kohn-Sham (KS) KED from density-based descriptors derived from KS DFT calculations. The descriptors are typically expressed as functions of different powers and derivatives of the electron density. This can generate large and extremely unevenly distributed datasets, which complicates effective application of ML techniques. Very uneven data distributions require many training datapoints, can cause overfitting, and can ultimately lower the quality of an ML KED model. We show that one can produce more accurate ML models from fewer data by working with smoothed density-dependent variables and KED. Smoothing palliates the issue of very uneven data distributions and associated difficulties of sampling while retaining enough spatial structure necessary for working within the paradigm of KEDF. We use GPR as a function of smoothed terms of the fourth order gradient expansion and KS effective potential and obtain accurate and stable (with respect to different random choices of training points) kinetic energy models for Al, Mg, and Si simultaneously from as few as 2000 samples (about 0.3% of the total KS DFT data). In particular, accuracies on the order of 1% in a measure of the quality of energy-volume dependence B'=EV0-ΔV-2EV0+E(V0+ΔV)ΔV/V02 (where V0 is the equilibrium volume and ΔV is a deviation from it) are obtained simultaneously for all three materials.
Collapse
Affiliation(s)
- Sergei Manzhos
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan
| | - Johann Lüder
- Department of Materials and Optoelectronic Science, National Sun Yat-sen University, No. 70, Lien-Hai Road, Kaohsiung 80424, Taiwan
- Center of Crystal Research, National Sun Yat-sen University, No. 70, Lien-Hai Road, Kaohsiung 80424, Taiwan
- Center for Theoretical and Computational Physics, National Sun Yat-sen University, Kaohsiung 80424, Taiwan
| | - Manabu Ihara
- School of Materials and Chemical Technology, Tokyo Institute of Technology, Ookayama 2-12-1, Meguro-ku, Tokyo 152-8552, Japan
| |
Collapse
|