1
|
Ravutla S, Bai A, Realff MJ, Boukouvala F. Effects of Surrogate Hybridization and Adaptive Sampling for Simulation-Based Optimization. Ind Eng Chem Res 2025; 64:9228-9251. [PMID: 40351754 PMCID: PMC12063061 DOI: 10.1021/acs.iecr.4c03303] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/01/2024] [Revised: 03/28/2025] [Accepted: 03/31/2025] [Indexed: 05/14/2025]
Abstract
Process simulators are essential for modeling of complex processes; however, optimization of expensive models remains challenging due to lack of equations, simulation cost, and lack of convergence guarantees. To tackle these challenges, surrogate modeling and surrogate-based optimization methods have been proposed. Most commonly, surrogates are treated as black-box models, while recently hybrid surrogates have gained popularity. In this work, we assess two main methodologies: (a) optimization of surrogates trained using a set of fixed a priori samples using deterministic solvers, and (b) adaptive sampling-based optimization, which leverages surrogate predictions to guide the search process. Across both methods, we systematically compare the effect of black-box versus hybrid surrogates, that utilize a "model-correction" architecture combining different fidelity data. Through mathematical benchmarks with up to ten dimensions, and two engineering case studies for process design of an extractive distillation simulation model and an adsorption simulation model, we present the effects of sampling quantity, dimensionality, formulation, and hybridization on solution convergence, reliability, and CPU efficiency. Our results show that hybrid modeling improves surrogate robustness and reduces solution variability with fewer samples, though it increases optimization costs. Additionally, adaptive sampling methods are more efficient and consistent than fixed-sampling surrogate strategies, even across different sampling and dimensionality scenarios.
Collapse
Affiliation(s)
- Suryateja Ravutla
- Department of Chemical and
Biomolecular Engineering, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
| | - Andrew Bai
- Department of Chemical and
Biomolecular Engineering, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
| | - Matthew J. Realff
- Department of Chemical and
Biomolecular Engineering, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
| | - Fani Boukouvala
- Department of Chemical and
Biomolecular Engineering, Georgia Institute
of Technology, Atlanta, Georgia 30332, United States
| |
Collapse
|
2
|
Wijerathne ADHT, Joardder MUH, Welsh ZG, Nayak R, Sablani SS, Karim A. Recent Advances in Food Drying Modeling: Empirical to Multiscale Physics-Informed Neural Networks. Compr Rev Food Sci Food Saf 2025; 24:e70194. [PMID: 40371462 DOI: 10.1111/1541-4337.70194] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/08/2025] [Revised: 04/19/2025] [Accepted: 04/25/2025] [Indexed: 05/16/2025]
Abstract
Food insecurity is a major global challenge. Food preservation, particularly through drying, presents a promising solution to enhance food security and minimize waste. Fruits and vegetables contain 80%-90% water, and much of this is removed during drying. However, structural changes across multiple length scales occur during drying, compromising stability and affecting quality. Understanding these changes is essential, and several modeling techniques exist to analyze them, including empirical modeling, physics-based computational methods, purely data-driven machine learning approaches, and physics-informed neural network (PINN) models. Although empirical methods are straightforward to implement, their limited generalizability and lack of physical insights have led to the development of physics-based computational methods. These methods can achieve high spatiotemporal resolution without requiring experimental investigations. However, their complexity and high computational costs have prompted the exploration of data-driven machine learning models for drying processes, which involve comparatively lower computational costs and are more straightforward to execute. Nonetheless, their poor predictive ability with sparse data has restricted their application, leading to a hybrid modeling approach: PINN, which merges physical insights with data-driven machine learning techniques. This method still holds significant potential for advancements in food drying modeling. Therefore, this study aims to conduct a comprehensive literature review of state-of-the-art conventional drying modeling techniques, such as empirical, physics-based computational, and pure data-driven machine learning techniques, and explores the potential of the PINN approach for overcoming the limitations associated with conventional drying modeling strategies.
Collapse
Affiliation(s)
- Aluth Durage Hiruni Tharaka Wijerathne
- School of Mechanical, Medical, and Process Engineering, Faculty of Engineering, Queensland University of Technology (QUT), Brisbane, Queensland, Australia
| | - Mohammad U H Joardder
- School of Mechanical, Medical, and Process Engineering, Faculty of Engineering, Queensland University of Technology (QUT), Brisbane, Queensland, Australia
- Rajshahi University of Engineering and Technology, Rajshahi, Bangladesh
| | - Zachary G Welsh
- School of Mechanical, Medical, and Process Engineering, Faculty of Engineering, Queensland University of Technology (QUT), Brisbane, Queensland, Australia
| | - Richi Nayak
- School of Computer Science, Faculty of Science, Queensland University of Technology (QUT), Brisbane, Queensland, Australia
| | - Shyam S Sablani
- Department of Biological Systems Engineering, Washington State University, Pullman, Washington, USA
| | - Azharul Karim
- School of Mechanical, Medical, and Process Engineering, Faculty of Engineering, Queensland University of Technology (QUT), Brisbane, Queensland, Australia
| |
Collapse
|
3
|
Wu W, Daneker M, Turner KT, Jolley MA, Lu L. Identifying Heterogeneous Micromechanical Properties of Biological Tissues via Physics-Informed Neural Networks. SMALL METHODS 2025; 9:e2400620. [PMID: 39091065 PMCID: PMC11747890 DOI: 10.1002/smtd.202400620] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/29/2024] [Revised: 07/19/2024] [Indexed: 08/04/2024]
Abstract
The heterogeneous micromechanical properties of biological tissues have profound implications across diverse medical and engineering domains. However, identifying full-field heterogeneous elastic properties of soft materials using traditional engineering approaches is fundamentally challenging due to difficulties in estimating local stress fields. Recently, there has been a growing interest in data-driven models for learning full-field mechanical responses, such as displacement and strain, from experimental or synthetic data. However, research studies on inferring full-field elastic properties of materials, a more challenging problem, are scarce, particularly for large deformation, hyperelastic materials. Here, a physics-informed machine learning approach is proposed to identify the elasticity map in nonlinear, large deformation hyperelastic materials. This study reports the prediction accuracies and computational efficiency of physics-informed neural networks (PINNs) in inferring the heterogeneous elasticity maps across materials with structural complexity that closely resemble real tissue microstructure, such as brain, tricuspid valve, and breast cancer tissues. Further, the improved architecture is applied to three hyperelastic constitutive models: Neo-Hookean, Mooney Rivlin, and Gent. The improved network architecture consistently produces accurate estimations of heterogeneous elasticity maps, even when there is up to 10% noise present in the training data.
Collapse
Affiliation(s)
- Wensi Wu
- Department of Anesthesiology and Critical Care Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
- Division of Cardiology, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Mitchell Daneker
- Department of Statistics and Data Science, Yale University, New Haven, CT, 06511, USA
- Department of Chemical and Biochemical Engineering, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Kevin T Turner
- Department of Mechanical Engineering and Applied Mechanics, University of Pennsylvania, Philadelphia, PA, 19104, USA
| | - Matthew A Jolley
- Department of Anesthesiology and Critical Care Medicine, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
- Division of Cardiology, Children's Hospital of Philadelphia, Philadelphia, PA, 19104, USA
| | - Lu Lu
- Department of Statistics and Data Science, Yale University, New Haven, CT, 06511, USA
- Wu Tsai Institute, Yale University, New Haven, CT, 06510, USA
| |
Collapse
|
4
|
Molnar SM, Godfrey J, Song B. Balance equations for physics-informed machine learning. Heliyon 2024; 10:e38799. [PMID: 39654737 PMCID: PMC11626790 DOI: 10.1016/j.heliyon.2024.e38799] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/09/2024] [Revised: 09/30/2024] [Accepted: 09/30/2024] [Indexed: 12/12/2024] Open
Abstract
Using traditional machine learning (ML) methods may produce results that are inconsistent with the laws of physics. In contrast, physics-based models of complex physical, biological, or engineering systems incorporate the laws of physics as constraints on ML methods by introducing loss terms, ensuring that the results are consistent with these laws. However, accurately deriving the nonlinear and high order differential equations to enforce various complex physical laws is non-trivial. There is a lack of comprehensive guidance on the formulation of residual loss terms. To address this challenge, this paper proposes a new framework based on the balance equations, which aims to advance the development of PIML across multiple domains by providing a systematic approach to constructing residual loss terms that maintain the physical integrity of PDE solutions. The proposed balance equation method offers a unified treatment of all the fundamental equations of classical physics used in models of mechanical, electrical, and chemical systems and guides the derivation of differential equations for embedding physical laws in ML models. We show that all of these equations can be derived from a single equation known as the generic balance equation, in conjunction with specific constitutive relations that bind the balance equation to a particular domain. We also provide a few simple worked examples how to use our balance equation method in practice for PIML. Our approach suggests that a single framework can be followed to incorporate physics into ML models. This level of generalization may provide the basis for more efficient methods of developing physics-based ML for complex systems.
Collapse
Affiliation(s)
- Sandor M. Molnar
- Institute of Astronomy and Astrophysics, Academia Sinica, Taipei, Taiwan, Republic of China
| | | | | |
Collapse
|
5
|
Qian Y, Zhu G, Zhang Z, Modepalli S, Zheng Y, Zheng X, Frydman G, Li H. Coagulo-Net: Enhancing the mathematical modeling of blood coagulation using physics-informed neural networks. Neural Netw 2024; 180:106732. [PMID: 39305783 PMCID: PMC11578045 DOI: 10.1016/j.neunet.2024.106732] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/17/2024] [Revised: 08/30/2024] [Accepted: 09/10/2024] [Indexed: 11/14/2024]
Abstract
Blood coagulation, which involves a group of complex biochemical reactions, is a crucial step in hemostasis to stop bleeding at the injury site of a blood vessel. Coagulation abnormalities, such as hypercoagulation and hypocoagulation, could either cause thrombosis or hemorrhage, resulting in severe clinical consequences. Mathematical models of blood coagulation have been widely used to improve the understanding of the pathophysiology of coagulation disorders, guide the design and testing of new anticoagulants or other therapeutic agents, and promote precision medicine. However, estimating the parameters in these coagulation models has been challenging as not all reaction rate constants and new parameters derived from model assumptions are measurable. Although various conventional methods have been employed for parameter estimation for coagulation models, the existing approaches have several shortcomings. Inspired by the physics-informed neural networks, we propose Coagulo-Net, which synergizes the strengths of deep neural networks with the mechanistic understanding of the blood coagulation processes to enhance the mathematical models of the blood coagulation cascade. We assess the performance of the Coagulo-Net using two existing coagulation models with different extents of complexity. Our simulation results illustrate that Coagulo-Net can efficiently infer the unknown model parameters and dynamics of species based on sparse measurement data and data contaminated with noise. In addition, we show that Coagulo-Net can process a mixture of synthetic and experimental data and refine the predictions of existing mathematical models of coagulation. These results demonstrate the promise of Coagulo-Net in enhancing current coagulation models and aiding the creation of novel models for physiological and pathological research. These results showcase the potential of Coagulo-Net to advance computational modeling in the study of blood coagulation, improving both research methodologies and the development of new therapies for treating patients with coagulation disorders.
Collapse
Affiliation(s)
- Ying Qian
- School of Chemical, Materials and Biomedical Engineering, University of Georgia, Athens, USA
| | - Ge Zhu
- Department of Biomedical Engineering, Worcester Polytechnic Institute, Worcester, USA
| | - Zhen Zhang
- Division of Applied Mathematics, Brown University, Providence, RI, USA
| | | | - Yihao Zheng
- Department of Mechanical and Material Engineering, Worcester Polytechnic Institute, Worcester, USA
| | - Xiaoning Zheng
- Department of Mathematics, College of Information Science & Technology, Jinan University, Guangzhou, Guangdong, 510632, China
| | - Galit Frydman
- Division of Trauma, Emergency Surgery and Surgical Critical Care at the Massachusetts General Hospital, Boston, MA, USA; Division of Comparative Medicine, Department of Biological Engineering, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - He Li
- School of Chemical, Materials and Biomedical Engineering, University of Georgia, Athens, USA.
| |
Collapse
|
6
|
Wu W, Daneker M, Turner KT, Jolley MA, Lu L. Identifying heterogeneous micromechanical properties of biological tissues via physics-informed neural networks. ARXIV 2024:arXiv:2402.10741v3. [PMID: 38745694 PMCID: PMC11092874] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Subscribe] [Scholar Register] [Indexed: 05/16/2024]
Abstract
The heterogeneous micromechanical properties of biological tissues have profound implications across diverse medical and engineering domains. However, identifying full-field heterogeneous elastic properties of soft materials using traditional engineering approaches is fundamentally challenging due to difficulties in estimating local stress fields. Recently, there has been a growing interest in using data-driven models to learn full-field mechanical responses such as displacement and strain from experimental or synthetic data. However, research studies on inferring full-field elastic properties of materials, a more challenging problem, are scarce, particularly for large deformation, hyperelastic materials. Here, we propose a physics-informed machine learning approach to identify the elasticity map in nonlinear, large deformation hyperelastic materials. We evaluate the prediction accuracies and computational efficiency of physics-informed neural networks (PINNs) by inferring the heterogeneous elasticity maps across three materials with structural complexity that closely resemble real tissue patterns, such as brain tissue and tricuspid valve tissue. We further applied our improved architecture to three additional examples of breast cancer tissue and extended our analysis to three hyperelastic constitutive models: Neo-Hookean, Mooney Rivlin, and Gent. Our selected network architecture consistently produced highly accurate estimations of heterogeneous elasticity maps, even when there was up to 10% noise present in the training data.
Collapse
Affiliation(s)
- Wensi Wu
- Department of Anesthesiology and Critical Care Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Division of Cardiology, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
| | - Mitchell Daneker
- Department of Statistics and Data Science, Yale University, New Haven, CT 06511
- Department of Chemical and Biochemical Engineering, University of Pennsylvania, Philadelphia, PA 19104
| | - Kevin T. Turner
- Department of Mechanical Engineering and Applied Mechanics, University of Pennsylvania, Philadelphia, PA 19104
| | - Matthew A. Jolley
- Department of Anesthesiology and Critical Care Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
- Division of Cardiology, Children’s Hospital of Philadelphia, Philadelphia, PA 19104
| | - Lu Lu
- Department of Statistics and Data Science, Yale University, New Haven, CT 06511
| |
Collapse
|
7
|
Song S, Jin H. Identifying constitutive parameters for complex hyperelastic materials using physics-informed neural networks. SOFT MATTER 2024. [PMID: 38954481 DOI: 10.1039/d4sm00001c] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 07/04/2024]
Abstract
Identifying constitutive parameters in engineering and biological materials, particularly those with intricate geometries and mechanical behaviors, remains a longstanding challenge. The recent advent of physics-informed neural networks (PINNs) offers promising solutions, but current frameworks are often limited to basic constitutive laws and encounter practical constraints when combined with experimental data. In this paper, we introduce a robust PINN-based framework designed to identify material parameters for soft materials, specifically those exhibiting complex constitutive behaviors, under large deformation in plane stress conditions. Distinctively, our model emphasizes training PINNs with multi-modal synthetic experimental datasets consisting of full-field deformation and loading history, ensuring algorithm robustness even with noisy data. Our results reveal that the PINNs framework can accurately identify constitutive parameters of the incompressible Arruda-Boyce model for samples with intricate geometries, maintaining an error below 5%, even with an experimental noise level of 5%. We believe our framework provides a robust modulus identification approach for complex solids, especially for those with geometrical and constitutive complexity.
Collapse
Affiliation(s)
- Siyuan Song
- School of Engineering, Brown University, Providence, RI 02912, USA.
| | - Hanxun Jin
- School of Engineering, Brown University, Providence, RI 02912, USA.
| |
Collapse
|
8
|
Henkes A, Eshraghian JK, Wessels H. Spiking neural networks for nonlinear regression. ROYAL SOCIETY OPEN SCIENCE 2024; 11:231606. [PMID: 38699557 PMCID: PMC11062414 DOI: 10.1098/rsos.231606] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Revised: 01/25/2024] [Accepted: 02/12/2024] [Indexed: 05/05/2024]
Abstract
Spiking neural networks (SNN), also often referred to as the third generation of neural networks, carry the potential for a massive reduction in memory and energy consumption over traditional, second-generation neural networks. Inspired by the undisputed efficiency of the human brain, they introduce temporal and neuronal sparsity, which can be exploited by next-generation neuromorphic hardware. Energy efficiency plays a crucial role in many engineering applications, for instance, in structural health monitoring. Machine learning in engineering contexts, especially in data-driven mechanics, focuses on regression. While regression with SNN has already been discussed in a variety of publications, in this contribution, we provide a novel formulation for its accuracy and energy efficiency. In particular, a network topology for decoding binary spike trains to real numbers is introduced, using the membrane potential of spiking neurons. Several different spiking neural architectures, ranging from simple spiking feed-forward to complex spiking long short-term memory neural networks, are derived. Since the proposed architectures do not contain any dense layers, they exploit the full potential of SNN in terms of energy efficiency. At the same time, the accuracy of the proposed SNN architectures is demonstrated by numerical examples, namely different material models. Linear and nonlinear, as well as history-dependent material models, are examined. While this contribution focuses on mechanical examples, the interested reader may regress any custom function by adapting the published source code.
Collapse
Affiliation(s)
- Alexander Henkes
- Computational Mechanics Group, ETH Zurich, Zurich, Switzerland
- Division Data-Driven Modeling of Mechanical Systems, Technical University Braunschweig, Braunschweig, Germany
| | - Jason K. Eshraghian
- Department of Electrical and Computer Engineering, University of California, Santa Cruz, CA, USA
| | - Henning Wessels
- Division Data-Driven Modeling of Mechanical Systems, Technical University Braunschweig, Braunschweig, Germany
| |
Collapse
|
9
|
Tsai P, Edison J, Wang C, Sefton J, Manning KQ, Gramlich MW. Myofascial trigger point (MTrP) size and elasticity properties can be used to differentiate characteristics of MTrPs in lower back skeletal muscle. Sci Rep 2024; 14:7562. [PMID: 38555353 PMCID: PMC10981696 DOI: 10.1038/s41598-024-57733-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/14/2023] [Accepted: 03/21/2024] [Indexed: 04/02/2024] Open
Abstract
Myofascial trigger points (MTrPs) are localized contraction knots that develop after muscle overuse or an acute trauma. Significant work has been done to understand, diagnose, and treat MTrPs in order to improve patients suffering from their effects. However, effective non-invasive diagnostic tools are still a missing gap in both understanding and treating MTrPs. Effective treatments for patients suffering from MTrP mediated pain require a means to measure MTrP properties quantitatively and diagnostically both prior to and during intervention. Further, quantitative measurements of MTrPs are often limited by the availability of equipment and training. Here we develop ultrasound (US) based diagnostic metrics that can be used to distinguish the biophysical properties of MTrPs, and show how those metrics can be used by clinicians during patient diagnosis and treatment. We highlight the advantages and limitations of previous US-based approaches that utilize elasticity theory. To overcome these previous limitations, we use a hierarchical approach to distinguish MTrP properties by patients' reported pain and clinician measured palpation. We show how US-based measurements can characterize MTrPs with this approach. We demonstrate that MTrPs tend to be smaller, stiffer, and deeper in the muscle tissue for patients with pain compared to patients without pain. We provide evidence that more than one MTrP within a single US-image field increases the stiffness of neighboring MTrPs. Finally, we highlight a combination of metrics (depth, thickness, and stiffness) that can be used by clinicians to evaluate individual MTrPs in combination with standard clinical assessments.
Collapse
Affiliation(s)
- P Tsai
- College of Nursing, Auburn University, Auburn, AL, USA
| | - J Edison
- Edward Via College of Osteopathic Medicine, Auburn University, Auburn, AL, USA
| | - C Wang
- Department of Educational Foundations, Leadership, and Technology, College of Education, Auburn University, Auburn, AL, USA
| | - J Sefton
- School of Kinesiology, College of Education, Auburn University, Auburn, AL, USA
| | - K Q Manning
- Department of Educational Foundations, Leadership, and Technology, College of Education, Auburn University, Auburn, AL, USA
| | - M W Gramlich
- Department of Physics, College of Science and Mathematics, Auburn University, Auburn, AL, USA.
| |
Collapse
|
10
|
Zhang Z, Lin C, Wang B. Physics-informed shape optimization using coordinate projection. Sci Rep 2024; 14:6537. [PMID: 38503891 PMCID: PMC10951326 DOI: 10.1038/s41598-024-57137-4] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 01/03/2024] [Accepted: 03/14/2024] [Indexed: 03/21/2024] Open
Abstract
The rapid growth of artificial intelligence is revolutionizing classical engineering society, offering novel approaches to material and structural design and analysis. Among various scientific machine learning techniques, physics-informed neural network (PINN) has been one of the most researched subjects, for its ability to incorporate physics prior knowledge into model training. However, the intrinsic continuity requirement of PINN demands the adoption of domain decomposition when multiple materials with distinct properties exist. This greatly complicates the gradient computation of design features, restricting the application of PINN to structural shape optimization. To address this, we present a novel framework that employs neural network coordinate projection for shape optimization within PINN. This technique allows for direct mapping from a standard shape to its optimal counterpart, optimizing the design objective without the need for traditional transition functions or the definition of intermediate material properties. Our method demonstrates a high degree of adaptability, allowing the incorporation of diverse constraints and objectives directly as training penalties. The proposed approach is tested on magnetostatic problems for iron core shape optimization, a scenario typically plagued by the high permeability contrast between materials. Validation with finite-element analysis confirms the accuracy and efficiency of our approach. The results highlight the framework's capability as a viable tool for shape optimization in complex material design tasks.
Collapse
Affiliation(s)
- Zhizhou Zhang
- Mitsubishi Electric Research Laboratories, 201 Broadway, 8th Floor, Cambridge, MA, 02139-1955, USA
| | - Chungwei Lin
- Mitsubishi Electric Research Laboratories, 201 Broadway, 8th Floor, Cambridge, MA, 02139-1955, USA
| | - Bingnan Wang
- Mitsubishi Electric Research Laboratories, 201 Broadway, 8th Floor, Cambridge, MA, 02139-1955, USA.
| |
Collapse
|
11
|
WU W, DANEKER M, JOLLEY MA, TURNER KT, LU L. Effective data sampling strategies and boundary condition constraints of physics-informed neural networks for identifying material properties in solid mechanics. APPLIED MATHEMATICS AND MECHANICS 2023; 44:1039-1068. [PMID: 37501681 PMCID: PMC10373631 DOI: 10.1007/s10483-023-2995-8] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 11/28/2022] [Revised: 05/09/2023] [Indexed: 07/29/2023]
Abstract
Material identification is critical for understanding the relationship between mechanical properties and the associated mechanical functions. However, material identification is a challenging task, especially when the characteristic of the material is highly nonlinear in nature, as is common in biological tissue. In this work, we identify unknown material properties in continuum solid mechanics via physics-informed neural networks (PINNs). To improve the accuracy and efficiency of PINNs, we develop efficient strategies to nonuniformly sample observational data. We also investigate different approaches to enforce Dirichlet-type boundary conditions (BCs) as soft or hard constraints. Finally, we apply the proposed methods to a diverse set of time-dependent and time-independent solid mechanic examples that span linear elastic and hyperelastic material space. The estimated material parameters achieve relative errors of less than 1%. As such, this work is relevant to diverse applications, including optimizing structural integrity and developing novel materials.
Collapse
Affiliation(s)
- W. WU
- Department of Anesthesiology and Critical Care Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, U. S. A
- Division of Pediatric Cardiology, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, U. S. A
| | - M. DANEKER
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA 19104, U. S. A
| | - M. A. JOLLEY
- Department of Anesthesiology and Critical Care Medicine, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, U. S. A
- Division of Pediatric Cardiology, Children’s Hospital of Philadelphia, Philadelphia, PA 19104, U. S. A
| | - K. T. TURNER
- Department of Mechanical Engineering and Applied Mechanics, University of Pennsylvania, Philadelphia, PA 19104, U. S. A
| | - L. LU
- Department of Chemical and Biomolecular Engineering, University of Pennsylvania, Philadelphia, PA 19104, U. S. A
| |
Collapse
|
12
|
Yang Z, Buehler MJ. Fill in the Blank: Transferrable Deep Learning Approaches to Recover Missing Physical Field Information. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2023; 35:e2301449. [PMID: 36934303 DOI: 10.1002/adma.202301449] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/14/2023] [Indexed: 06/09/2023]
Abstract
Solving materials engineering tasks is often hindered by limited information, such as in inverse problems with only boundary data information or design tasks with a simple objective but a vast search space. To address these challenges, multiple deep learning (DL) architectures are leveraged to predict missing mechanical information given limited known data in part of the domain, and further characterize the composite geometries from the recovered mechanical fields for 2D and 3D complex microstructures. In 2D, a conditional generative adversarial network (GAN) is utilized to complete partially masked field maps and predict the composite geometry with convolutional models with great accuracy and generality by making precise predictions on field data with mixed stress/strain components, hierarchical geometries, distinct materials properties and various types of microstructures including ill-posed inverse problems. In 3D, a Transformer-based architecture is implemented to predict complete 3D mechanical fields from input field snapshots. The model manifests excellent performance regardless of microstructural complexity and recovers the entire bulk field even from a single surface field image, allowing internal structural characterization from only boundary measurements. The whole frameworks provide efficient ways for analysis and design with incomplete information and allow the direct inverse translation from properties back to materials structures.
Collapse
Affiliation(s)
- Zhenze Yang
- Laboratory for Atomistic and Molecular Mechanics (LAMM), Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, MA, 02139, USA
- Department of Materials Science and Engineering, Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, MA, 02139, USA
| | - Markus J Buehler
- Laboratory for Atomistic and Molecular Mechanics (LAMM), Massachusetts Institute of Technology, 77 Massachusetts Ave., Cambridge, MA, 02139, USA
- Center for Computational Science and Engineering, Schwarzman College of Computing, Massachusetts Institute of Technology, 77 Massachusetts Ave, Cambridge, MA, 02139, USA
- Center for Materials Science and Engineering, 77 Massachusetts Ave, Cambridge, MA, 02139, USA
| |
Collapse
|
13
|
Roy AM, Bose R, Sundararaghavan V, Arróyave R. Deep learning-accelerated computational framework based on Physics Informed Neural Network for the solution of linear elasticity. Neural Netw 2023; 162:472-489. [PMID: 36966712 DOI: 10.1016/j.neunet.2023.03.014] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.5] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/05/2022] [Revised: 02/07/2023] [Accepted: 03/08/2023] [Indexed: 03/16/2023]
Abstract
The paper presents an efficient and robust data-driven deep learning (DL) computational framework developed for linear continuum elasticity problems. The methodology is based on the fundamentals of the Physics Informed Neural Networks (PINNs). For an accurate representation of the field variables, a multi-objective loss function is proposed. It consists of terms corresponding to the residual of the governing partial differential equations (PDE), constitutive relations derived from the governing physics, various boundary conditions, and data-driven physical knowledge fitting terms across randomly selected collocation points in the problem domain. To this end, multiple densely connected independent artificial neural networks (ANNs), each approximating a field variable, are trained to obtain accurate solutions. Several benchmark problems including the Airy solution to elasticity and the Kirchhoff-Love plate problem are solved. Performance in terms of accuracy and robustness illustrates the superiority of the current framework showing excellent agreement with analytical solutions. The present work combines the benefits of the classical methods depending on the physical information available in analytical relations with the superior capabilities of the DL techniques in the data-driven construction of lightweight, yet accurate and robust neural networks. The models developed herein can significantly boost computational speed using minimal network parameters with easy adaptability in different computational platforms.
Collapse
Affiliation(s)
- Arunabha M Roy
- Department of Materials Science and Engineering, Texas A&M University, 3003 TAMU, College Station, TX 77843, USA.
| | - Rikhi Bose
- Mechanical Engineering, Johns Hopkins University, Baltimore, MD 21218, USA
| | - Veera Sundararaghavan
- Department of Aerospace Engineering, University of Michigan, Ann Arbor, MI 48109, USA
| | - Raymundo Arróyave
- Department of Materials Science and Engineering, Texas A&M University, 3003 TAMU, College Station, TX 77843, USA; Department of Mechanical Engineering, Texas A&M University, 3003 TAMU, College Station, TX 77843, USA
| |
Collapse
|
14
|
Fang Q, Mou X, Li S. A physics-informed neural network based on mixed data sampling for solving modified diffusion equations. Sci Rep 2023; 13:2491. [PMID: 36781943 PMCID: PMC9925766 DOI: 10.1038/s41598-023-29822-3] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/13/2022] [Accepted: 02/10/2023] [Indexed: 02/15/2023] Open
Abstract
We developed a physics-informed neural network based on a mixture of Cartesian grid sampling and Latin hypercube sampling to solve forward and backward modified diffusion equations. We optimized the parameters in the neural networks and the mixed data sampling by considering the squeeze boundary condition and the mixture coefficient, respectively. Then, we used a given modified diffusion equation as an example to demonstrate the efficiency of the neural network solver for forward and backward problems. The neural network results were compared with the numerical solutions, and good agreement with high accuracy was observed. This neural network solver can be generalized to other partial differential equations.
Collapse
Affiliation(s)
- Qian Fang
- Department of Physics, Wenzhou University, Wenzhou, 325035, Zhejiang, China
| | - Xuankang Mou
- Department of Physics, Wenzhou University, Wenzhou, 325035, Zhejiang, China
| | - Shiben Li
- Department of Physics, Wenzhou University, Wenzhou, 325035, Zhejiang, China.
| |
Collapse
|
15
|
Chen S, Xu N. Detecting Microstructural Criticality/Degeneracy through Hybrid Learning Strategies Trained by Molecular Dynamics Simulations. ACS APPLIED MATERIALS & INTERFACES 2023; 15:10193-10202. [PMID: 36757321 DOI: 10.1021/acsami.2c20218] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/18/2023]
Abstract
Efficient microstructure design can strongly accelerate the development of materials. However, the complexity of the microstructure-behavior relation renders the criticalities and degeneracies within the microstructure space highly possible. Criticality means that a slight microstructural change can lead to a dramatic transition in material behavior, while degeneracy means that very different microstructures may lead to similar behaviors. To investigate these microstructural characteristics of the fiber/matrix interface within composite materials, we have proposed a hybrid deep-learning-based framework by integrating the supervised feed-forward neural network and the unsupervised autoencoder, which are trained by the molecular dynamics (MD) simulation results. The well-trained model continuously maps the elemental density images within the interfacial area into a low-dimensional latent space. Assisted by the extracted latent features, we can easily detect the criticalities and degeneracies within the original microstructure space of the composite's interface. The predicted microstructural criticalities and degeneracies are validated by investigating their atomistic origins through MD simulations. The proposed framework can be employed for the interfacial microstructure design of composite materials by identifying certain interfacial microstructures that might lead to undesirable behaviors.
Collapse
Affiliation(s)
- Shaohua Chen
- College of Energy and Power Engineering, Nanjing University of Aeronautics and Astronautics, Nanjing 210016, China
| | - Nuo Xu
- School of Materials Science and Engineering, Harbin Institute of Technology, Harbin 150001, China
| |
Collapse
|
16
|
Yin M, Zhang E, Yu Y, Karniadakis GE. Interfacing finite elements with deep neural operators for fast multiscale modeling of mechanics problems. COMPUTER METHODS IN APPLIED MECHANICS AND ENGINEERING 2022; 402:115027. [PMID: 37384215 PMCID: PMC10300559 DOI: 10.1016/j.cma.2022.115027] [Citation(s) in RCA: 5] [Impact Index Per Article: 1.7] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 06/30/2023]
Abstract
Multiscale modeling is an effective approach for investigating multiphysics systems with largely disparate size features, where models with different resolutions or heterogeneous descriptions are coupled together for predicting the system's response. The solver with lower fidelity (coarse) is responsible for simulating domains with homogeneous features, whereas the expensive high-fidelity (fine) model describes microscopic features with refined discretization, often making the overall cost prohibitively high, especially for time-dependent problems. In this work, we explore the idea of multiscale modeling with machine learning and employ DeepONet, a neural operator, as an efficient surrogate of the expensive solver. DeepONet is trained offline using data acquired from the fine solver for learning the underlying and possibly unknown fine-scale dynamics. It is then coupled with standard PDE solvers for predicting the multiscale systems with new boundary/initial conditions in the coupling stage. The proposed framework significantly reduces the computational cost of multiscale simulations since the DeepONet inference cost is negligible, facilitating readily the incorporation of a plurality of interface conditions and coupling schemes. We present various benchmarks to assess the accuracy and efficiency, including static and time-dependent problems. We also demonstrate the feasibility of coupling of a continuum model (finite element methods, FEM) with a neural operator, serving as a surrogate of a particle system (Smoothed Particle Hydrodynamics, SPH), for predicting mechanical responses of anisotropic and hyperelastic materials. What makes this approach unique is that a well-trained over-parametrized DeepONet can generalize well and make predictions at a negligible cost.
Collapse
Affiliation(s)
- Minglang Yin
- Center for Biomedical Engineering, Brown University, Providence, RI, United States of America
- School of Engineering, Brown University, Providence, RI, United States of America
| | - Enrui Zhang
- Division of Applied Mathematics, Brown University, Providence, RI, United States of America
| | - Yue Yu
- Department of Mathematics, Lehigh University, Bethlehem, PA, United States of America
| | - George Em Karniadakis
- School of Engineering, Brown University, Providence, RI, United States of America
- Division of Applied Mathematics, Brown University, Providence, RI, United States of America
| |
Collapse
|
17
|
Zhang E, Spronck B, Humphrey JD, Karniadakis GE. G2Φnet: Relating genotype and biomechanical phenotype of tissues with deep learning. PLoS Comput Biol 2022; 18:e1010660. [PMID: 36315608 PMCID: PMC9668200 DOI: 10.1371/journal.pcbi.1010660] [Citation(s) in RCA: 1] [Impact Index Per Article: 0.3] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/03/2022] [Revised: 11/16/2022] [Accepted: 10/17/2022] [Indexed: 11/05/2022] Open
Abstract
Many genetic mutations adversely affect the structure and function of load-bearing soft tissues, with clinical sequelae often responsible for disability or death. Parallel advances in genetics and histomechanical characterization provide significant insight into these conditions, but there remains a pressing need to integrate such information. We present a novel genotype-to-biomechanical phenotype neural network (G2Φnet) for characterizing and classifying biomechanical properties of soft tissues, which serve as important functional readouts of tissue health or disease. We illustrate the utility of our approach by inferring the nonlinear, genotype-dependent constitutive behavior of the aorta for four mouse models involving defects or deficiencies in extracellular constituents. We show that G2Φnet can infer the biomechanical response while simultaneously ascribing the associated genotype by utilizing limited, noisy, and unstructured experimental data. More broadly, G2Φnet provides a powerful method and a paradigm shift for correlating genotype and biomechanical phenotype quantitatively, promising a better understanding of their interplay in biological tissues.
Collapse
Affiliation(s)
- Enrui Zhang
- Division of Applied Mathematics, Brown University, Providence, Rhode Island, United States of America
| | - Bart Spronck
- Department of Biomedical Engineering, Maastricht University, Maastricht, the Netherlands
| | - Jay D. Humphrey
- Department of Biomedical Engineering, Yale University, New Haven, Connecticut, United States of America
| | - George Em Karniadakis
- Division of Applied Mathematics, Brown University, Providence, Rhode Island, United States of America
- School of Engineering, Brown University, Providence, Rhode Island, United States of America
| |
Collapse
|