1
|
Wang Z, You F. Leveraging generative models with periodicity-aware, invertible and invariant representations for crystalline materials design. NATURE COMPUTATIONAL SCIENCE 2025:10.1038/s43588-025-00797-7. [PMID: 40346195 DOI: 10.1038/s43588-025-00797-7] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/03/2024] [Accepted: 03/25/2025] [Indexed: 05/11/2025]
Abstract
Designing periodicity-aware, invariant and invertible representations provides an opportunity for the inverse design of crystalline materials with desired properties by generative models. This objective requires optimizing representations and refining the architecture of generative models, yet its feasibility remains uncertain, given current progress in molecular inverse generation. In this Perspective, we highlight the progress of various methods for designing representations and generative schemes for crystalline materials, discuss the challenges in the field and propose a roadmap for future developments.
Collapse
Affiliation(s)
- Zhilong Wang
- Cornell University AI for Science Institute, Cornell University, Ithaca, NY, USA
- College of Engineering, Cornell University, Ithaca, NY, USA
- Robert Frederick Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA
| | - Fengqi You
- Cornell University AI for Science Institute, Cornell University, Ithaca, NY, USA.
- College of Engineering, Cornell University, Ithaca, NY, USA.
- Robert Frederick Smith School of Chemical and Biomolecular Engineering, Cornell University, Ithaca, NY, USA.
| |
Collapse
|
2
|
Resnik DB, Hosseini M. The ethics of using artificial intelligence in scientific research: new guidance needed for a new tool. AI AND ETHICS 2025; 5:1499-1521. [PMID: 40337745 PMCID: PMC12057767 DOI: 10.1007/s43681-024-00493-8] [Citation(s) in RCA: 10] [Impact Index Per Article: 10.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/14/2023] [Accepted: 05/07/2024] [Indexed: 05/09/2025]
Abstract
Using artificial intelligence (AI) in research offers many important benefits for science and society but also creates novel and complex ethical issues. While these ethical issues do not necessitate changing established ethical norms of science, they require the scientific community to develop new guidance for the appropriate use of AI. In this article, we briefly introduce AI and explain how it can be used in research, examine some of the ethical issues raised when using it, and offer nine recommendations for responsible use, including: (1) Researchers are responsible for identifying, describing, reducing, and controlling AI-related biases and random errors; (2) Researchers should disclose, describe, and explain their use of AI in research, including its limitations, in language that can be understood by non-experts; (3) Researchers should engage with impacted communities, populations, and other stakeholders concerning the use of AI in research to obtain their advice and assistance and address their interests and concerns, such as issues related to bias; (4) Researchers who use synthetic data should (a) indicate which parts of the data are synthetic; (b) clearly label the synthetic data; (c) describe how the data were generated; and (d) explain how and why the data were used; (5) AI systems should not be named as authors, inventors, or copyright holders but their contributions to research should be disclosed and described; (6) Education and mentoring in responsible conduct of research should include discussion of ethical use of AI.
Collapse
Affiliation(s)
- David B. Resnik
- National Institute of Environmental Health Sciences, Durham, USA
| | - Mohammad Hosseini
- Department of Preventive Medicine, Northwestern University Feinberg School of Medicine, Chicago, IL USA
- Galter Health Sciences Library and Learning Center, Northwestern University Feinberg School of Medicine, Chicago, IL USA
| |
Collapse
|
3
|
Kraessig PM, Singh SP, Lu J, Corvalan CM. Sensory-biased autoencoder enables prediction of texture perception from food rheology. Food Res Int 2025; 205:116007. [PMID: 40032450 DOI: 10.1016/j.foodres.2025.116007] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/26/2024] [Revised: 12/17/2024] [Accepted: 02/11/2025] [Indexed: 03/05/2025]
Abstract
Understanding how the physical properties of food affect sensory perception remains a critical challenge for food design. Here, we present an innovative machine learning strategy to decode the complex relationships between non-Newtonian rheological attributes of liquid foods and their perceived texture. A unique and key aspect of our approach is the implementation of an autoencoder neural network that incorporates sensory scores as a decoder bias during training. This enables the autoencoder to effectively identify non-linear, non-injective relationships between shear-thinning properties and perceived thickness, even when trained on a small dataset. This strategy offers a promising approach for advancing food product development by aiding the design of carefully tailored sensory experiences.
Collapse
Affiliation(s)
- Paul M Kraessig
- Transport Phenomena Laboratory, Department of Food Science, Purdue University, West Lafayette, IN, USA
| | - Shyamvanshikumar P Singh
- Transport Phenomena Laboratory, Department of Food Science, Purdue University, West Lafayette, IN, USA
| | - Jiakai Lu
- Department of Food Science, University of Massachusetts, Ahmerst, MA, USA
| | - Carlos M Corvalan
- Transport Phenomena Laboratory, Department of Food Science, Purdue University, West Lafayette, IN, USA.
| |
Collapse
|
4
|
Wang Y, Zhu C, Zhang S, Xiang C, Gao Z, Zhu G, Sun J, Ding X, Li B, Shen X. Accurately Models the Relationship Between Physical Response and Structure Using Kolmogorov-Arnold Network. ADVANCED SCIENCE (WEINHEIM, BADEN-WURTTEMBERG, GERMANY) 2025; 12:e2413805. [PMID: 39921316 PMCID: PMC11948078 DOI: 10.1002/advs.202413805] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 10/28/2024] [Revised: 12/30/2024] [Indexed: 02/10/2025]
Abstract
Artificial intelligence (AI) in science is a key area of modern research. However, many current machine learning methods lack interpretability, making it difficult to grasp the physical mechanisms behind various phenomena, which hampers progress in related fields. This study focuses on the Poisson's ratio of a hexagonal lattice elastic network as it varies with structural deformation. By employing the Kolmogorov-Arnold Network (KAN), the transition of the network's Poisson's ratio from positive to negative as the hexagonal structural element shifts from a convex polygon to a concave polygon was accurately predicted. The KAN provides a clear mathematical framework that describes this transition, revealing the connection between the Poisson's ratio and the geometric properties of the hexagonal element, and accurately identifying the geometric parameters at which the Poisson's ratio equals zero. This work demonstrates the significant potential of the KAN network to clarify the mathematical relationships that underpin physical responses and structural behaviors.
Collapse
Affiliation(s)
- Yang Wang
- State Key Laboratory for Mechanical Behavior of MaterialsSchool of Materials Science and EngineeringXi'an Jiaotong UniversityXi'an710049P. R. China
| | - Changliang Zhu
- Department of PhysicsSouthern University of Science and TechnologyShenzhen518055P. R. China
| | - Shuzhe Zhang
- Department of Materials Science and EngineeringSouthern University of Science and TechnologyShenzhen518055P. R. China
| | - Changsheng Xiang
- State Key Laboratory for Mechanical Behavior of MaterialsSchool of Materials Science and EngineeringXi'an Jiaotong UniversityXi'an710049P. R. China
| | - Zhibin Gao
- State Key Laboratory for Mechanical Behavior of MaterialsSchool of Materials Science and EngineeringXi'an Jiaotong UniversityXi'an710049P. R. China
| | - Guimei Zhu
- School of MicroelectronicsSouthern University of Science and TechnologyShenzhen518055P. R. China
| | - Jun Sun
- State Key Laboratory for Mechanical Behavior of MaterialsSchool of Materials Science and EngineeringXi'an Jiaotong UniversityXi'an710049P. R. China
| | - Xiangdong Ding
- State Key Laboratory for Mechanical Behavior of MaterialsSchool of Materials Science and EngineeringXi'an Jiaotong UniversityXi'an710049P. R. China
| | - Baowen Li
- Department of PhysicsSouthern University of Science and TechnologyShenzhen518055P. R. China
- Department of Materials Science and EngineeringSouthern University of Science and TechnologyShenzhen518055P. R. China
- School of MicroelectronicsSouthern University of Science and TechnologyShenzhen518055P. R. China
- Shenzhen International Quantum AcademyShenzhen518017P. R. China
| | - Xiangying Shen
- Department of PhysicsSouthern University of Science and TechnologyShenzhen518055P. R. China
| |
Collapse
|
5
|
Reynolds SA, Beery S, Burgess N, Burgman M, Butchart SHM, Cooke SJ, Coomes D, Danielsen F, Di Minin E, Durán AP, Gassert F, Hinsley A, Jaffer S, Jones JPG, Li BV, Mac Aodha O, Madhavapeddy A, O'Donnell SAL, Oxbury WM, Peck L, Pettorelli N, Rodríguez JP, Shuckburgh E, Strassburg B, Yamashita H, Miao Z, Sutherland WJ. The potential for AI to revolutionize conservation: a horizon scan. Trends Ecol Evol 2025; 40:191-207. [PMID: 39694720 DOI: 10.1016/j.tree.2024.11.013] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/08/2024] [Revised: 11/11/2024] [Accepted: 11/15/2024] [Indexed: 12/20/2024]
Abstract
Artificial Intelligence (AI) is an emerging tool that could be leveraged to identify the effective conservation solutions demanded by the urgent biodiversity crisis. We present the results of our horizon scan of AI applications likely to significantly benefit biological conservation. An international panel of conservation scientists and AI experts identified 21 key ideas. These included species recognition to uncover 'dark diversity', multimodal models to improve biodiversity loss predictions, monitoring wildlife trade, and addressing human-wildlife conflict. We consider the potential negative impacts of AI adoption, such as AI colonialism and loss of essential conservation skills, and suggest how the conservation field might adapt to harness the benefits of AI while mitigating its risks.
Collapse
Affiliation(s)
- Sam A Reynolds
- Conservation Science Group, Department of Zoology, Cambridge University, Cambridge CB2 3QZ, UK.
| | - Sara Beery
- Faculty of Artificial Intelligence and Decision Making, Massachusetts Institute of Technology, Cambridge, MA, USA
| | - Neil Burgess
- United Nations Environment Programme World Conservation Monitoring Centre (UNEP-WCMC), Cambridge, CB3 0DL, UK; Center for Macroecology, Evolution, and Climate, University of Copenhagen, DK-2100, Copenhagen, Denmark
| | - Mark Burgman
- School of Life Sciences, University of Hawai'i at Mānoa, Honolulu, HI 96822, USA
| | - Stuart H M Butchart
- BirdLife International, Cambridge CB2 3QZ, UK; Department of Zoology, University of Cambridge, Cambridge CB2 3EJ, UK
| | - Steven J Cooke
- Department of Biology and Institute of Environmental and Interdisciplinary Science, Carleton University, Ottawa, ON K1S 5B6, Canada
| | - David Coomes
- Conservation Research Institute and Department of Plant Sciences, Cambridge CB2 3QZ, UK
| | - Finn Danielsen
- Nordic Foundation for Development and Ecology (NORDECO), Copenhagen DK-1159, Denmark
| | - Enrico Di Minin
- Department of Geosciences and Geography, and Helsinki Institute of Sustainability Science (HELSUS), FI-00014, University of Helsinki, Finland; School of Life Sciences, University of KwaZulu-Natal, Durban 4041, South Africa
| | | | | | - Amy Hinsley
- Department of Biology, University of Oxford, Oxford, OX1 3SZ, UK
| | - Sadiq Jaffer
- Department of Computer Science and Technology, University of Cambridge, Cambridge CB3 0FD, UK
| | - Julia P G Jones
- School of Environmental and Natural Sciences, Bangor University, Bangor, UK
| | - Binbin V Li
- Environmental Research Centre, Duke Kunshan University, Kunshan, Jiangsu 215316, China; Nicholas School of the Environment, Duke University, Durham, NC 27708, USA
| | - Oisin Mac Aodha
- School of Informatics, University of Edinburgh, Edinburgh EH8 9AB, UK
| | - Anil Madhavapeddy
- Department of Computer Science and Technology, University of Cambridge, Cambridge CB3 0FD, UK
| | | | - William M Oxbury
- School of Mathematical Sciences, Lancaster University, Lancaster LA1 4YQ, UK
| | - Lloyd Peck
- British Antarctic Survey, Natural Environment Research Council (NERC), Cambridge CB3 0ET, UK
| | | | - Jon Paul Rodríguez
- International Union for Conservation of Nature (IUCN) Species Survival Commission, Calle La Joya, Edificio Unidad Técnica del Este, Chacao, Caracas 1060, Venezuela; Provita, Calle La Joya, Edificio Unidad Técnica del Este, Chacao, Caracas 1060, Venezuela; Centro de Ecología, Instituto Venezolano de Investigaciones Científicas (IVIC), Miranda 1204, Venezuela
| | - Emily Shuckburgh
- Department of Computer Science and Technology, University of Cambridge, Cambridge CB3 0FD, UK
| | - Bernardo Strassburg
- Re.Green, Rio de Janeiro 22470-060, Brazil; Rio Conservation and Sustainability Science Centre, Department of Geography and the Environment, Pontifícia Universidade Católica, Rio de Janeiro 22451-900, Brazil
| | - Hiromi Yamashita
- Ritsumeikan Asia Pacific University, Jyumonji-baru, Beppu City, Oita, 874-8577, Japan
| | | | - William J Sutherland
- Conservation Science Group, Department of Zoology, Cambridge University, Cambridge CB2 3QZ, UK
| |
Collapse
|
6
|
Chen W, Yang S, Yan Y, Gao Y, Zhu J, Dong Z. Empowering nanophotonic applications via artificial intelligence: pathways, progress, and prospects. NANOPHOTONICS (BERLIN, GERMANY) 2025; 14:429-447. [PMID: 39975637 PMCID: PMC11834058 DOI: 10.1515/nanoph-2024-0723] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2024] [Accepted: 01/14/2025] [Indexed: 02/21/2025]
Abstract
Empowering nanophotonic devices via artificial intelligence (AI) has revolutionized both scientific research methodologies and engineering practices, addressing critical challenges in the design and optimization of complex systems. Traditional methods for developing nanophotonic devices are often constrained by the high dimensionality of design spaces and computational inefficiencies. This review highlights how AI-driven techniques provide transformative solutions by enabling the efficient exploration of vast design spaces, optimizing intricate parameter systems, and predicting the performance of advanced nanophotonic materials and devices with high accuracy. By bridging the gap between computational complexity and practical implementation, AI accelerates the discovery of novel nanophotonic functionalities. Furthermore, we delve into emerging domains, such as diffractive neural networks and quantum machine learning, emphasizing their potential to exploit photonic properties for innovative strategies. The review also examines AI's applications in advanced engineering areas, e.g., optical image recognition, showcasing its role in addressing complex challenges in device integration. By facilitating the development of highly efficient, compact optical devices, these AI-powered methodologies are paving the way for next-generation nanophotonic systems with enhanced functionalities and broader applications.
Collapse
Affiliation(s)
- Wei Chen
- Institute of Electromagnetics and Acoustics and Key Laboratory of Electromagnetic Wave Science and Detection Technology, Xiamen University, Xiamen, Fujian361005, China
- Quantum Innovation Centre (Q.InC), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Innovis #08-03, Singapore138634, Republic of Singapore
| | - Shuya Yang
- Institute of Electromagnetics and Acoustics and Key Laboratory of Electromagnetic Wave Science and Detection Technology, Xiamen University, Xiamen, Fujian361005, China
| | - Yiming Yan
- Institute of Electromagnetics and Acoustics and Key Laboratory of Electromagnetic Wave Science and Detection Technology, Xiamen University, Xiamen, Fujian361005, China
| | - Yuan Gao
- Institute of Electromagnetics and Acoustics and Key Laboratory of Electromagnetic Wave Science and Detection Technology, Xiamen University, Xiamen, Fujian361005, China
| | - Jinfeng Zhu
- Institute of Electromagnetics and Acoustics and Key Laboratory of Electromagnetic Wave Science and Detection Technology, Xiamen University, Xiamen, Fujian361005, China
| | - Zhaogang Dong
- Quantum Innovation Centre (Q.InC), Agency for Science Technology and Research (A*STAR), 2 Fusionopolis Way, Innovis #08-03, Singapore138634, Republic of Singapore
- Science, Mathematics, and Technology (SMT), Singapore University of Technology and Design (SUTD), 8 Somapah Road, Singapore487372, Singapore
| |
Collapse
|
7
|
Wang W, Yang Y, Wu F. Towards Data-And Knowledge-Driven AI: A Survey on Neuro-Symbolic Computing. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE 2025; 47:878-899. [PMID: 39418157 DOI: 10.1109/tpami.2024.3483273] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 10/19/2024]
Abstract
Neural-symbolic computing (NeSy), which pursues the integration of the symbolic and statistical paradigms of cognition, has been an active research area of Artificial Intelligence (AI) for many years. As NeSy shows promise of reconciling the advantages of reasoning and interpretability of symbolic representation and robust learning in neural networks, it may serve as a catalyst for the next generation of AI. In the present paper, we provide a systematic overview of the recent developments and important contributions of NeSy research. First, we introduce study history of this area, covering early work and foundations. We further discuss background concepts and identify key driving factors behind the development of NeSy. Afterward, we categorize recent landmark approaches along several main characteristics that underline this research paradigm, including neural-symbolic integration, knowledge representation, knowledge embedding, and functionality. Next, we briefly discuss the successful application of modern NeSy approaches in several domains. Then, we benchmark several NeSy methods on three representative application tasks. Finally, we identify the open problems together with potential future research directions. This survey is expected to help new researchers enter this rapidly evolving field and accelerate the progress towards data-and knowledge-driven AI.
Collapse
|
8
|
Hosseini M, Horbach SPJM, Holmes K, Ross-Hellauer T. Open Science at the generative AI turn: An exploratory analysis of challenges and opportunities. QUANTITATIVE SCIENCE STUDIES 2025; 6:22-45. [PMID: 40124128 PMCID: PMC11928019 DOI: 10.1162/qss_a_00337] [Citation(s) in RCA: 1] [Impact Index Per Article: 1.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 03/25/2025] Open
Abstract
Technology influences Open Science (OS) practices, because conducting science in transparent, accessible, and participatory ways requires tools and platforms for collaboration and sharing results. Due to this relationship, the characteristics of the employed technologies directly impact OS objectives. Generative Artificial Intelligence (GenAI) is increasingly used by researchers for tasks such as text refining, code generation/editing, reviewing literature, and data curation/analysis. Nevertheless, concerns about openness, transparency, and bias suggest that GenAI may benefit from greater engagement with OS. GenAI promises substantial efficiency gains but is currently fraught with limitations that could negatively impact core OS values, such as fairness, transparency, and integrity, and may harm various social actors. In this paper, we explore the possible positive and negative impacts of GenAI on OS. We use the taxonomy within the UNESCO Recommendation on Open Science to systematically explore the intersection of GenAI and OS. We conclude that using GenAI could advance key OS objectives by broadening meaningful access to knowledge, enabling efficient use of infrastructure, improving engagement of societal actors, and enhancing dialogue among knowledge systems. However, due to GenAI's limitations, it could also compromise the integrity, equity, reproducibility, and reliability of research. Hence, sufficient checks, validation, and critical assessments are essential when incorporating GenAI into research workflows.
Collapse
Affiliation(s)
- Mohammad Hosseini
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Serge P J M Horbach
- Institute for Science in Society, Radboud University, Nijmegen, The Netherlands
| | - Kristi Holmes
- Department of Preventive Medicine, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
- Galter Health Sciences Library and Learning Center, Feinberg School of Medicine, Northwestern University, Chicago, IL, USA
| | - Tony Ross-Hellauer
- Open and Reproducible Research Group, Know-Center GmbH and Institute for Interactive Systems and Data Science, Graz University of Technology, Graz, Austria
| |
Collapse
|
9
|
Scrosati PM, MacKay-Barr EH, Konermann L. Umbrella Sampling MD Simulations for Retention Prediction in Peptide Reversed-phase Liquid Chromatography. Anal Chem 2025; 97:828-837. [PMID: 39705373 DOI: 10.1021/acs.analchem.4c05428] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 12/22/2024]
Abstract
Reversed-phase liquid chromatography (RPLC) is an essential tool for separating complex mixtures such as proteolytic digests in bottom-up proteomics. There is growing interest in methods that can predict the RPLC retention behavior of peptides and other analytes. Already, existing algorithms provide excellent performance based on empirical rules or large sets of RPLC training data. Here we explored a new type of retention prediction strategy that relies on first-principles modeling of peptide interactions with a C18 stationary phase. We recently demonstrated that molecular dynamics (MD) simulations can provide atomistic insights into the behavior of peptides under RPLC conditions (Anal. Chem. 2023, 95, 3892). However, the current work found that it is problematic to use conventional MD data for retention prediction, evident from a poor correlation between experimental retention times and MD-generated "fraction bound" values. We thus turned to umbrella sampling MD, a complementary technique that has previously been applied to probe noncovalent contacts in other types of systems. By restraining the peptide dynamic motions at various positions inside a C18-lined pore, we determined the free energy of the system as a function of peptide-stationary phase distance. ΔGbinding values determined in this way under various mobile phase conditions were linearly correlated with experimental retention times of tryptic test peptides. This work opens retention prediction avenues for novel types of stationary and mobile phases, and for peptides (or other analytes) having arbitrary chemical properties, without the need for RPLC reference data. Umbrella sampling can be used as a stand-alone tool, or it may serve to enhance existing retention prediction algorithms.
Collapse
Affiliation(s)
- Pablo M Scrosati
- Department of Chemistry, The University of Western Ontario, London, Ontario N6A 5B7, Canada
| | - Evelyn H MacKay-Barr
- Department of Chemistry, The University of Western Ontario, London, Ontario N6A 5B7, Canada
| | - Lars Konermann
- Department of Chemistry, The University of Western Ontario, London, Ontario N6A 5B7, Canada
| |
Collapse
|
10
|
Rodríguez C, Arlt S, Möckl L, Krenn M. Automated discovery of experimental designs in super-resolution microscopy with XLuminA. Nat Commun 2024; 15:10658. [PMID: 39658575 PMCID: PMC11632100 DOI: 10.1038/s41467-024-54696-y] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/28/2024] [Accepted: 11/19/2024] [Indexed: 12/12/2024] Open
Abstract
Driven by human ingenuity and creativity, the discovery of super-resolution techniques, which circumvent the classical diffraction limit of light, represent a leap in optical microscopy. However, the vast space encompassing all possible experimental configurations suggests that some powerful concepts and techniques might have not been discovered yet, and might never be with a human-driven direct design approach. Thus, AI-based exploration techniques could provide enormous benefit, by exploring this space in a fast, unbiased way. We introduce XLuminA, an open-source computational framework developed using JAX, a high-performance computing library in Python. XLuminA offers enhanced computational speed enabled by JAX's accelerated linear algebra compiler (XLA), just-in-time compilation, and its seamlessly integrated automatic vectorization, automatic differentiation capabilities and GPU compatibility. XLuminA demonstrates a speed-up of 4 orders of magnitude compared to well-established numerical optimization methods. We showcase XLuminA's potential by re-discovering three foundational experiments in advanced microscopy, and identifying an unseen experimental blueprint featuring sub-diffraction imaging capabilities. This work constitutes an important step in AI-driven scientific discovery of new concepts in optics and advanced microscopy.
Collapse
Affiliation(s)
- Carla Rodríguez
- Max Planck Institute for the Science of Light, Erlangen, Germany.
| | - Sören Arlt
- Max Planck Institute for the Science of Light, Erlangen, Germany
| | - Leonhard Möckl
- Max Planck Institute for the Science of Light, Erlangen, Germany.
- Friedrich-Alexander-University Erlangen-Nuremberg, Faculty of Sciences, Department of Physics, Erlangen, Germany.
- Friedrich-Alexander-University Erlangen-Nuremberg, Faculty of Medicine 1/CITABLE, Erlangen, Germany.
- Deutsches Zentrum Immuntherapie (DZI), Erlangen, Germany.
| | - Mario Krenn
- Max Planck Institute for the Science of Light, Erlangen, Germany.
| |
Collapse
|
11
|
Gao J, Wang D. Quantifying the use and potential benefits of artificial intelligence in scientific research. Nat Hum Behav 2024; 8:2281-2292. [PMID: 39394445 DOI: 10.1038/s41562-024-02020-5] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/26/2023] [Accepted: 09/12/2024] [Indexed: 10/13/2024]
Abstract
The rapid advancement of artificial intelligence (AI) is poised to reshape almost every line of work. Despite enormous efforts devoted to understanding AI's economic impacts, we lack a systematic understanding of the benefits to scientific research associated with the use of AI. Here we develop a measurement framework to estimate the direct use of AI and associated benefits in science. We find that the use and benefits of AI appear widespread throughout the sciences, growing especially rapidly since 2015. However, there is a substantial gap between AI education and its application in research, highlighting a misalignment between AI expertise supply and demand. Our analysis also reveals demographic disparities, with disciplines with higher proportions of women or Black scientists reaping fewer benefits from AI, potentially exacerbating existing inequalities in science. These findings have implications for the equity and sustainability of the research enterprise, especially as the integration of AI with science continues to deepen.
Collapse
Affiliation(s)
- Jian Gao
- Center for Science of Science and Innovation, Northwestern University, Evanston, IL, USA
- Kellogg School of Management, Northwestern University, Evanston, IL, USA
- Ryan Institute on Complexity, Northwestern University, Evanston, IL, USA
- Faculty of Social Sciences, The University of Hong Kong, Hong Kong SAR, China
| | - Dashun Wang
- Center for Science of Science and Innovation, Northwestern University, Evanston, IL, USA.
- Kellogg School of Management, Northwestern University, Evanston, IL, USA.
- Ryan Institute on Complexity, Northwestern University, Evanston, IL, USA.
- McCormick School of Engineering, Northwestern University, Evanston, IL, USA.
| |
Collapse
|
12
|
Mejia-Mendez JL, Reza-Zaldívar EE, Sanchez-Martinez A, Ceballos-Sanchez O, Navarro-López DE, Marcelo Lozano L, Armendariz-Borunda J, Tiwari N, Jacobo-Velázquez DA, Sanchez-Ante G, López-Mena ER. Exploring the cytotoxic and antioxidant properties of lanthanide-doped ZnO nanoparticles: a study with machine learning interpretation. J Nanobiotechnology 2024; 22:687. [PMID: 39523303 PMCID: PMC11552316 DOI: 10.1186/s12951-024-02957-9] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/19/2024] [Accepted: 10/27/2024] [Indexed: 11/16/2024] Open
Abstract
BACKGROUND Lanthanide-based nanomaterials offer a promising alternative for cancer therapy because of their selectivity and effectiveness, which can be modified and predicted by leveraging the improved accuracy and enhanced decision-making of machine learning (ML) modeling. METHODS In this study, erbium (Er3+) and ytterbium (Yb3+) were used to dope zinc oxide (ZnO) nanoparticles (NPs). Various characterization techniques and biological assays were employed to investigate the physicochemical and optical properties of the (Er, Yb)-doped ZnO NPs, revealing the influence of the lanthanide elements. RESULTS The (Er, Yb)-doped ZnO NPs exhibited laminar-type morphologies, negative surface charges, and optical bandgaps that vary with the presence of Er3+ and Yb3+. The incorporation of lanthanide ions reduced the cytotoxicity activity of ZnO against HEPG-2, CACO-2, and U87 cell lines. Conversely, doping with Er3+ and Yb3+ enhanced the antioxidant activity of the ZnO against DPPH, ABTS, and H2O2 radicals. The extra tree (ET) and random forest (RF) models predicted the relevance of the characterization results vis-à-vis the cytotoxic properties of the synthesized NPs. CONCLUSION This study demonstrates, for the first time, the synthesis of ZnO NPs doped with Er and Yb via a solution polymerization route. According to characterization results, it was unveiled that the effect of optical bandgap variations influenced the cytotoxic performance of the developed lanthanide-doped ZnO NPs, being the undoped ZnO NPs the most cytotoxic ones. The presence alone or in combination of Er and Yb enhanced their scavenging capacity. ML models such as ET and RF efficiently demonstrated that the concentration and cell line type are key parameters that influence the cytotoxicity of (Er, Yb)-doped ZnO NPs achieving high accuracy rates of 98.96% and 98.67%, respectively. This study expands the knowledge of lanthanides as dopants of nanomaterials for biological and medical applications and supports their potential in cancer therapy by integrating robust ML approaches.
Collapse
Affiliation(s)
- Jorge L Mejia-Mendez
- Departamento de Ciencias Químico-Biológicas, Universidad de las Américas Puebla, Santa Catarina Mártir s/n, Cholula, Puebla, 72810, Mexico
| | - Edwin E Reza-Zaldívar
- Tecnologico de Monterrey, Institute for Obesity Research, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45201, Mexico
| | - A Sanchez-Martinez
- Departamento de Ingeniería de Proyectos, CUCEI, Universidad de Guadalajara, Av. José Guadalupe Zuno # 48, Industrial los Belenes, Zapopan, Jalisco, 45157, México
| | - O Ceballos-Sanchez
- Departamento de Ingeniería de Proyectos, CUCEI, Universidad de Guadalajara, Av. José Guadalupe Zuno # 48, Industrial los Belenes, Zapopan, Jalisco, 45157, México
| | - Diego E Navarro-López
- Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45138, Mexico
| | - L Marcelo Lozano
- Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45138, Mexico
| | - Juan Armendariz-Borunda
- Tecnologico de Monterrey, School of Medicine and Health Sciences, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45138, Mexico
- Institute for Molecular Biology in Medicine and Gene Therapy, Department of Molecular Biology and Genomics, Health Sciences University Center, University of Guadalajara, Guadalajara, 44340, Mexico
| | - Naveen Tiwari
- Center for Research in Biological Chemistry and Molecular Materials (CiQUS), , University of Santiago de Compostela, Rúa Jenaro de La Fuente S/N, 15782, Santiago de Compostela, A Coruna, 15782, Mexico.
| | - Daniel A Jacobo-Velázquez
- Tecnologico de Monterrey, Institute for Obesity Research, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45201, Mexico.
- Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45138, Mexico.
| | - Gildardo Sanchez-Ante
- Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45138, Mexico.
| | - Edgar R López-Mena
- Tecnologico de Monterrey, Escuela de Ingenieria y Ciencias, Ave. General Ramon Corona 2514, Zapopan, Jalisco, 45138, Mexico.
| |
Collapse
|
13
|
Gao S, Fang A, Huang Y, Giunchiglia V, Noori A, Schwarz JR, Ektefaie Y, Kondic J, Zitnik M. Empowering biomedical discovery with AI agents. Cell 2024; 187:6125-6151. [PMID: 39486399 DOI: 10.1016/j.cell.2024.09.022] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/15/2024] [Revised: 07/16/2024] [Accepted: 09/12/2024] [Indexed: 11/04/2024]
Abstract
We envision "AI scientists" as systems capable of skeptical learning and reasoning that empower biomedical research through collaborative agents that integrate AI models and biomedical tools with experimental platforms. Rather than taking humans out of the discovery process, biomedical AI agents combine human creativity and expertise with AI's ability to analyze large datasets, navigate hypothesis spaces, and execute repetitive tasks. AI agents are poised to be proficient in various tasks, planning discovery workflows and performing self-assessment to identify and mitigate gaps in their knowledge. These agents use large language models and generative models to feature structured memory for continual learning and use machine learning tools to incorporate scientific knowledge, biological principles, and theories. AI agents can impact areas ranging from virtual cell simulation, programmable control of phenotypes, and the design of cellular circuits to developing new therapies.
Collapse
Affiliation(s)
- Shanghua Gao
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Ada Fang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Department of Chemistry and Chemical Biology, Harvard University, Cambridge, MA, USA; Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Allston, MA, USA
| | - Yepeng Huang
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Program in Biological and Biomedical Sciences, Harvard Medical School, Boston, MA, USA
| | - Valentina Giunchiglia
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Department of Brain Sciences, Imperial College London, London, UK
| | - Ayush Noori
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Harvard College, Cambridge, MA, USA
| | | | - Yasha Ektefaie
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Program in Biomedical Informatics, Harvard Medical School, Boston, MA, USA
| | - Jovana Kondic
- Department of Electrical Engineering and Computer Science, MIT, Cambridge, MA, USA
| | - Marinka Zitnik
- Department of Biomedical Informatics, Harvard Medical School, Boston, MA, USA; Kempner Institute for the Study of Natural and Artificial Intelligence, Harvard University, Allston, MA, USA; Broad Institute of MIT and Harvard, Cambridge, MA, USA; Harvard Data Science Initiative, Cambridge, MA, USA.
| |
Collapse
|
14
|
Krauss A. Science of science: A multidisciplinary field studying science. Heliyon 2024; 10:e36066. [PMID: 39296115 PMCID: PMC11408022 DOI: 10.1016/j.heliyon.2024.e36066] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 11/18/2023] [Revised: 07/24/2024] [Accepted: 08/08/2024] [Indexed: 09/21/2024] Open
Abstract
Science and knowledge are studied by researchers across many disciplines, examining how they are developed, what their current boundaries are and how we can advance them. By integrating evidence across disparate disciplines, the holistic field of science of science can address these foundational questions. This field illustrates how science is shaped by many interconnected factors: the cognitive processes of scientists, the historical evolution of science, economic incentives, institutional influences, computational approaches, statistical, mathematical and instrumental foundations of scientific inference, scientometric measures, philosophical and ethical dimensions of scientific concepts, among other influences. Achieving a comprehensive overview of a multifaceted field like the science of science requires pulling together evidence from the many sub-fields studying science across the natural and social sciences and humanities. This enables developing an interdisciplinary perspective of scientific practice, a more holistic understanding of scientific processes and outcomes, and more nuanced perspectives to how scientific research is conducted, influenced and evolves. It enables leveraging the strengths of various disciplines to create a holistic view of the foundations of science. Different researchers study science from their own disciplinary perspective and use their own methods, and there is a large divide between quantitative and qualitative researchers as they commonly do not read or cite research using other methodological approaches. A broader, synthesizing paper employing a qualitative approach can however help provide a bridge between disciplines by pulling together aspects of science (economic, scientometric, psychological, philosophical etc.). Such an approach enables identifying, across the range of fields, the powerful role of our scientific methods and instruments in shaping most aspects of our knowledge and science, whereas economic, social and historical influences help shape what knowledge we pursue. A unifying theory is then outlined for science of science - the new-methods-drive-science theory.
Collapse
Affiliation(s)
- Alexander Krauss
- London School of Economics, London, UK
- Institute for Economic Analysis, Spanish National Research Council, Barcelona, Spain
| |
Collapse
|
15
|
Chen H, Zhang B, Huang J. Recent advances and applications of artificial intelligence in 3D bioprinting. BIOPHYSICS REVIEWS 2024; 5:031301. [PMID: 39036708 PMCID: PMC11260195 DOI: 10.1063/5.0190208] [Citation(s) in RCA: 5] [Impact Index Per Article: 5.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 12/04/2023] [Accepted: 06/11/2024] [Indexed: 07/23/2024]
Abstract
3D bioprinting techniques enable the precise deposition of living cells, biomaterials, and biomolecules, emerging as a promising approach for engineering functional tissues and organs. Meanwhile, recent advances in 3D bioprinting enable researchers to build in vitro models with finely controlled and complex micro-architecture for drug screening and disease modeling. Recently, artificial intelligence (AI) has been applied to different stages of 3D bioprinting, including medical image reconstruction, bioink selection, and printing process, with both classical AI and machine learning approaches. The ability of AI to handle complex datasets, make complex computations, learn from past experiences, and optimize processes dynamically makes it an invaluable tool in advancing 3D bioprinting. The review highlights the current integration of AI in 3D bioprinting and discusses future approaches to harness the synergistic capabilities of 3D bioprinting and AI for developing personalized tissues and organs.
Collapse
Affiliation(s)
| | - Bin Zhang
- Department of Mechanical and Aerospace Engineering, Brunel University London, London, United Kingdom
| | - Jie Huang
- Department of Mechanical Engineering, University College London, London, United Kingdom
| |
Collapse
|
16
|
Thamizharasan S, Chandrasekar VK, Senthilvelan M, Senthilkumar DV. Stimulus-induced dynamical states in an adaptive network with symmetric adaptation. Phys Rev E 2024; 110:034217. [PMID: 39425440 DOI: 10.1103/physreve.110.034217] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/22/2024] [Accepted: 09/03/2024] [Indexed: 10/21/2024]
Abstract
We consider an adaptive network of identical phase oscillators with the symmetric adaptation rule for the evolution of the connection weights under the influence of an external force. We show that the adaptive network exhibits a plethora of self-organizing dynamical states such as the two-cluster state, multiantipodal clusters, splay cluster, splay chimera, forced entrained state, chimera state, bump state, coherent, and incoherent states in the two-parameter phase diagrams. The intriguing structures of the frequency clusters and instantaneous phases of the oscillators characterize the distinct self-organized synchronized and partial synchronized states. The hierarchical organization of the frequency clusters, resulting in strongly coupled subnetworks, is also evident from the dynamics of the coupling weights, where the frequency clusters are either very weakly coupled or even completely decoupled from each other. Additionally, we also deduce the stability condition for the forced entrained state.
Collapse
|
17
|
Tom G, Schmid SP, Baird SG, Cao Y, Darvish K, Hao H, Lo S, Pablo-García S, Rajaonson EM, Skreta M, Yoshikawa N, Corapi S, Akkoc GD, Strieth-Kalthoff F, Seifrid M, Aspuru-Guzik A. Self-Driving Laboratories for Chemistry and Materials Science. Chem Rev 2024; 124:9633-9732. [PMID: 39137296 PMCID: PMC11363023 DOI: 10.1021/acs.chemrev.4c00055] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 08/15/2024]
Abstract
Self-driving laboratories (SDLs) promise an accelerated application of the scientific method. Through the automation of experimental workflows, along with autonomous experimental planning, SDLs hold the potential to greatly accelerate research in chemistry and materials discovery. This review provides an in-depth analysis of the state-of-the-art in SDL technology, its applications across various scientific disciplines, and the potential implications for research and industry. This review additionally provides an overview of the enabling technologies for SDLs, including their hardware, software, and integration with laboratory infrastructure. Most importantly, this review explores the diverse range of scientific domains where SDLs have made significant contributions, from drug discovery and materials science to genomics and chemistry. We provide a comprehensive review of existing real-world examples of SDLs, their different levels of automation, and the challenges and limitations associated with each domain.
Collapse
Affiliation(s)
- Gary Tom
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Stefan P. Schmid
- Department
of Chemistry and Applied Biosciences, ETH
Zurich, Vladimir-Prelog-Weg 1, CH-8093 Zurich, Switzerland
| | - Sterling G. Baird
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Yang Cao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Kourosh Darvish
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Han Hao
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
| | - Stanley Lo
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Sergio Pablo-García
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
| | - Ella M. Rajaonson
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Marta Skreta
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Naruki Yoshikawa
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
| | - Samantha Corapi
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
| | - Gun Deniz Akkoc
- Forschungszentrum
Jülich GmbH, Helmholtz Institute
for Renewable Energy Erlangen-Nürnberg, Cauerstr. 1, 91058 Erlangen, Germany
- Department
of Chemical and Biological Engineering, Friedrich-Alexander Universität Erlangen-Nürnberg, Egerlandstr. 3, 91058 Erlangen, Germany
| | - Felix Strieth-Kalthoff
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- School of
Mathematics and Natural Sciences, University
of Wuppertal, Gaußstraße
20, 42119 Wuppertal, Germany
| | - Martin Seifrid
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Department
of Materials Science and Engineering, North
Carolina State University, Raleigh, North Carolina 27695, United States of America
| | - Alán Aspuru-Guzik
- Department
of Chemistry, University of Toronto, 80 St. George St, Toronto, Ontario M5S 3H6, Canada
- Department
of Computer Science, University of Toronto, 40 St. George St, Toronto, Ontario M5S 2E4, Canada
- Vector Institute
for Artificial Intelligence, 661 University Ave Suite 710, Toronto, Ontario M5G 1M1, Canada
- Acceleration
Consortium, 80 St. George
St, Toronto, Ontario M5S 3H6, Canada
- Department
of Chemical Engineering & Applied Chemistry, University of Toronto, Toronto, Ontario M5S 3E5, Canada
- Department
of Materials Science & Engineering, University of Toronto, Toronto, Ontario M5S 3E4, Canada
- Lebovic
Fellow, Canadian Institute for Advanced
Research (CIFAR), 661
University Ave, Toronto, Ontario M5G 1M1, Canada
| |
Collapse
|
18
|
Jin T, Zhao Q, Schofield AB, Savoie BM. Deductive machine learning models for product identification. Chem Sci 2024; 15:11995-12005. [PMID: 39092129 PMCID: PMC11290435 DOI: 10.1039/d3sc04909d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 09/17/2023] [Accepted: 06/09/2024] [Indexed: 08/04/2024] Open
Abstract
Deductive solution strategies are required in prediction scenarios that are under determined, when contradictory information is available, or more generally wherever one-to-many non-functional mappings occur. In contrast, most contemporary machine learning (ML) in the chemical sciences is inductive learning from example, with a fixed set of features. Chemical workflows are replete with situations requiring deduction, including many aspects of lab automation and spectral interpretation. Here, a general strategy is described for designing and training machine learning models capable of deduction that consists of combining individual inductive models into a larger deductive network. The training and testing of these models is demonstrated on the task of deducing reaction products from a mixture of spectral sources. The resulting models can distinguish between intended and unintended reaction outcomes and identify starting material based on a mixture of spectral sources. The models also perform well on tasks that they were not directly trained on, like performing structural inference using real rather than simulated spectral inputs, predicting minor products from named organic chemistry reactions, identifying reagents and isomers as plausible impurities, and handling missing or conflicting information. A new dataset of 1 124 043 simulated spectra that were generated to train these models is also distributed with this work. These findings demonstrate that deductive bottlenecks for chemical problems are not fundamentally insuperable for ML models.
Collapse
Affiliation(s)
- Tianfan Jin
- Department of Chemical Engineering, Purdue University West Lafayette USA
| | - Qiyuan Zhao
- Department of Chemical Engineering, Purdue University West Lafayette USA
| | - Andrew B Schofield
- Department of Chemical Engineering, Purdue University West Lafayette USA
| | - Brett M Savoie
- Department of Chemical Engineering, Purdue University West Lafayette USA
| |
Collapse
|
19
|
Ding N, Yuan Z, Ma Z, Wu Y, Yin L. AI-Assisted Rational Design and Activity Prediction of Biological Elements for Optimizing Transcription-Factor-Based Biosensors. Molecules 2024; 29:3512. [PMID: 39124917 PMCID: PMC11313831 DOI: 10.3390/molecules29153512] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/27/2024] [Revised: 07/22/2024] [Accepted: 07/24/2024] [Indexed: 08/12/2024] Open
Abstract
The rational design, activity prediction, and adaptive application of biological elements (bio-elements) are crucial research fields in synthetic biology. Currently, a major challenge in the field is efficiently designing desired bio-elements and accurately predicting their activity using vast datasets. The advancement of artificial intelligence (AI) technology has enabled machine learning and deep learning algorithms to excel in uncovering patterns in bio-element data and predicting their performance. This review explores the application of AI algorithms in the rational design of bio-elements, activity prediction, and the regulation of transcription-factor-based biosensor response performance using AI-designed elements. We discuss the advantages, adaptability, and biological challenges addressed by the AI algorithms in various applications, highlighting their powerful potential in analyzing biological data. Furthermore, we propose innovative solutions to the challenges faced by AI algorithms in the field and suggest future research directions. By consolidating current research and demonstrating the practical applications and future potential of AI in synthetic biology, this review provides valuable insights for advancing both academic research and practical applications in biotechnology.
Collapse
Affiliation(s)
- Nana Ding
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou 311300, China;
- Zhejiang Provincial Key Laboratory of Resources Protection and Innovation of Traditional Chinese Medicine, Zhejiang A&F University, Hangzhou 311300, China
| | - Zenan Yuan
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou 311300, China;
- Zhejiang Provincial Key Laboratory of Resources Protection and Innovation of Traditional Chinese Medicine, Zhejiang A&F University, Hangzhou 311300, China
| | - Zheng Ma
- Zhejiang Provincial Key Laboratory of Biometrology and Inspection & Quarantine, College of Life Sciences, China Jiliang University, Hangzhou 310018, China;
| | - Yefei Wu
- Zhejiang Qianjiang Biochemical Co., Ltd., Haining 314400, China;
| | - Lianghong Yin
- State Key Laboratory of Subtropical Silviculture, Zhejiang A&F University, Hangzhou 311300, China;
- Zhejiang Provincial Key Laboratory of Resources Protection and Innovation of Traditional Chinese Medicine, Zhejiang A&F University, Hangzhou 311300, China
| |
Collapse
|
20
|
Gao TT, Barzel B, Yan G. Learning interpretable dynamics of stochastic complex systems from experimental data. Nat Commun 2024; 15:6029. [PMID: 39019850 PMCID: PMC11254936 DOI: 10.1038/s41467-024-50378-x] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 03/07/2024] [Accepted: 07/09/2024] [Indexed: 07/19/2024] Open
Abstract
Complex systems with many interacting nodes are inherently stochastic and best described by stochastic differential equations. Despite increasing observation data, inferring these equations from empirical data remains challenging. Here, we propose the Langevin graph network approach to learn the hidden stochastic differential equations of complex networked systems, outperforming five state-of-the-art methods. We apply our approach to two real systems: bird flock movement and tau pathology diffusion in brains. The inferred equation for bird flocks closely resembles the second-order Vicsek model, providing unprecedented evidence that the Vicsek model captures genuine flocking dynamics. Moreover, our approach uncovers the governing equation for the spread of abnormal tau proteins in mouse brains, enabling early prediction of tau occupation in each brain region and revealing distinct pathology dynamics in mutant mice. By learning interpretable stochastic dynamics of complex systems, our findings open new avenues for downstream applications such as control.
Collapse
Affiliation(s)
- Ting-Ting Gao
- MOE Key Laboratory of Advanced Micro-Structured Materials, and School of Physical Science and Engineering, Tongji University, Shanghai, P. R. China
- Shanghai Research Institute for Intelligent Autonomous Systems, National Key Laboratory of Autonomous Intelligent Unmanned Systems, MOE Frontiers Science Center for Intelligent Autonomous Systems, and Shanghai Key Laboratory of Intelligent Autonomous Systems, Tongji University, Shanghai, P. R. China
| | - Baruch Barzel
- Department of Mathematics, Bar-Ilan University, Ramat-Gan, Israel
- Gonda Multidisciplinary Brain Research Center, Bar-Ilan University, Ramat-Gan, Israel
| | - Gang Yan
- MOE Key Laboratory of Advanced Micro-Structured Materials, and School of Physical Science and Engineering, Tongji University, Shanghai, P. R. China.
- Shanghai Research Institute for Intelligent Autonomous Systems, National Key Laboratory of Autonomous Intelligent Unmanned Systems, MOE Frontiers Science Center for Intelligent Autonomous Systems, and Shanghai Key Laboratory of Intelligent Autonomous Systems, Tongji University, Shanghai, P. R. China.
| |
Collapse
|
21
|
Prentner R, Hoffman DD. Interfacing consciousness. Front Psychol 2024; 15:1429376. [PMID: 39077200 PMCID: PMC11284140 DOI: 10.3389/fpsyg.2024.1429376] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/08/2024] [Accepted: 06/19/2024] [Indexed: 07/31/2024] Open
Abstract
The current stage of consciousness science has reached an impasse. We blame the physicalist worldview for this and propose a new perspective to make progress on the problems of consciousness. Our perspective is rooted in the theory of conscious agents. We thereby stress the fundamentality of consciousness outside of spacetime, the importance of agency, and the mathematical character of the theory. For conscious agent theory (CAT) to achieve the status of a robust scientific framework, it needs to be integrated with a good explanation of perception and cognition. We argue that this role is played by the interface theory of perception (ITP), an evolutionary-based model of perception that has been previously formulated and defended by the authors. We are specifically interested in what this tells us about the possibility of AI consciousness and conclude with a somewhat counter-intuitive proposal: we live inside a simulation instantiated, not digitally, but in consciousness. Such a simulation is just an interface representation of the dynamics of conscious agents for a conscious agent. This paves the way for employing AI in consciousness science through customizing our interface.
Collapse
Affiliation(s)
- Robert Prentner
- Institute of Humanities, ShanghaiTech University, Shanghai, China
- Association for Mathematical Consciousness Science, Munich, Germany
| | - Donald D. Hoffman
- Department of Cognitive Sciences, University of California, Irvine, Irvine, CA, United States
| |
Collapse
|
22
|
Park H, Patel P, Haas R, Huerta EA. APACE: AlphaFold2 and advanced computing as a service for accelerated discovery in biophysics. Proc Natl Acad Sci U S A 2024; 121:e2311888121. [PMID: 38913887 PMCID: PMC11228474 DOI: 10.1073/pnas.2311888121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 08/14/2023] [Accepted: 12/25/2023] [Indexed: 06/26/2024] Open
Abstract
The prediction of protein 3D structure from amino acid sequence is a computational grand challenge in biophysics and plays a key role in robust protein structure prediction algorithms, from drug discovery to genome interpretation. The advent of AI models, such as AlphaFold, is revolutionizing applications that depend on robust protein structure prediction algorithms. To maximize the impact, and ease the usability, of these AI tools we introduce APACE, AlphaFold2 and advanced computing as a service, a computational framework that effectively handles this AI model and its TB-size database to conduct accelerated protein structure prediction analyses in modern supercomputing environments. We deployed APACE in the Delta and Polaris supercomputers and quantified its performance for accurate protein structure predictions using four exemplar proteins: 6AWO, 6OAN, 7MEZ, and 6D6U. Using up to 300 ensembles, distributed across 200 NVIDIA A100 GPUs, we found that APACE is up to two orders of magnitude faster than off-the-self AlphaFold2 implementations, reducing time-to-solution from weeks to minutes. This computational approach may be readily linked with robotics laboratories to automate and accelerate scientific discovery.
Collapse
Affiliation(s)
- Hyun Park
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439
- Theoretical and Computational Biophysics Group, Beckman Institute for Advanced Science and Technology, University of Illinois at Urbana-Champaign, Urbana, IL 61801
- Center for Biophysics and Quantitative Biology, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| | - Parth Patel
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439
- Department of Computer Science, University of Illinois at Urbana-Champaign, Urbana, IL 61801
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| | - Roland Haas
- National Center for Supercomputing Applications, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| | - E A Huerta
- Data Science and Learning Division, Argonne National Laboratory, Lemont, IL 60439
- Department of Computer Science, The University of Chicago, Chicago, IL 60637
- Department of Physics, University of Illinois at Urbana-Champaign, Urbana, IL 61801
| |
Collapse
|
23
|
Zsidai B, Kaarre J, Narup E, Hamrin Senorski E, Pareek A, Grassi A, Ley C, Longo UG, Herbst E, Hirschmann MT, Kopf S, Seil R, Tischer T, Samuelsson K, Feldt R. A practical guide to the implementation of artificial intelligence in orthopaedic research-Part 2: A technical introduction. J Exp Orthop 2024; 11:e12025. [PMID: 38715910 PMCID: PMC11076014 DOI: 10.1002/jeo2.12025] [Citation(s) in RCA: 4] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Submit a Manuscript] [Subscribe] [Scholar Register] [Received: 12/13/2023] [Revised: 01/31/2024] [Accepted: 03/21/2024] [Indexed: 12/26/2024] Open
Abstract
UNLABELLED Recent advances in artificial intelligence (AI) present a broad range of possibilities in medical research. However, orthopaedic researchers aiming to participate in research projects implementing AI-based techniques require a sound understanding of the technical fundamentals of this rapidly developing field. Initial sections of this technical primer provide an overview of the general and the more detailed taxonomy of AI methods. Researchers are presented with the technical basics of the most frequently performed machine learning (ML) tasks, such as classification, regression, clustering and dimensionality reduction. Additionally, the spectrum of supervision in ML including the domains of supervised, unsupervised, semisupervised and self-supervised learning will be explored. Recent advances in neural networks (NNs) and deep learning (DL) architectures have rendered them essential tools for the analysis of complex medical data, which warrants a rudimentary technical introduction to orthopaedic researchers. Furthermore, the capability of natural language processing (NLP) to interpret patterns in human language is discussed and may offer several potential applications in medical text classification, patient sentiment analysis and clinical decision support. The technical discussion concludes with the transformative potential of generative AI and large language models (LLMs) on AI research. Consequently, this second article of the series aims to equip orthopaedic researchers with the fundamental technical knowledge required to engage in interdisciplinary collaboration in AI-driven orthopaedic research. LEVEL OF EVIDENCE Level IV.
Collapse
Affiliation(s)
- Bálint Zsidai
- Sahlgrenska Sports Medicine CenterGothenburgSweden
- Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska AcademyUniversity of GothenburgGothenburgSweden
| | - Janina Kaarre
- Sahlgrenska Sports Medicine CenterGothenburgSweden
- Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska AcademyUniversity of GothenburgGothenburgSweden
- Department of Orthopaedic Surgery, UPMC Freddie Fu Sports Medicine CenterUniversity of PittsburghPittsburghUSA
| | - Eric Narup
- Sahlgrenska Sports Medicine CenterGothenburgSweden
- Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska AcademyUniversity of GothenburgGothenburgSweden
| | - Eric Hamrin Senorski
- Sahlgrenska Sports Medicine CenterGothenburgSweden
- Department of Health and Rehabilitation, Institute of Neuroscience and Physiology, Sahlgrenska AcademyUniversity of GothenburgGothenburgSweden
- Sportrehab Sports Medicine ClinicGothenburgSweden
| | - Ayoosh Pareek
- Sports and Shoulder Service, Hospital for Special SurgeryNew YorkNew YorkUSA
| | - Alberto Grassi
- Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska AcademyUniversity of GothenburgGothenburgSweden
- IIa Clinica Ortopedica e Traumatologica, IRCCS Istituto Ortopedico RizzoliBolognaItaly
| | - Christophe Ley
- Department of MathematicsUniversity of LuxembourgEsch‐sur‐AlzetteLuxembourg
| | - Umile Giuseppe Longo
- Fondazione Policlinico Universitario Campus Bio‐MedicoRomeItaly
- Research Unit of Orthopaedic and Trauma Surgery, Department of Medicine and SurgeryUniversità Campus Bio‐Medico di RomaRomeItaly
| | - Elmar Herbst
- Department of Trauma, Hand and Reconstructive SurgeryUniversity Hospital MünsterMünsterGermany
| | - Michael T. Hirschmann
- Department of Orthopedic Surgery and Traumatology, Head Knee Surgery and DKF Head of ResearchKantonsspital BasellandBruderholzSwitzerland
| | - Sebastian Kopf
- Center of Orthopaedics and TraumatologyUniversity Hospital Brandenburg a.d.H., Brandenburg Medical School Theodor FontaneBrandenburg a.d.H.Germany
- Faculty of Health Sciences BrandenburgBrandenburg Medical School Theodor FontaneBrandenburg a.d.H.Germany
| | - Romain Seil
- Department of Orthopaedic Surgery LuxembourgCentre Hospitalier de Luxembourg—Clinique d'EichLuxembourgLuxembourg
- Luxembourg Institute of Research in OrthopaedicsSports Medicine and Science (LIROMS)LuxembourgLuxembourg
- Luxembourg Institute of Health, Human Motion, OrthopaedicsSports Medicine and Digital Methods (HOSD)LuxembourgLuxembourg
| | - Thomas Tischer
- Clinic for Orthopaedics and Trauma SurgeryErlangenGermany
| | - Kristian Samuelsson
- Sahlgrenska Sports Medicine CenterGothenburgSweden
- Department of Orthopaedics, Institute of Clinical Sciences, Sahlgrenska AcademyUniversity of GothenburgGothenburgSweden
- Department of OrthopaedicsSahlgrenska University HospitalMölndalSweden
| | - Robert Feldt
- Department of Computer Science and EngineeringChalmers University of TechnologyGothenburgSweden
| | | |
Collapse
|
24
|
Aldossary A, Campos-Gonzalez-Angulo JA, Pablo-García S, Leong SX, Rajaonson EM, Thiede L, Tom G, Wang A, Avagliano D, Aspuru-Guzik A. In Silico Chemical Experiments in the Age of AI: From Quantum Chemistry to Machine Learning and Back. ADVANCED MATERIALS (DEERFIELD BEACH, FLA.) 2024; 36:e2402369. [PMID: 38794859 DOI: 10.1002/adma.202402369] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/15/2024] [Revised: 04/28/2024] [Indexed: 05/26/2024]
Abstract
Computational chemistry is an indispensable tool for understanding molecules and predicting chemical properties. However, traditional computational methods face significant challenges due to the difficulty of solving the Schrödinger equations and the increasing computational cost with the size of the molecular system. In response, there has been a surge of interest in leveraging artificial intelligence (AI) and machine learning (ML) techniques to in silico experiments. Integrating AI and ML into computational chemistry increases the scalability and speed of the exploration of chemical space. However, challenges remain, particularly regarding the reproducibility and transferability of ML models. This review highlights the evolution of ML in learning from, complementing, or replacing traditional computational chemistry for energy and property predictions. Starting from models trained entirely on numerical data, a journey set forth toward the ideal model incorporating or learning the physical laws of quantum mechanics. This paper also reviews existing computational methods and ML models and their intertwining, outlines a roadmap for future research, and identifies areas for improvement and innovation. Ultimately, the goal is to develop AI architectures capable of predicting accurate and transferable solutions to the Schrödinger equation, thereby revolutionizing in silico experiments within chemistry and materials science.
Collapse
Affiliation(s)
- Abdulrahman Aldossary
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | | | - Sergio Pablo-García
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
| | - Shi Xuan Leong
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Ella Miray Rajaonson
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Luca Thiede
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Gary Tom
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
| | - Andrew Wang
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
| | - Davide Avagliano
- Chimie ParisTech, PSL University, CNRS, Institute of Chemistry for Life and Health Sciences (iCLeHS UMR 8060), Paris, F-75005, France
| | - Alán Aspuru-Guzik
- Department of Chemistry, University of Toronto, 80 St. George Street, Toronto, ON, M5S 3H6, Canada
- Department of Computer Science, University of Toronto, 40 St. George Street, Toronto, ON, M5S 2E4, Canada
- Vector Institute for Artificial Intelligence, 661 University Ave. Suite 710, Toronto, ON, M5G 1M1, Canada
- Department of Materials Science & Engineering, University of Toronto, 184 College St., Toronto, ON, M5S 3E4, Canada
- Department of Chemical Engineering & Applied Chemistry, University of Toronto, 200 College St., Toronto, ON, M5S 3E5, Canada
- Lebovic Fellow, Canadian Institute for Advanced Research (CIFAR), 66118 University Ave., Toronto, M5G 1M1, Canada
- Acceleration Consortium, 80 St George St, Toronto, M5S 3H6, Canada
| |
Collapse
|
25
|
Scherr TF, Douglas CE, Schaecher KE, Schoepp RJ, Ricks KM, Shoemaker CJ. Application of a Machine Learning-Based Classification Approach for Developing Host Protein Diagnostic Models for Infectious Disease. Diagnostics (Basel) 2024; 14:1290. [PMID: 38928705 PMCID: PMC11202442 DOI: 10.3390/diagnostics14121290] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/09/2024] [Revised: 06/11/2024] [Accepted: 06/13/2024] [Indexed: 06/28/2024] Open
Abstract
In recent years, infectious disease diagnosis has increasingly turned to host-centered approaches as a complement to pathogen-directed ones. The former, however, typically requires the interpretation of complex multiple biomarker datasets to arrive at an informative diagnostic outcome. This report describes a machine learning (ML)-based classification workflow that is intended as a template for researchers seeking to apply ML approaches for developing host-based infectious disease biomarker classifiers. As an example, we built a classification model that could accurately distinguish between three disease etiology classes: bacterial, viral, and normal in human sera using host protein biomarkers of known diagnostic utility. After collecting protein data from known disease samples, we trained a series of increasingly complex Auto-ML models until arriving at an optimized classifier that could differentiate viral, bacterial, and non-disease samples. Even when limited to a relatively small training set size, the model had robust diagnostic characteristics and performed well when faced with a blinded sample set. We present here a flexible approach for applying an Auto-ML-based workflow for the identification of host biomarker classifiers with diagnostic utility for infectious disease, and which can readily be adapted for multiple biomarker classes and disease states.
Collapse
Affiliation(s)
| | - Christina E. Douglas
- Diagnostic Systems Division, U.S. Army Medical Research Institute of Infectious Diseases, Fort Detrick, MD 21702, USA (R.J.S.); (K.M.R.)
| | - Kurt E. Schaecher
- Virology Division, U.S. Army Medical Research Institute of Infectious Diseases, Fort Detrick, MD 21702, USA
| | - Randal J. Schoepp
- Diagnostic Systems Division, U.S. Army Medical Research Institute of Infectious Diseases, Fort Detrick, MD 21702, USA (R.J.S.); (K.M.R.)
| | - Keersten M. Ricks
- Diagnostic Systems Division, U.S. Army Medical Research Institute of Infectious Diseases, Fort Detrick, MD 21702, USA (R.J.S.); (K.M.R.)
| | - Charles J. Shoemaker
- Diagnostic Systems Division, U.S. Army Medical Research Institute of Infectious Diseases, Fort Detrick, MD 21702, USA (R.J.S.); (K.M.R.)
| |
Collapse
|
26
|
Yu H. Application of the artificial intelligence system based on graphics and vision in ethnic tourism of subtropical grasslands. Heliyon 2024; 10:e31442. [PMID: 38867958 PMCID: PMC11167259 DOI: 10.1016/j.heliyon.2024.e31442] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 02/21/2024] [Revised: 05/11/2024] [Accepted: 05/15/2024] [Indexed: 06/14/2024] Open
Abstract
This study aims to optimize the evaluation and decision-making of ethnic tourism resources through the utilization of deep learning algorithms and Internet of Things (IoT) technology. Specifically, emphasis is placed on the recognition and feature extraction of Mongolian decorative patterns, providing new insights for the deep application of cultural heritage and visual design. In this study, the existing DL algorithm is improved, integrating the feature extraction algorithm of ResNet + Canny + Local Binary Pattern (LBP), and utilizing an intelligent decision method to analyze the intelligent development of indigenous tourism resources. Simultaneously, the DL algorithm and IoT technology are combined with visual design and convolutional neural networks to perform feature extraction and technology recognition. Visual design offers an intuitive representation of tourism resources, while fuzzy decision-making provides a more accurate evaluation in the face of uncertainty. By implementing an intelligent decision-making system, this study achieves a multiplier effect. The integration of intelligent methods not only enhances the accuracy of tourism resource evaluation and decision-making but also elevates the quality and efficiency of the tourism experience. This multiplier effect is evident in the system's capacity to manage substantial datasets and deliver prompt, precise decision support, thus playing a pivotal role in tourism resource management and planning. The findings of this study demonstrate that optimizing intelligent development technology for rural tourism through IoT can enhance the efficacy of intelligent solutions. In terms of pattern recognition accuracy, AlexNet, VGGNet, and ResNet achieve accuracies of 90.8 %, 94.5 %, and 96.9 %, respectively, while the proposed fusion algorithm attains an accuracy of 98.8 %. These results offer practical insights for rural tourism brand strategy and underscore the utility of applying fuzzy decision systems in urban tourism and visual design. Moreover, the research outcomes hold significant practical implications for the advancement of Mongolian cultural tourism and provide valuable lessons for exploring novel paradigms in image analysis and pattern recognition. This study contributes beneficial insights for future research endeavors in related domains.
Collapse
Affiliation(s)
- Hong Yu
- Academy of fine arts, Inner Mongolia Minzu University, Tongliao Inner Mongolia, 028000, China
| |
Collapse
|
27
|
Schilter O, Gutierrez DP, Folkmann LM, Castrogiovanni A, García-Durán A, Zipoli F, Roch LM, Laino T. Combining Bayesian optimization and automation to simultaneously optimize reaction conditions and routes. Chem Sci 2024; 15:7732-7741. [PMID: 38784737 PMCID: PMC11110165 DOI: 10.1039/d3sc05607d] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 10/20/2023] [Accepted: 04/05/2024] [Indexed: 05/25/2024] Open
Abstract
Reaching optimal reaction conditions is crucial to achieve high yields, minimal by-products, and environmentally sustainable chemical reactions. With the recent rise of artificial intelligence, there has been a shift from traditional Edisonian trial-and-error optimization to data-driven and automated approaches, which offer significant advantages. Here, we showcase the capabilities of an integrated platform; we conducted simultaneous optimizations of four different terminal alkynes and two reaction routes using an automation platform combined with a Bayesian optimization platform. Remarkably, we achieved a conversion rate of over 80% for all four substrates in 23 experiments, covering ca. 0.2% of the combinatorial space. Further analysis allowed us to identify the influence of different reaction parameters on the reaction outcomes, demonstrating the potential for expedited reaction condition optimization and the prospect of more efficient chemical processes in the future.
Collapse
Affiliation(s)
- Oliver Schilter
- IBM Research Europe Säumerstrasse 4 8803 Rüschlikon Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis) Switzerland
| | | | - Linnea M Folkmann
- Atinary Technologies Route de la Corniche 4 1066 Epalinges Switzerland
| | | | | | - Federico Zipoli
- IBM Research Europe Säumerstrasse 4 8803 Rüschlikon Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis) Switzerland
| | - Loïc M Roch
- Atinary Technologies Route de la Corniche 4 1066 Epalinges Switzerland
| | - Teodoro Laino
- IBM Research Europe Säumerstrasse 4 8803 Rüschlikon Switzerland
- National Center for Competence in Research-Catalysis (NCCR-Catalysis) Switzerland
| |
Collapse
|
28
|
Humer C, Nicholls R, Heberle H, Heckmann M, Pühringer M, Wolf T, Lübbesmeyer M, Heinrich J, Hillenbrand J, Volpin G, Streit M. CIME4R: Exploring iterative, AI-guided chemical reaction optimization campaigns in their parameter space. J Cheminform 2024; 16:51. [PMID: 38730469 PMCID: PMC11636728 DOI: 10.1186/s13321-024-00840-1] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/21/2023] [Accepted: 04/05/2024] [Indexed: 05/12/2024] Open
Abstract
Chemical reaction optimization (RO) is an iterative process that results in large, high-dimensional datasets. Current tools allow for only limited analysis and understanding of parameter spaces, making it hard for scientists to review or follow changes throughout the process. With the recent emergence of using artificial intelligence (AI) models to aid RO, another level of complexity has been added. Helping to assess the quality of a model's prediction and understand its decision is critical to supporting human-AI collaboration and trust calibration. To address this, we propose CIME4R-an open-source interactive web application for analyzing RO data and AI predictions. CIME4R supports users in (i) comprehending a reaction parameter space, (ii) investigating how an RO process developed over iterations, (iii) identifying critical factors of a reaction, and (iv) understanding model predictions. This facilitates making informed decisions during the RO process and helps users to review a completed RO process, especially in AI-guided RO. CIME4R aids decision-making through the interaction between humans and AI by combining the strengths of expert experience and high computational precision. We developed and tested CIME4R with domain experts and verified its usefulness in three case studies. Using CIME4R the experts were able to produce valuable insights from past RO campaigns and to make informed decisions on which experiments to perform next. We believe that CIME4R is the beginning of an open-source community project with the potential to improve the workflow of scientists working in the reaction optimization domain. SCIENTIFIC CONTRIBUTION: To the best of our knowledge, CIME4R is the first open-source interactive web application tailored to the peculiar analysis requirements of reaction optimization (RO) campaigns. Due to the growing use of AI in RO, we developed CIME4R with a special focus on facilitating human-AI collaboration and understanding of AI models. We developed and evaluated CIME4R in collaboration with domain experts to verify its practical usefulness.
Collapse
Affiliation(s)
| | - Rachel Nicholls
- Division Crop Science, Bayer AG, Monheim am Rhein, 40789, Germany
| | - Henry Heberle
- Division Crop Science, Bayer AG, Monheim am Rhein, 40789, Germany
| | | | | | - Thomas Wolf
- Division Crop Science, Bayer AG, Frankfurt, 65926, Germany
| | | | - Julian Heinrich
- Division Crop Science, Bayer AG, Monheim am Rhein, 40789, Germany
| | | | - Giulio Volpin
- Division Crop Science, Bayer AG, Frankfurt, 65926, Germany.
| | - Marc Streit
- Johannes Kepler University Linz, Linz, 4040, Austria.
- datavisyn GmbH, Linz, 4040, Austria.
| |
Collapse
|
29
|
Enquist BJ, Kempes CP, West GB. Developing a predictive science of the biosphere requires the integration of scientific cultures. Proc Natl Acad Sci U S A 2024; 121:e2209196121. [PMID: 38640256 PMCID: PMC11087787 DOI: 10.1073/pnas.2209196121] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 04/21/2024] Open
Abstract
Increasing the speed of scientific progress is urgently needed to address the many challenges associated with the biosphere in the Anthropocene. Consequently, the critical question becomes: How can science most rapidly progress to address large, complex global problems? We suggest that the lag in the development of a more predictive science of the biosphere is not only because the biosphere is so much more complex, or because we do not have enough data, or are not doing enough experiments, but, in large part, because of unresolved tension between the three dominant scientific cultures that pervade the research community. We introduce and explain the concept of the three scientific cultures and present a novel analysis of their characteristics, supported by examples and a formal mathematical definition/representation of what this means and implies. The three cultures operate, to varying degrees, across all of science. However, within the biosciences, and in contrast to some of the other sciences, they remain relatively more separated, and their lack of integration has hindered their potential power and insight. Our solution to accelerating a broader, predictive science of the biosphere is to enhance integration of scientific cultures. The process of integration-Scientific Transculturalism-recognizes that the push for interdisciplinary research, in general, is just not enough. Unless these cultures of science are formally appreciated and their thinking iteratively integrated into scientific discovery and advancement, there will continue to be numerous significant challenges that will increasingly limit forecasting and prediction efforts.
Collapse
Affiliation(s)
- Brian J. Enquist
- Department of Ecology & Evolutionary Biology, University of Arizona, Tucson, AZ85721
- The Santa Fe Institute, Santa Fe, NM87501
| | | | | |
Collapse
|
30
|
Helal H, Firoz J, Bilbrey JA, Sprueill H, Herman KM, Krell MM, Murray T, Roldan ML, Kraus M, Li A, Das P, Xantheas SS, Choudhury S. Acceleration of Graph Neural Network-Based Prediction Models in Chemistry via Co-Design Optimization on Intelligence Processing Units. J Chem Inf Model 2024; 64:1568-1580. [PMID: 38382011 DOI: 10.1021/acs.jcim.3c01312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Indexed: 02/23/2024]
Abstract
Atomic structure prediction and associated property calculations are the bedrock of chemical physics. Since high-fidelity ab initio modeling techniques for computing the structure and properties can be prohibitively expensive, this motivates the development of machine-learning (ML) models that make these predictions more efficiently. Training graph neural networks over large atomistic databases introduces unique computational challenges, such as the need to process millions of small graphs with variable size and support communication patterns that are distinct from learning over large graphs, such as social networks. We demonstrate a novel hardware-software codesign approach to scale up the training of atomistic graph neural networks (GNN) for structure and property prediction. First, to eliminate redundant computation and memory associated with alternative padding techniques and to improve throughput via minimizing communication, we formulate the effective coalescing of the batches of variable-size atomistic graphs as the bin packing problem and introduce a hardware-agnostic algorithm to pack these batches. In addition, we propose hardware-specific optimizations, including a planner and vectorization for the gather-scatter operations targeted for Graphcore's Intelligence Processing Unit (IPU), as well as model-specific optimizations such as merged communication collectives and optimized softplus. Putting these all together, we demonstrate the effectiveness of the proposed codesign approach by providing an implementation of a well-established atomistic GNN on the Graphcore IPUs. We evaluate the training performance on multiple atomistic graph databases with varying degrees of graph counts, sizes, and sparsity. We demonstrate that such a codesign approach can reduce the training time of atomistic GNNs and can improve their performance by up to 1.5× compared to the baseline implementation of the model on the IPUs. Additionally, we compare our IPU implementation with a Nvidia GPU-based implementation and show that our atomistic GNN implementation on the IPUs can run 1.8× faster on average compared to the execution time on the GPUs.
Collapse
Affiliation(s)
- Hatem Helal
- Graphcore, Kett House, Station Rd, Cambridge CB1 2JH, U.K
| | - Jesun Firoz
- Advanced Computing, Mathematics and Data Division, Pacific Northwest National Laboratory, 1100 Dexter Ave N, Seattle, Washington 98109, United States
| | - Jenna A Bilbrey
- Artificial Intelligence and Data Analytics Division, Pacific Northwest National Laboratory, 902 Battelle Boulevard, Richland, Washington 99352, United States
| | - Henry Sprueill
- Artificial Intelligence and Data Analytics Division, Pacific Northwest National Laboratory, 902 Battelle Boulevard, Richland, Washington 99352, United States
| | - Kristina M Herman
- Department of Chemistry, University of Washington, Seattle, Washington 98185, United States
| | | | - Tom Murray
- Graphcore, Kett House, Station Rd, Cambridge CB1 2JH, U.K
| | | | - Mike Kraus
- Graphcore, Kett House, Station Rd, Cambridge CB1 2JH, U.K
| | - Ang Li
- Advanced Computing, Mathematics and Data Division, Pacific Northwest National Laboratory, 902 Battelle Boulevard, Richland, Washington 99352, United States
| | - Payel Das
- IBM Research, Yorktown Heights, New York 10598, United States
| | - Sotiris S Xantheas
- Department of Chemistry, University of Washington, Seattle, Washington 98185, United States
- Advanced Computing, Mathematics and Data Division, Pacific Northwest National Laboratory, 902 Battelle Boulevard, Richland, Washington 99352, United States
| | - Sutanay Choudhury
- Advanced Computing, Mathematics and Data Division, Pacific Northwest National Laboratory, 902 Battelle Boulevard, Richland, Washington 99352, United States
| |
Collapse
|
31
|
Messeri L, Crockett MJ. Artificial intelligence and illusions of understanding in scientific research. Nature 2024; 627:49-58. [PMID: 38448693 DOI: 10.1038/s41586-024-07146-0] [Citation(s) in RCA: 49] [Impact Index Per Article: 49.0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/31/2023] [Accepted: 01/31/2024] [Indexed: 03/08/2024]
Abstract
Scientists are enthusiastically imagining ways in which artificial intelligence (AI) tools might improve research. Why are AI tools so attractive and what are the risks of implementing them across the research pipeline? Here we develop a taxonomy of scientists' visions for AI, observing that their appeal comes from promises to improve productivity and objectivity by overcoming human shortcomings. But proposed AI solutions can also exploit our cognitive limitations, making us vulnerable to illusions of understanding in which we believe we understand more about the world than we actually do. Such illusions obscure the scientific community's ability to see the formation of scientific monocultures, in which some types of methods, questions and viewpoints come to dominate alternative approaches, making science less innovative and more vulnerable to errors. The proliferation of AI tools in science risks introducing a phase of scientific enquiry in which we produce more but understand less. By analysing the appeal of these tools, we provide a framework for advancing discussions of responsible knowledge production in the age of AI.
Collapse
Affiliation(s)
- Lisa Messeri
- Department of Anthropology, Yale University, New Haven, CT, USA.
| | - M J Crockett
- Department of Psychology, Princeton University, Princeton, NJ, USA.
- University Center for Human Values, Princeton University, Princeton, NJ, USA.
| |
Collapse
|
32
|
Back S, Aspuru-Guzik A, Ceriotti M, Gryn'ova G, Grzybowski B, Gu GH, Hein J, Hippalgaonkar K, Hormázabal R, Jung Y, Kim S, Kim WY, Moosavi SM, Noh J, Park C, Schrier J, Schwaller P, Tsuda K, Vegge T, von Lilienfeld OA, Walsh A. Accelerated chemical science with AI. DIGITAL DISCOVERY 2024; 3:23-33. [PMID: 38239898 PMCID: PMC10793638 DOI: 10.1039/d3dd00213f] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 10/25/2023] [Accepted: 12/06/2023] [Indexed: 01/22/2024]
Abstract
In light of the pressing need for practical materials and molecular solutions to renewable energy and health problems, to name just two examples, one wonders how to accelerate research and development in the chemical sciences, so as to address the time it takes to bring materials from initial discovery to commercialization. Artificial intelligence (AI)-based techniques, in particular, are having a transformative and accelerating impact on many if not most, technological domains. To shed light on these questions, the authors and participants gathered in person for the ASLLA Symposium on the theme of 'Accelerated Chemical Science with AI' at Gangneung, Republic of Korea. We present the findings, ideas, comments, and often contentious opinions expressed during four panel discussions related to the respective general topics: 'Data', 'New applications', 'Machine learning algorithms', and 'Education'. All discussions were recorded, transcribed into text using Open AI's Whisper, and summarized using LG AI Research's EXAONE LLM, followed by revision by all authors. For the broader benefit of current researchers, educators in higher education, and academic bodies such as associations, publishers, librarians, and companies, we provide chemistry-specific recommendations and summarize the resulting conclusions.
Collapse
Affiliation(s)
- Seoin Back
- Department of Chemical and Biomolecular Engineering, Institute of Emergent Materials, Sogang University Seoul Republic of Korea
| | - Alán Aspuru-Guzik
- Departments of Chemistry, Computer Science, University of Toronto St. George Campus Toronto ON Canada
- Acceleration Consortium and Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
| | - Michele Ceriotti
- Laboratory of Computational Science and Modeling (COSMO), École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Ganna Gryn'ova
- Heidelberg Institute for Theoretical Studies (HITS gGmbH) 69118 Heidelberg Germany
- Interdisciplinary Center for Scientific Computing, Heidelberg University 69120 Heidelberg Germany
| | - Bartosz Grzybowski
- Center for Algorithmic and Robotized Synthesis (CARS), Institute for Basic Science (IBS) Ulsan Republic of Korea
- Institute of Organic Chemistry, Polish Academy of Sciences Warsaw Poland
- Department of Chemistry, Ulsan National Institute of Science and Technology Ulsan Republic of Korea
| | - Geun Ho Gu
- Department of Energy Engineering, Korea Institute of Energy Technology (KENTECH) Naju 58330 Republic of Korea
| | - Jason Hein
- Department of Chemistry, University of British Columbia Vancouver BC V6T 1Z1 Canada
| | - Kedar Hippalgaonkar
- School of Materials Science and Engineering, Nanyang Technological University 50 Nanyang Avenue Singapore 639798 Singapore
- Institute of Materials Research and Engineering, Agency for Science Technology and Research 2 Fusionopolis Way, 08-03 Singapore 138634 Singapore
| | | | - Yousung Jung
- Department of Chemical and Biomolecular Engineering, KAIST Daejeon Republic of Korea
- School of Chemical and Biological Engineering, Interdisciplinary Program in Artificial Intelligence, Seoul National University 1 Gwanak-ro, Gwanak-gu Seoul 08826 Republic of Korea
| | - Seonah Kim
- Department of Chemistry, Colorado State University 1301 Center Avenue Fort Collins CO 80523 USA
| | - Woo Youn Kim
- Department of Chemistry, KAIST Daejeon Republic of Korea
| | - Seyed Mohamad Moosavi
- Chemical Engineering & Applied Chemistry, University of Toronto Toronto Ontario M5S 3E5 Canada
| | - Juhwan Noh
- Chemical Data-Driven Research Center, Korea Research Institute of Chemical Technology Daejeon 34114 Republic of Korea
| | | | - Joshua Schrier
- Department of Chemistry, Fordham University The Bronx NY 10458 USA
| | - Philippe Schwaller
- Laboratory of Artificial Chemical Intelligence (LIAC) & National Centre of Competence in Research (NCCR) Catalysis, École Polytechnique Fédérale de Lausanne Lausanne Switzerland
| | - Koji Tsuda
- Graduate School of Frontier Sciences, The University of Tokyo Kashiwa Chiba 277-8561 Japan
- Center for Basic Research on Materials, National Institute for Materials Science Tsukuba Ibaraki 305-0044 Japan
- RIKEN Center for Advanced Intelligence Project Tokyo 103-0027 Japan
| | - Tejs Vegge
- Department of Energy Conversion and Storage, Technical University of Denmark 301 Anker Engelunds vej, Kongens Lyngby Copenhagen 2800 Denmark
| | - O Anatole von Lilienfeld
- Acceleration Consortium and Vector Institute for Artificial Intelligence Toronto ON M5S 1M1 Canada
- Departments of Chemistry, Materials Science and Engineering, and Physics, University of Toronto, St George Campus Toronto ON Canada
- Machine Learning Group, Technische Universität Berlin and Berlin Institute for the Foundations of Learning and Data 10587 Berlin Germany
| | - Aron Walsh
- Department of Materials, Imperial College London London SW7 2AZ UK
- Department of Physics, Ewha Women's University Seoul Republic of Korea
| |
Collapse
|
33
|
Sadeghi S, Bateni F, Kim T, Son DY, Bennett JA, Orouji N, Punati VS, Stark C, Cerra TD, Awad R, Delgado-Licona F, Xu J, Mukhin N, Dickerson H, Reyes KG, Abolhasani M. Autonomous nanomanufacturing of lead-free metal halide perovskite nanocrystals using a self-driving fluidic lab. NANOSCALE 2024; 16:580-591. [PMID: 38116636 DOI: 10.1039/d3nr05034c] [Citation(s) in RCA: 3] [Impact Index Per Article: 3.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Indexed: 12/21/2023]
Abstract
Lead-based metal halide perovskite (MHP) nanocrystals (NCs) have emerged as a promising class of semiconducting nanomaterials for a wide range of optoelectronic and photoelectronic applications. However, the intrinsic lead toxicity of MHP NCs has significantly hampered their large-scale device applications. Copper-base MHP NCs with composition-tunable optical properties have emerged as a prominent lead-free MHP NC candidate. However, comprehensive synthesis space exploration, development, and synthesis science studies of copper-based MHP NCs have been limited by the manual nature of flask-based synthesis and characterization methods. In this study, we present an autonomous approach for the development of lead-free MHP NCs via seamless integration of a modular microfluidic platform with machine learning-assisted NC synthesis modeling and experiment selection to establish a self-driving fluidic lab for accelerated NC synthesis science studies. For the first time, a successful and reproducible in-flow synthesis of Cs3Cu2I5 NCs is presented. Autonomous experimentation is then employed for rapid in-flow synthesis science studies of Cs3Cu2I5 NCs. The autonomously generated experimental NC synthesis dataset is then utilized for fast-tracked synthetic route optimization of high-performing Cs3Cu2I5 NCs.
Collapse
Affiliation(s)
- Sina Sadeghi
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Fazel Bateni
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Taekhoon Kim
- Synthesis Technical Unit, Material Research Center, Samsung Advanced Institute of Technology, SEC, 130, Samsung-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Dae Yong Son
- Synthesis Technical Unit, Material Research Center, Samsung Advanced Institute of Technology, SEC, 130, Samsung-ro, Yeongtong-gu, Suwon-si, Gyeonggi-do, Republic of Korea
| | - Jeffrey A Bennett
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Negin Orouji
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Venkat S Punati
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Christine Stark
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Teagan D Cerra
- Department of Physics, Weber State University, Ogden, UT 84408, USA
| | - Rami Awad
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Fernando Delgado-Licona
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Jinge Xu
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Nikolai Mukhin
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Hannah Dickerson
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| | - Kristofer G Reyes
- Department of Materials Design and Innovation, University at Buffalo, Buffalo, NY 14260, USA
| | - Milad Abolhasani
- Department of Chemical and Biomolecular Engineering, North Carolina State University, Raleigh, NC 27695, USA.
| |
Collapse
|
34
|
Brown OR, Hullender DA. Darwinian evolution has become dogma; AI can rescue what is salvageable. PROGRESS IN BIOPHYSICS AND MOLECULAR BIOLOGY 2024; 186:53-56. [PMID: 38145808 DOI: 10.1016/j.pbiomolbio.2023.12.001] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [MESH Headings] [Track Full Text] [Subscribe] [Scholar Register] [Received: 07/06/2023] [Revised: 09/06/2023] [Accepted: 12/22/2023] [Indexed: 12/27/2023]
Abstract
Artificial Intelligence (AI), as an academic discipline, is traceable to the mid-1950s but it is currently exploding in applications with successes and concerns. AI can be defined as intelligence demonstrated by computers, with intelligence difficult to define but it must include concepts of ability to learn, reason, and generalize from a vast amount of information and, we propose, to infer meaning. The type of AI known as general AI, has strong, but unrealized potential both for assessing and also for solving major problems with the scientific theory of Darwinian evolution, including its modern variants and for origin of life studies. Specifically, AI should be applied first to evaluate the strengths and weaknesses of the assumptions and empirical information underpinning theories of the origin of life and probability of its evolution. AI should then be applied to assess the scientific validity of the theory of how abundant life came to be on earth.
Collapse
Affiliation(s)
- Olen R Brown
- Emeritus of Biomedical Sciences, at the University of Missouri, Columbia, MO, USA.
| | - David A Hullender
- Mechanical and Aerospace Engineering at the University of Texas at Arlington, USA
| |
Collapse
|
35
|
Li K, Persaud D, Choudhary K, DeCost B, Greenwood M, Hattrick-Simpers J. Exploiting redundancy in large materials datasets for efficient machine learning with less data. Nat Commun 2023; 14:7283. [PMID: 37949845 PMCID: PMC10638383 DOI: 10.1038/s41467-023-42992-y] [Citation(s) in RCA: 11] [Impact Index Per Article: 5.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/30/2023] [Accepted: 10/26/2023] [Indexed: 11/12/2023] Open
Abstract
Extensive efforts to gather materials data have largely overlooked potential data redundancy. In this study, we present evidence of a significant degree of redundancy across multiple large datasets for various material properties, by revealing that up to 95% of data can be safely removed from machine learning training with little impact on in-distribution prediction performance. The redundant data is related to over-represented material types and does not mitigate the severe performance degradation on out-of-distribution samples. In addition, we show that uncertainty-based active learning algorithms can construct much smaller but equally informative datasets. We discuss the effectiveness of informative data in improving prediction performance and robustness and provide insights into efficient data acquisition and machine learning training. This work challenges the "bigger is better" mentality and calls for attention to the information richness of materials data rather than a narrow emphasis on data volume.
Collapse
Affiliation(s)
- Kangming Li
- Department of Materials Science and Engineering, University of Toronto, 27 King's College Cir, Toronto, ON, Canada
| | - Daniel Persaud
- Department of Materials Science and Engineering, University of Toronto, 27 King's College Cir, Toronto, ON, Canada
| | - Kamal Choudhary
- Material Measurement Laboratory, National Institute of Standards and Technology, 100 Bureau Dr, Gaithersburg, MD, USA
| | - Brian DeCost
- Material Measurement Laboratory, National Institute of Standards and Technology, 100 Bureau Dr, Gaithersburg, MD, USA
| | - Michael Greenwood
- Canmet MATERIALS, Natural Resources Canada, 183 Longwood Road south, Hamilton, ON, Canada
| | - Jason Hattrick-Simpers
- Department of Materials Science and Engineering, University of Toronto, 27 King's College Cir, Toronto, ON, Canada.
- Acceleration Consortium, University of Toronto, 27 King's College Cir, Toronto, ON, Canada.
- Vector Institute for Artificial Intelligence, 661 University Ave, Toronto, ON, Canada.
- Schwartz Reisman Institute for Technology and Society, 101 College St, Toronto, ON, Canada.
| |
Collapse
|
36
|
Qian X, Yoon BJ, Arróyave R, Qian X, Dougherty ER. Knowledge-driven learning, optimization, and experimental design under uncertainty for materials discovery. PATTERNS (NEW YORK, N.Y.) 2023; 4:100863. [PMID: 38035192 PMCID: PMC10682757 DOI: 10.1016/j.patter.2023.100863] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Indexed: 12/02/2023]
Abstract
Significant acceleration of the future discovery of novel functional materials requires a fundamental shift from the current materials discovery practice, which is heavily dependent on trial-and-error campaigns and high-throughput screening, to one that builds on knowledge-driven advanced informatics techniques enabled by the latest advances in signal processing and machine learning. In this review, we discuss the major research issues that need to be addressed to expedite this transformation along with the salient challenges involved. We especially focus on Bayesian signal processing and machine learning schemes that are uncertainty aware and physics informed for knowledge-driven learning, robust optimization, and efficient objective-driven experimental design.
Collapse
Affiliation(s)
- Xiaoning Qian
- Department of Electrical & Computer Engineering, Texas A&M University, College Station, TX 77843, USA
- Computational Science Initiative, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Byung-Jun Yoon
- Department of Electrical & Computer Engineering, Texas A&M University, College Station, TX 77843, USA
- Computational Science Initiative, Brookhaven National Laboratory, Upton, NY 11973, USA
| | - Raymundo Arróyave
- Department of Materials Science & Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Xiaofeng Qian
- Department of Materials Science & Engineering, Texas A&M University, College Station, TX 77843, USA
| | - Edward R. Dougherty
- Department of Electrical & Computer Engineering, Texas A&M University, College Station, TX 77843, USA
| |
Collapse
|
37
|
Williams JJ, Tractenberg RE, Batut B, Becker EA, Brown AM, Burke ML, Busby B, Cooch NK, Dillman AA, Donovan SS, Doyle MA, van Gelder CWG, Hall CR, Hertweck KL, Jordan KL, Jungck JR, Latour AR, Lindvall JM, Lloret-Llinares M, McDowell GS, Morris R, Mourad T, Nisselle A, Ordóñez P, Paladin L, Palagi PM, Sukhai MA, Teal TK, Woodley L. An international consensus on effective, inclusive, and career-spanning short-format training in the life sciences and beyond. PLoS One 2023; 18:e0293879. [PMID: 37943810 PMCID: PMC10635508 DOI: 10.1371/journal.pone.0293879] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [MESH Headings] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 07/14/2023] [Accepted: 10/23/2023] [Indexed: 11/12/2023] Open
Abstract
Science, technology, engineering, mathematics, and medicine (STEMM) fields change rapidly and are increasingly interdisciplinary. Commonly, STEMM practitioners use short-format training (SFT) such as workshops and short courses for upskilling and reskilling, but unaddressed challenges limit SFT's effectiveness and inclusiveness. Education researchers, students in SFT courses, and organizations have called for research and strategies that can strengthen SFT in terms of effectiveness, inclusiveness, and accessibility across multiple dimensions. This paper describes the project that resulted in a consensus set of 14 actionable recommendations to systematically strengthen SFT. A diverse international group of 30 experts in education, accessibility, and life sciences came together from 10 countries to develop recommendations that can help strengthen SFT globally. Participants, including representation from some of the largest life science training programs globally, assembled findings in the educational sciences and encompassed the experiences of several of the largest life science SFT programs. The 14 recommendations were derived through a Delphi method, where consensus was achieved in real time as the group completed a series of meetings and tasks designed to elicit specific recommendations. Recommendations cover the breadth of SFT contexts and stakeholder groups and include actions for instructors (e.g., make equity and inclusion an ethical obligation), programs (e.g., centralize infrastructure for assessment and evaluation), as well as organizations and funders (e.g., professionalize training SFT instructors; deploy SFT to counter inequity). Recommendations are aligned with a purpose-built framework-"The Bicycle Principles"-that prioritizes evidenced-based teaching, inclusiveness, and equity, as well as the ability to scale, share, and sustain SFT. We also describe how the Bicycle Principles and recommendations are consistent with educational change theories and can overcome systemic barriers to delivering consistently effective, inclusive, and career-spanning SFT.
Collapse
Affiliation(s)
- Jason J. Williams
- DNA Learning Center, Cold Spring Harbor Laboratory, Cold Spring Harbor, New York, United States of America
| | - Rochelle E. Tractenberg
- Collaborative for Research on Outcomes and Metrics, Georgetown University, Washington, DC, United States of America
| | - Bérénice Batut
- Albert-Ludwigs-University Freiburg, Freiburg, Germany
- Open Life Science, Freiburg, Germany
| | | | - Anne M. Brown
- Virginia Tech, Blacksburg, Virginia, United States of America
| | - Melissa L. Burke
- Australian BioCommons, North Melbourne, Australia
- Queensland Cyber Infrastructure Foundation, Research Computing Centre
- The University of Queensland
| | - Ben Busby
- DNAnexus, Mountain View, California, United States of America
| | | | | | | | | | | | - Christina R. Hall
- Australian BioCommons, North Melbourne, Australia
- University of Melbourne, Melbourne, Australia
| | - Kate L. Hertweck
- Chan Zuckerberg Initiative, Redwood City, California, United States of America
| | | | - John R. Jungck
- University of Delaware, Newark, DE, United States of America
| | | | | | - Marta Lloret-Llinares
- European Molecular Biology Laboratory, European Bioinformatics Institute, Cambridge, United Kingdom
| | - Gary S. McDowell
- Lightoller LLC
- The Ronin Institute, Montclair, NJ, United States of America
- Institute for Globally Distributed Open Research and Education
| | - Rana Morris
- National Center for Biotechnology Information, National Library of Medicine, National Institutes of Health
| | - Teresa Mourad
- Ecological Society of America, Washington, DC, United States of America
| | - Amy Nisselle
- Murdoch Children’s Research Institute, Melbourne, Australia
- Melbourne Genomics, The University of Melbourne, Melbourne, Australia
| | - Patricia Ordóñez
- University of Maryland Baltimore County, Catonsville, Maryland, United States of America
| | - Lisanna Paladin
- European Molecular Biology Laboratory, Structural and Computational Biology Unit, Heidelberg, Germany
| | | | - Mahadeo A. Sukhai
- Canadian National Institute for the Blind, Toronto, Canada
- Queen’s University School of Medicine, Kingston, Canada
| | - Tracy K. Teal
- Posit, PBC, Boston, Massachusetts, United States of America
| | - Louise Woodley
- Center for Scientific Collaboration and Community Engagement, Oakland, California, United States of America
| |
Collapse
|
38
|
Korolev V, Protsenko P. Accurate, interpretable predictions of materials properties within transformer language models. PATTERNS (NEW YORK, N.Y.) 2023; 4:100803. [PMID: 37876904 PMCID: PMC10591138 DOI: 10.1016/j.patter.2023.100803] [Citation(s) in RCA: 4] [Impact Index Per Article: 2.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 03/28/2023] [Revised: 06/06/2023] [Accepted: 07/04/2023] [Indexed: 10/26/2023]
Abstract
Property prediction accuracy has long been a key parameter of machine learning in materials informatics. Accordingly, advanced models showing state-of-the-art performance turn into highly parameterized black boxes missing interpretability. Here, we present an elegant way to make their reasoning transparent. Human-readable text-based descriptions automatically generated within a suite of open-source tools are proposed as materials representation. Transformer language models pretrained on 2 million peer-reviewed articles take as input well-known terms such as chemical composition, crystal symmetry, and site geometry. Our approach outperforms crystal graph networks by classifying four out of five analyzed properties if one considers all available reference data. Moreover, fine-tuned text-based models show high accuracy in the ultra-small data limit. Explanations of their internal machinery are produced using local interpretability techniques and are faithful and consistent with domain expert rationales. This language-centric framework makes accurate property predictions accessible to people without artificial-intelligence expertise.
Collapse
Affiliation(s)
- Vadim Korolev
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| | - Pavel Protsenko
- Department of Chemistry, Lomonosov Moscow State University, 119991 Moscow, Russia
| |
Collapse
|
39
|
Jablonka KM, Ai Q, Al-Feghali A, Badhwar S, Bocarsly JD, Bran AM, Bringuier S, Brinson LC, Choudhary K, Circi D, Cox S, de Jong WA, Evans ML, Gastellu N, Genzling J, Gil MV, Gupta AK, Hong Z, Imran A, Kruschwitz S, Labarre A, Lála J, Liu T, Ma S, Majumdar S, Merz GW, Moitessier N, Moubarak E, Mouriño B, Pelkie B, Pieler M, Ramos MC, Ranković B, Rodriques SG, Sanders JN, Schwaller P, Schwarting M, Shi J, Smit B, Smith BE, Van Herck J, Völker C, Ward L, Warren S, Weiser B, Zhang S, Zhang X, Zia GA, Scourtas A, Schmidt KJ, Foster I, White AD, Blaiszik B. 14 examples of how LLMs can transform materials science and chemistry: a reflection on a large language model hackathon. DIGITAL DISCOVERY 2023; 2:1233-1250. [PMID: 38013906 PMCID: PMC10561547 DOI: 10.1039/d3dd00113j] [Citation(s) in RCA: 15] [Impact Index Per Article: 7.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 06/12/2023] [Accepted: 08/08/2023] [Indexed: 11/04/2023]
Abstract
Large-language models (LLMs) such as GPT-4 caught the interest of many scientists. Recent studies suggested that these models could be useful in chemistry and materials science. To explore these possibilities, we organized a hackathon. This article chronicles the projects built as part of this hackathon. Participants employed LLMs for various applications, including predicting properties of molecules and materials, designing novel interfaces for tools, extracting knowledge from unstructured data, and developing new educational applications. The diverse topics and the fact that working prototypes could be generated in less than two days highlight that LLMs will profoundly impact the future of our fields. The rich collection of ideas and projects also indicates that the applications of LLMs are not limited to materials science and chemistry but offer potential benefits to a wide range of scientific disciplines.
Collapse
Affiliation(s)
- Kevin Maik Jablonka
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Sion Valais Switzerland
| | - Qianxiang Ai
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge Massachusetts 02139 USA
| | | | | | - Joshua D Bocarsly
- Yusuf Hamied Department of Chemistry, University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| | - Andres M Bran
- Laboratory of Artificial Chemical Intelligence (LIAC), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Lausanne Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, Ecole Polytechnique Fédérale de Lausanne (EPFL) Lausanne Switzerland
| | | | | | - Kamal Choudhary
- Material Measurement Laboratory, National Institute of Standards and Technology Maryland 20899 USA
| | - Defne Circi
- Mechanical Engineering and Materials Science, Duke University USA
| | - Sam Cox
- Department of Chemical Engineering, University of Rochester USA
| | - Wibe A de Jong
- Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory Berkeley CA 94720 USA
| | - Matthew L Evans
- Institut de la Matière Condensée et des Nanosciences (IMCN), UCLouvain Chemin des Étoiles 8 Louvain-la-Neuve 1348 Belgium
- Matgenix SRL 185 Rue Armand Bury 6534 Gozée Belgium
| | - Nicolas Gastellu
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - Jerome Genzling
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - María Victoria Gil
- Instituto de Ciencia y Tecnología del Carbono (INCAR), CSIC Francisco Pintado Fe 26 33011 Oviedo Spain
| | - Ankur K Gupta
- Applied Mathematics and Computational Research Division, Lawrence Berkeley National Laboratory Berkeley CA 94720 USA
| | - Zhi Hong
- Department of Computer Science, University of Chicago Chicago Illinois 60637 USA
| | - Alishba Imran
- Computer Science, University of California Berkeley CA 94704 USA
| | - Sabine Kruschwitz
- Bundesanstalt für Materialforschung und -prüfung Unter den Eichen 87 12205 Berlin Germany
| | - Anne Labarre
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - Jakub Lála
- Francis Crick Institute 1 Midland Rd London NW1 1AT UK
| | - Tao Liu
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - Steven Ma
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - Sauradeep Majumdar
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Sion Valais Switzerland
| | - Garrett W Merz
- American Family Insurance Data Science Institute, University of Wisconsin-Madison Madison WI 53706 USA
| | | | - Elias Moubarak
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Sion Valais Switzerland
| | - Beatriz Mouriño
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Sion Valais Switzerland
| | - Brenden Pelkie
- Department of Chemical Engineering, University of Washington Seattle WA 98105 USA
| | | | | | - Bojana Ranković
- Laboratory of Artificial Chemical Intelligence (LIAC), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Lausanne Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, Ecole Polytechnique Fédérale de Lausanne (EPFL) Lausanne Switzerland
| | | | - Jacob N Sanders
- Department of Chemistry and Biochemistry, University of California Los Angeles CA 90095 USA
| | - Philippe Schwaller
- Laboratory of Artificial Chemical Intelligence (LIAC), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Lausanne Switzerland
- National Centre of Competence in Research (NCCR) Catalysis, Ecole Polytechnique Fédérale de Lausanne (EPFL) Lausanne Switzerland
| | - Marcus Schwarting
- Department of Computer Science, University of Chicago Chicago IL 60490 USA
| | - Jiale Shi
- Department of Chemical Engineering, Massachusetts Institute of Technology Cambridge Massachusetts 02139 USA
| | - Berend Smit
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Sion Valais Switzerland
| | - Ben E Smith
- Yusuf Hamied Department of Chemistry, University of Cambridge Lensfield Road Cambridge CB2 1EW UK
| | - Joren Van Herck
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Sion Valais Switzerland
| | - Christoph Völker
- Bundesanstalt für Materialforschung und -prüfung Unter den Eichen 87 12205 Berlin Germany
| | - Logan Ward
- Data Science and Learning Division, Argonne National Lab USA
| | - Sean Warren
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - Benjamin Weiser
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - Sylvester Zhang
- Department of Chemistry, McGill University Montreal Quebec Canada
| | - Xiaoqi Zhang
- Laboratory of Molecular Simulation (LSMO), Institut des Sciences et Ingénierie Chimiques, Ecole Polytechnique Fédérale de Lausanne (EPFL) Sion Valais Switzerland
| | - Ghezal Ahmad Zia
- Bundesanstalt für Materialforschung und -prüfung Unter den Eichen 87 12205 Berlin Germany
| | - Aristana Scourtas
- Globus, University of Chicago, Data Science and Learning Division, Argonne National Lab USA
| | - K J Schmidt
- Globus, University of Chicago, Data Science and Learning Division, Argonne National Lab USA
| | - Ian Foster
- Department of Computer Science, University of Chicago, Data Science and Learning Division, Argonne National Lab USA
| | - Andrew D White
- Department of Chemical Engineering, University of Rochester USA
| | - Ben Blaiszik
- Globus, University of Chicago, Data Science and Learning Division, Argonne National Lab USA
| |
Collapse
|
40
|
Buslón N, Cortés A, Catuara-Solarz S, Cirillo D, Rementeria MJ. Raising awareness of sex and gender bias in artificial intelligence and health. Front Glob Womens Health 2023; 4:970312. [PMID: 37746321 PMCID: PMC10512182 DOI: 10.3389/fgwh.2023.970312] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 06/15/2022] [Accepted: 08/18/2023] [Indexed: 09/26/2023] Open
Abstract
Historically, biomedical research has been led by and focused on men. The recent introduction of Artificial Intelligence (AI) in this area has further proven this practice to be discriminatory for other sexes and genders, more noticeably for women. To move towards a fair AI development, it is essential to include sex and gender diversity both in research practices and in the workplace. In this context, the Bioinfo4women (B4W) program of the Barcelona Supercomputing Center (i) promotes the participation of women scientists by improving their visibility, (ii) fosters international collaborations between institutions and programs and (iii) advances research on sex and gender bias in AI and health. In this article, we discuss methodology and results of a series of conferences, titled “Sex and Gender Bias in Artificial Intelligence and Health, organized by B4W and La Caixa Foundation from March to June 2021 in Barcelona, Spain. The series consisted of nine hybrid events, composed of keynote sessions and seminars open to the general audience, and two working groups with invited experts from different professional backgrounds (academic fields such as biology, engineering, and sociology, as well as NGOs, journalists, lawyers, policymakers, industry). Based on this awareness-raising action, we distilled key recommendations to facilitate the inclusion of sex and gender perspective into public policies, educational programs, industry, and biomedical research, among other sectors, and help overcome sex and gender biases in AI and health.
Collapse
Affiliation(s)
- Nataly Buslón
- Life Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
| | - Atia Cortés
- Life Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
| | | | - Davide Cirillo
- Life Sciences Department, Barcelona Supercomputing Center, Barcelona, Spain
| | | |
Collapse
|
41
|
Seifermann M, Reiser P, Friederich P, Levkin PA. High-Throughput Synthesis and Machine Learning Assisted Design of Photodegradable Hydrogels. SMALL METHODS 2023; 7:e2300553. [PMID: 37287430 DOI: 10.1002/smtd.202300553] [Citation(s) in RCA: 3] [Impact Index Per Article: 1.5] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Indexed: 06/09/2023]
Abstract
Due to the large chemical space, the design of functional and responsive soft materials poses many challenges but also offers a wide range of opportunities in terms of the scope of possible properties. Herein, an experimental workflow for miniaturized combinatorial high-throughput screening of functional hydrogel libraries is reported. The data created from the analysis of the photodegradation process of more than 900 different types of hydrogel pads are used to train a machine learning model for automated decision making. Through iterative model optimization based on Bayesian optimization, a substantial improvement in response properties is achieved and thus expanded the scope of material properties obtainable within the chemical space of hydrogels in the study. It is therefore demonstrated that the potential of combining miniaturized high-throughput experiments with smart optimization algorithms for cost and time efficient optimization of materials properties.
Collapse
Affiliation(s)
- Maximilian Seifermann
- Institute of Biological and Chemical Systems-Functional Molecular Systems, Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
| | - Patrick Reiser
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131, Karlsruhe, Germany
| | - Pascal Friederich
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131, Karlsruhe, Germany
| | - Pavel A Levkin
- Institute of Biological and Chemical Systems-Functional Molecular Systems, Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344, Eggenstein-Leopoldshafen, Germany
- Institute of Organic Chemistry, Karlsruhe Institute of Technology, Fritz-Haber-Weg 6, Karlsruhe, Germany
| |
Collapse
|
42
|
Huang B, von Rudorff GF, von Lilienfeld OA. The central role of density functional theory in the AI age. Science 2023; 381:170-175. [PMID: 37440654 DOI: 10.1126/science.abn3445] [Citation(s) in RCA: 39] [Impact Index Per Article: 19.5] [Reference Citation Analysis] [Abstract] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 04/27/2023] [Accepted: 05/30/2023] [Indexed: 07/15/2023]
Abstract
Density functional theory (DFT) plays a pivotal role in chemical and materials science because of its relatively high predictive power, applicability, versatility, and computational efficiency. We review recent progress in machine learning (ML) model developments, which have relied heavily on DFT for synthetic data generation and for the design of model architectures. The general relevance of these developments is placed in a broader context for chemical and materials sciences. DFT-based ML models have reached high efficiency, accuracy, scalability, and transferability and pave the way to the routine use of successful experimental planning software within self-driving laboratories.
Collapse
Affiliation(s)
- Bing Huang
- University of Vienna, Faculty of Physics, AT1090 Wien, Austria
| | - Guido Falk von Rudorff
- University Kassel, Department of Chemistry, 34132 Kassel, Germany
- Center for Interdisciplinary Nanostructure Science and Technology (CINSaT), 34132 Kassel, Germany
| | - O Anatole von Lilienfeld
- Vector Institute for Artificial Intelligence, Toronto, Ontario M5S 1M1, Canada
- Department of Chemistry, University of Toronto, St. George Campus, Toronto, Ontario M5S 3H6, Canada
- Department of Materials Science and Engineering, University of Toronto, St. George Campus, Toronto, Ontario M5S 3E4, Canada
- Department of Physics, University of Toronto, St. George Campus, Toronto, Ontario M5S 1A7, Canada
- Machine Learning Group, Technische Universität Berlin and Berlin Institute for the Foundations of Learning and Data, 10587 Berlin, Germany
| |
Collapse
|
43
|
Sawicki J, Berner R, Loos SAM, Anvari M, Bader R, Barfuss W, Botta N, Brede N, Franović I, Gauthier DJ, Goldt S, Hajizadeh A, Hövel P, Karin O, Lorenz-Spreen P, Miehl C, Mölter J, Olmi S, Schöll E, Seif A, Tass PA, Volpe G, Yanchuk S, Kurths J. Perspectives on adaptive dynamical systems. CHAOS (WOODBURY, N.Y.) 2023; 33:071501. [PMID: 37486668 DOI: 10.1063/5.0147231] [Citation(s) in RCA: 18] [Impact Index Per Article: 9.0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 02/20/2023] [Accepted: 05/24/2023] [Indexed: 07/25/2023]
Abstract
Adaptivity is a dynamical feature that is omnipresent in nature, socio-economics, and technology. For example, adaptive couplings appear in various real-world systems, such as the power grid, social, and neural networks, and they form the backbone of closed-loop control strategies and machine learning algorithms. In this article, we provide an interdisciplinary perspective on adaptive systems. We reflect on the notion and terminology of adaptivity in different disciplines and discuss which role adaptivity plays for various fields. We highlight common open challenges and give perspectives on future research directions, looking to inspire interdisciplinary approaches.
Collapse
Affiliation(s)
- Jakub Sawicki
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Rico Berner
- Department of Physics, Humboldt-Universität zu Berlin, Newtonstraße 15, 12489 Berlin, Germany
| | - Sarah A M Loos
- DAMTP, University of Cambridge, Wilberforce Road, Cambridge CB3 0WA, United Kingdom
| | - Mehrnaz Anvari
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Fraunhofer Institute for Algorithms and Scientific Computing, Schloss Birlinghoven, 53757 Sankt-Augustin, Germany
| | - Rolf Bader
- Institute of Systematic Musicology, University of Hamburg, Hamburg, Germany
| | - Wolfram Barfuss
- Transdisciplinary Research Area: Sustainable Futures, University of Bonn, 53113 Bonn, Germany
- Center for Development Research (ZEF), University of Bonn, 53113 Bonn, Germany
| | - Nicola Botta
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Computer Science and Engineering, Chalmers University of Technology, 412 96 Göteborg, Sweden
| | - Nuria Brede
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Computer Science, University of Potsdam, An der Bahn 2, 14476 Potsdam, Germany
| | - Igor Franović
- Scientific Computing Laboratory, Center for the Study of Complex Systems, Institute of Physics Belgrade, University of Belgrade, Pregrevica 118, 11080 Belgrade, Serbia
| | - Daniel J Gauthier
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
| | - Sebastian Goldt
- Department of Physics, International School of Advanced Studies (SISSA), Trieste, Italy
| | - Aida Hajizadeh
- Research Group Comparative Neuroscience, Leibniz Institute for Neurobiology, Magdeburg, Germany
| | - Philipp Hövel
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
| | - Omer Karin
- Department of Mathematics, Imperial College London, London SW7 2AZ, United Kingdom
| | - Philipp Lorenz-Spreen
- Center for Adaptive Rationality, Max Planck Institute for Human Development, Lentzeallee 94, 14195 Berlin, Germany
| | - Christoph Miehl
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Jan Mölter
- Department of Mathematics, School of Computation, Information and Technology, Technical University of Munich, Boltzmannstraße 3, 85748 Garching bei München, Germany
| | - Simona Olmi
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Eckehard Schöll
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Akademie Basel, Fachhochschule Nordwestschweiz FHNW, Leonhardsstrasse 6, 4009 Basel, Switzerland
| | - Alireza Seif
- Pritzker School of Molecular Engineering, The University of Chicago, Chicago, Illinois 60637, USA
| | - Peter A Tass
- Department of Neurosurgery, Stanford University School of Medicine, Stanford, California 94304, USA
| | - Giovanni Volpe
- Department of Physics, University of Gothenburg, Gothenburg, Sweden
| | - Serhiy Yanchuk
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Physics, Humboldt-Universität zu Berlin, Newtonstraße 15, 12489 Berlin, Germany
| | - Jürgen Kurths
- Potsdam Institute for Climate Impact Research, Telegrafenberg, 14473 Potsdam, Germany
- Department of Physics, Humboldt-Universität zu Berlin, Newtonstraße 15, 12489 Berlin, Germany
| |
Collapse
|
44
|
Cluitmans M, Walton R, Plank G. Editorial: Computational methods in cardiac electrophysiology. Front Physiol 2023; 14:1231342. [PMID: 37435304 PMCID: PMC10332857 DOI: 10.3389/fphys.2023.1231342] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Key Words] [Track Full Text] [Figures] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/30/2023] [Accepted: 06/12/2023] [Indexed: 07/13/2023] Open
Affiliation(s)
- Matthijs Cluitmans
- Department of Cardiology, Cardiovascular Research Institute Maastricht, Maastricht University, Maastricht, Netherlands
| | - Richard Walton
- INSERM Institut de Rythmologie et Modélisation Cardiaque (IHU-Liryc), Pessac, Aquitaine, France
| | - Gernot Plank
- Gottfried Schatz Research Center for Cellular Signaling, Metabolism and Aging, Medical University of Graz, Graz, Styria, Austria
| |
Collapse
|
45
|
Corrales-Hernández MG, Villarroel-Hagemann SK, Mendoza-Rodelo IE, Palacios-Sánchez L, Gaviria-Carrillo M, Buitrago-Ricaurte N, Espinosa-Lugo S, Calderon-Ospina CA, Rodríguez-Quintana JH. Development of Antiepileptic Drugs throughout History: From Serendipity to Artificial Intelligence. Biomedicines 2023; 11:1632. [PMID: 37371727 DOI: 10.3390/biomedicines11061632] [Citation(s) in RCA: 8] [Impact Index Per Article: 4.0] [Reference Citation Analysis] [Abstract] [Key Words] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 05/07/2023] [Revised: 05/24/2023] [Accepted: 05/31/2023] [Indexed: 06/29/2023] Open
Abstract
This article provides a comprehensive narrative review of the history of antiepileptic drugs (AEDs) and their development over time. Firstly, it explores the significant role of serendipity in the discovery of essential AEDs that continue to be used today, such as phenobarbital and valproic acid. Subsequently, it delves into the historical progression of crucial preclinical models employed in the development of novel AEDs, including the maximal electroshock stimulation test, pentylenetetrazol-induced test, kindling models, and other animal models. Moving forward, a concise overview of the clinical advancement of major AEDs is provided, highlighting the initial milestones and the subsequent refinement of this process in recent decades, in line with the emergence of evidence-based medicine and the implementation of increasingly rigorous controlled clinical trials. Lastly, the article explores the contributions of artificial intelligence, while also offering recommendations and discussing future perspectives for the development of new AEDs.
Collapse
Affiliation(s)
- María Gabriela Corrales-Hernández
- Pharmacology Unit, Department of Biomedical Sciences, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá 111221, Colombia
| | - Sebastián Kurt Villarroel-Hagemann
- Pharmacology Unit, Department of Biomedical Sciences, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá 111221, Colombia
| | | | - Leonardo Palacios-Sánchez
- Neuroscience Research Group (NeURos), NeuroVitae Center for Neuroscience, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá 111221, Colombia
| | - Mariana Gaviria-Carrillo
- Neuroscience Research Group (NeURos), NeuroVitae Center for Neuroscience, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá 111221, Colombia
| | | | - Santiago Espinosa-Lugo
- Pharmacology Unit, Department of Biomedical Sciences, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá 111221, Colombia
| | - Carlos-Alberto Calderon-Ospina
- Pharmacology Unit, Department of Biomedical Sciences, School of Medicine and Health Sciences, Universidad del Rosario, Bogotá 111221, Colombia
- Research Group in Applied Biomedical Sciences (UR Biomed), School of Medicine and Health Sciences, Universidad del Rosario, Bogotá 111221, Colombia
| | - Jesús Hernán Rodríguez-Quintana
- Fundacion CardioInfantil-Instituto de Cardiologia, Calle 163a # 13B-60, Bogotá 111156, Colombia
- Hospital Universitario Mayor Mederi, Calle 24 # 29-45, Bogotá 111411, Colombia
| |
Collapse
|
46
|
Wu YD, Zhu Y, Bai G, Wang Y, Chiribella G. Quantum Similarity Testing with Convolutional Neural Networks. PHYSICAL REVIEW LETTERS 2023; 130:210601. [PMID: 37295121 DOI: 10.1103/physrevlett.130.210601] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Track Full Text] [Subscribe] [Scholar Register] [Received: 11/04/2022] [Revised: 04/08/2023] [Accepted: 04/25/2023] [Indexed: 06/12/2023]
Abstract
The task of testing whether two uncharacterized quantum devices behave in the same way is crucial for benchmarking near-term quantum computers and quantum simulators, but has so far remained open for continuous variable quantum systems. In this Letter, we develop a machine learning algorithm for comparing unknown continuous variable states using limited and noisy data. The algorithm works on non-Gaussian quantum states for which similarity testing could not be achieved with previous techniques. Our approach is based on a convolutional neural network that assesses the similarity of quantum states based on a lower-dimensional state representation built from measurement data. The network can be trained off-line with classically simulated data from a fiducial set of states sharing structural similarities with the states to be tested, with experimental data generated by measurements on the fiducial states, or with a combination of simulated and experimental data. We test the performance of the model on noisy cat states and states generated by arbitrary selective number-dependent phase gates. Our network can also be applied to the problem of comparing continuous variable states across different experimental platforms, with different sets of achievable measurements, and to the problem of experimentally testing whether two states are equivalent up to Gaussian unitary transformations.
Collapse
Affiliation(s)
- Ya-Dong Wu
- Department of Computer Science, QICI Quantum Information and Computation Initiative, The University of Hong Kong, Pokfulam Road, Hong Kong
| | - Yan Zhu
- Department of Computer Science, QICI Quantum Information and Computation Initiative, The University of Hong Kong, Pokfulam Road, Hong Kong
| | - Ge Bai
- Centre for Quantum Technologies, National University of Singapore, Block S15, 3 Science Drive 2, 117543, Singapore
| | - Yuexuan Wang
- Department of Computer Science, AI Technology Laboratory, The University of Hong Kong, Pokfulam Road, Hong Kong
- College of Computer Science and Technology, Zhejiang University, Zhejiang Province 310058, China
| | - Giulio Chiribella
- Department of Computer Science, QICI Quantum Information and Computation Initiative, The University of Hong Kong, Pokfulam Road, Hong Kong
- Department of Computer Science, Parks Road, Oxford OX1 3QD, United Kingdom
- Perimeter Institute for Theoretical Physics, Waterloo, Ontario N2L 2Y5, Canada
| |
Collapse
|
47
|
Wellawatte G, Gandhi HA, Seshadri A, White AD. A Perspective on Explanations of Molecular Prediction Models. J Chem Theory Comput 2023; 19:2149-2160. [PMID: 36972469 PMCID: PMC10134429 DOI: 10.1021/acs.jctc.2c01235] [Citation(s) in RCA: 21] [Impact Index Per Article: 10.5] [Reference Citation Analysis] [Abstract] [Grants] [Track Full Text] [Journal Information] [Subscribe] [Scholar Register] [Received: 12/06/2022] [Indexed: 03/29/2023]
Abstract
Chemists can be skeptical in using deep learning (DL) in decision making, due to the lack of interpretability in "black-box" models. Explainable artificial intelligence (XAI) is a branch of artificial intelligence (AI) which addresses this drawback by providing tools to interpret DL models and their predictions. We review the principles of XAI in the domain of chemistry and emerging methods for creating and evaluating explanations. Then, we focus on methods developed by our group and their applications in predicting solubility, blood-brain barrier permeability, and the scent of molecules. We show that XAI methods like chemical counterfactuals and descriptor explanations can explain DL predictions while giving insight into structure-property relationships. Finally, we discuss how a two-step process of developing a black-box model and explaining predictions can uncover structure-property relationships.
Collapse
Affiliation(s)
- Geemi
P. Wellawatte
- Department
of Chemistry, University of Rochester, Rochester, New York 14627, United States
| | - Heta A. Gandhi
- Department
of Chemical Engineering, University of Rochester, Rochester, New York 14627, United States
| | - Aditi Seshadri
- Department
of Chemical Engineering, University of Rochester, Rochester, New York 14627, United States
| | - Andrew D. White
- Department
of Chemical Engineering, University of Rochester, Rochester, New York 14627, United States
| |
Collapse
|
48
|
Raubitzek S, Mallinger K, Neubauer T. Combining Fractional Derivatives and Machine Learning: A Review. ENTROPY (BASEL, SWITZERLAND) 2022; 25:35. [PMID: 36673176 PMCID: PMC9858603 DOI: 10.3390/e25010035] [Citation(s) in RCA: 0] [Impact Index Per Article: 0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Figures] [Subscribe] [Scholar Register] [Received: 12/07/2022] [Revised: 12/20/2022] [Accepted: 12/21/2022] [Indexed: 06/17/2023]
Abstract
Fractional calculus has gained a lot of attention in the last couple of years. Researchers have discovered that processes in various fields follow fractional dynamics rather than ordinary integer-ordered dynamics, meaning that the corresponding differential equations feature non-integer valued derivatives. There are several arguments for why this is the case, one of which is that fractional derivatives inherit spatiotemporal memory and/or the ability to express complex naturally occurring phenomena. Another popular topic nowadays is machine learning, i.e., learning behavior and patterns from historical data. In our ever-changing world with ever-increasing amounts of data, machine learning is a powerful tool for data analysis, problem-solving, modeling, and prediction. It has provided many further insights and discoveries in various scientific disciplines. As these two modern-day topics hold a lot of potential for combined approaches in terms of describing complex dynamics, this article review combines approaches from fractional derivatives and machine learning from the past, puts them into context, and thus provides a list of possible combined approaches and the corresponding techniques. Note, however, that this article does not deal with neural networks, as there is already extensive literature on neural networks and fractional calculus. We sorted past combined approaches from the literature into three categories, i.e., preprocessing, machine learning and fractional dynamics, and optimization. The contributions of fractional derivatives to machine learning are manifold as they provide powerful preprocessing and feature augmentation techniques, can improve physically informed machine learning, and are capable of improving hyperparameter optimization. Thus, this article serves to motivate researchers dealing with data-based problems, to be specific machine learning practitioners, to adopt new tools, and enhance their existing approaches.
Collapse
Affiliation(s)
- Sebastian Raubitzek
- Data Science Research Unit, TU Wien, Favoritenstrasse 9-11/194, 1040 Vienna, Austria
| | | | | |
Collapse
|
49
|
Reiser P, Neubert M, Eberhard A, Torresi L, Zhou C, Shao C, Metni H, van Hoesel C, Schopmans H, Sommer T, Friederich P. Graph neural networks for materials science and chemistry. COMMUNICATIONS MATERIALS 2022; 3:93. [PMID: 36468086 PMCID: PMC9702700 DOI: 10.1038/s43246-022-00315-6] [Citation(s) in RCA: 132] [Impact Index Per Article: 44.0] [Reference Citation Analysis] [Abstract] [Key Words] [Grants] [Track Full Text] [Download PDF] [Figures] [Subscribe] [Scholar Register] [Received: 05/10/2022] [Accepted: 11/07/2022] [Indexed: 05/14/2023]
Abstract
Machine learning plays an increasingly important role in many areas of chemistry and materials science, being used to predict materials properties, accelerate simulations, design new structures, and predict synthesis routes of new materials. Graph neural networks (GNNs) are one of the fastest growing classes of machine learning models. They are of particular relevance for chemistry and materials science, as they directly work on a graph or structural representation of molecules and materials and therefore have full access to all relevant information required to characterize materials. In this Review, we provide an overview of the basic principles of GNNs, widely used datasets, and state-of-the-art architectures, followed by a discussion of a wide range of recent applications of GNNs in chemistry and materials science, and concluding with a road-map for the further development and application of GNNs.
Collapse
Affiliation(s)
- Patrick Reiser
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| | - Marlen Neubert
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
| | - André Eberhard
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
| | - Luca Torresi
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
| | - Chen Zhou
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
| | - Chen Shao
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
- Present Address: Institute for Applied Informatics and Formal Description Systems, Karlsruhe Institute of Technology, Kaiserstr. 89, 76133 Karlsruhe, Germany
| | - Houssam Metni
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
- ECPM, Université de Strasbourg, 25 Rue Becquerel, 67087 Strasbourg, France
| | - Clint van Hoesel
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
- Department of Applied Physics, Eindhoven University of Technology, Groene Loper 19, 5612 AP Eindhoven, The Netherlands
| | - Henrik Schopmans
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| | - Timo Sommer
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
- Institute for Theory of Condensed Matter, Karlsruhe Institute of Technology, Wolfgang-Gaede-Str. 1, 76131 Karlsruhe, Germany
- Present Address: School of Chemistry, Trinity College Dublin, College Green, Dublin 2, Ireland
| | - Pascal Friederich
- Institute of Theoretical Informatics, Karlsruhe Institute of Technology, Am Fasanengarten 5, 76131 Karlsruhe, Germany
- Institute of Nanotechnology, Karlsruhe Institute of Technology, Hermann-von-Helmholtz-Platz 1, 76344 Eggenstein-Leopoldshafen, Germany
| |
Collapse
|